- 16 3月, 2022 9 次提交
-
-
由 xiongkun 提交于
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
* add grid sample phi kernel * add grid sample phi kernel and remove original kernel * replace mutable_data by alloc
-
由 qipengh 提交于
-
由 Zhanlue Yang 提交于
-
由 Chen Weihang 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Support the auto completion of while_op * [Auto Parallel] Improve the completion algorithms * [Auto Parallel] Fix bugs for ernie inference * [Auto Parallel] Remove attrs which cannot be pickled * [Auto Parallel] make the dims_mappings of LodTensorArray vars empty * [Auto Parallel] Fix bugs for the ernie inference in the pipeline parallel * [Auto Parallel] Remove unncessary comments * [Auto Parallel] Fix a bug of the CMakeLists * [Auto Parallel] Use the newest APIs to write the unit test * [Auto Parallel] Remove unnecessary statements
-
由 xiongkun 提交于
-
由 王明冬 提交于
* change some symbol names * add test * add phi to opt.cc * clean code * up * update * up * up * Update pten_pass.mlir * Update convolution_grad_kernel.cc * update * restore init_infrt_dialects * restore * up * up * up Co-authored-by: NSuperjomn <yanchunwei@outlook.com>
-
- 15 3月, 2022 28 次提交
-
-
由 Chen Weihang 提交于
* add determinant phi kernel * remove original determinant op kernel * add determinant grad [hi kernel * fix determinant test failed * remove original determinant grad op kernel
-
由 Chen Weihang 提交于
-
由 Liu-xiandong 提交于
* [phi] move matrix_power op * MatrixInverse fluid -> phi * modify the CMake to fix compile bug * delete useless comment * mutable memory -> phi Alloc * modify the include file * modify the include file * fix bug in CI compiler * [phi]modify the shape OP and move inferMeta of shape,matrix_pow,multi_dot * delete useless comment * fix bug in CI * modify after review
-
由 Roc 提交于
* add expert count op add ut for expert_count * update UT only for cuda * fix for rocm * update ut * add moe module * add expert count op add ut for expert_count * update UT only for cuda * update ut * add moe module * make expert count private * rename expert count op Co-authored-by: Nhlygit66666 <2570058140@qq.com>
-
由 xiongkun 提交于
* run python api in eager model and filter the out in argument list * fix code
-
由 Zhanlue Yang 提交于
* Fixed issues with generated scale operator * Fixed minor issues
-
由 furnace 提交于
* [NPU] add AMP O1 support * [NPU] fix NOTE and warnings
-
由 Chen Weihang 提交于
* add phi gather kernel * update year * remove original gather opkernel * add gather grad phi kernels * remove origin gather grad kernel * fix failed npu and xpu * fix xpu compile failed
-
由 Jacek Czaja 提交于
* - Prototype of third solution - fix - compilation fixes - fix - fixe - fix - fix - compilation fix - comment fix - lint update mkldnn conv_elementwise_add_fuse_pass ut - NHWC changes to prelu - alhpa dims - UT fix - fix to UT - lint - Some fixes - added to BWD of prelu NHWC support - reverted removal of resetting cu_layout in clearing of caching * - Small changes * - compilation fix * - fix * - fix * lint * - fixes after internal review * - compilation fix * - lint
-
由 zhouweiwei2014 提交于
-
由 Thunderbrook 提交于
* shard_id * format
-
由 xiongkun 提交于
[phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770) * tranfer and pass the lgamma unittest * merge and pass the test * transfer kldiv_loss and kldiv_loss_grad; pass the unitest * trafer the isclose and cumprod kernel * change PT_REGISTER -> PD_REGISTER * fix by code review * fix by code review * fix * remove enforce include dependence from scalar * fix * fix by code review * fix by code review
-
由 Chen Weihang 提交于
-
由 chentianyu03 提交于
* add reduce_min kernel * remove raw reduce_min kernel * add reduce min * add reduce any all impl * add bool reduce Kernel * remove raw any/all kernel * add any all kernel * rm comment
-
由 Zhanlue Yang 提交于
* Added more signposts to dygraph profiling * Fixed minor issues * Refactored signpost names * Fixed typo * Removed debug codes * Fixed typo * Adjusted signpost names * Fixed issues from branch merge
-
由 hong 提交于
* move one hot to phi; test=develop * fix bugs; test=develop * fix bugs; test=develop * add infer meta; test=develop * fix bugs; test=develop * resolve confilct * resolve confilct * fix bug; * fix error; test=develop * update; test=develop * polish code; test=develop * add one api in eager mode; test=develop * add one hot test; test=develop * remove use less code; test=develop * fix bug; test=develop * polish code; test=develop * polish code; test=develop
-
由 ronnywang 提交于
-
由 Chang Xu 提交于
-
由 YuanRisheng 提交于
* move activation op * adjust code format * fix compile bugs * fix ci bugs * code format adjust * code format adjust2 * activate ci status * modify according to comment * move activation kernel * revert relu6 * reduce add code * perfect use_phi_functor * completing func name * fix bugs when run ci * fix bugs when run infr * modifpy infrt get kernel signature
-
由 qipengh 提交于
-
由 Allen Guo 提交于
* add ci * rm retry tests * format * restore retry tests * update timeout for ipu uts
-
由 Zhang Zheng 提交于
-
由 Haohongxiang 提交于
* refactor reducer * modify cmakelists * solve conflicts * rename group and update process_group * fix bugs of ProcessGroupNCCL * modify for CIs * refactoring reducer
-
由 石晓伟 提交于
-
由 crystal 提交于
-
由 zyfncg 提交于
* change the exception of getitem from pybind type to PADDLE_ENFORCE * fix bug * remove pybind::index_error exception
-
由 Zhang Zheng 提交于
* first commit * fix * fix * fix compile eeror * fix * fix complex * fix * fix * fix npu * fix * modify accroding to comments * fix
-
由 Chen Weihang 提交于
-
- 14 3月, 2022 3 次提交
-
-
由 Siming Dai 提交于
* Add diag grad kernel * fix unittest case * add float16, remove const & * delete diag_grad in op_utils.h
-
由 zyfncg 提交于
* move set_value_grad kernel form fluid to phi * add unittest for passing coverage ci
-
由 Tomasz Socha 提交于
* Add elementwise add and activation fuse pass * Fix copy ellision * More flexible pattern detector * More flexible fusion pass * Update lists for pass * Add support for Pow operator * Add support for more activation types * Style * Rename fusion pass * First version of tests * Dirty version of pass * Polished version * Update pbtxt * Style * Update names * Style * Use PADDLE_ENFORCE_EQ * Save error message to variable * WO for error checks * CR * Static style check * Add missing 'activation_scale' attribute * Add relu6 and sigmoid activations * Style * Fix fuse list formating * Sync filenames for fuse pass files * Fix cmake after move * Fix registration * Fix pass name in tests * Add missing activations to checker * WIPS * Working mul op * Working sub * Working Add * Remove pten includes * Remove some forward declarations * Remove Includes * Fixes * Remove default kernels * Add check if post_ops attributes are avaliable * Style * Code adjustment * Register default kernels * We have year 2022 not 2021... Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com> * Fast review fixes Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com> * Review Fix * Rename one_dnn -> onednn * Style after review * Fast and dirty fix for quantization * Update tests * Style * Fix mkldnn_quantizer config * Add Joanna's suggestion. * Check if operator is explicitly disables on OneDNN * Try to use unregistered attributes * Style * Test new framework * FXI * FXII * Update test * Style Co-authored-by: Njakpiase <jakpia21@gmail.com> Co-authored-by: NSylwester Fraczek <sylwester.fraczek@intel.com>
-