- 11 Jan, 2023 7 commits
-
-
Committed by Yiqun Liu
* Implement a common PointerArray. * Polish code. * Add the missing header file include. * Add the branch of kFix8. * Fix compiling error. * Add alignas hint to fix the performance drop. * Optimize the H2D copy in stack_grad. * Rename the macro. * Fix align hint for different compilers. * Polish the definition of PADDLE_ALIGN. * Fix compiling error. * Remove the align hint on Windows.
-
Committed by zhangxin81
* fix paddle_infer_contrib include
-
Committed by niuliling123
-
Committed by kangguangli
-
Committed by duanyanhui
-
Committed by wenbin
* compile fix * fix compile * compile fix
-
Committed by Wangzheee
-
- 10 Jan, 2023 15 commits
-
-
Committed by limingshu
* add stack grad kernel optimization * add basic optimization kernel for stack_grad_kernel * optimization of stack_grad_kernel for last dim stack and change code format with pre-commit
-
Committed by Wen Sun
* refactor: gloo comm context migration * fix: headers & avoid mutable_data usage * fix: cmake gloo dep * style: rename funcs * refactor: move to new files * fix: gloo deps * refactor: simplify create device
-
Committed by FlyingQianMM
* [Zero-Dim] support input 0D Tensor for maximum,minimum,allclose,sigmoid_focal_loss * [Zero-Dim] add backward test for sigmoid_focal_loss with 0-D input Tensor
-
Committed by xiaoxiaohehe001
-
Committed by xiaoxiaohehe001
-
Committed by xiaoxiaohehe001
-
Committed by Ryan
* try sequence_padding * fix can't use mutable_data * fix mistake fluid_sequence_scale.hh/CMakeLists.t include * fix namespace bug * fix framework::ToAbsOffset not found * fix codestyle
-
Committed by wangxiaoning
* fix graph_brpc_server.h code style * fix ps_local_client.h * fix ps_local_client.cc
-
Committed by 张春乔
-
Committed by Yuang Liu
-
Committed by Sanbu
-
Committed by wenbin
* gn bug fix * bug fix * gn bug fix
-
Committed by Jiabin Yang
* prototype of composite grad in paddle * prototype of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify ignore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw exception when neither GrapOpMaker nor GradCompOpMaker has been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singleton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove additional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions * remove duplicated cmake * fix cmake and codegen problem Co-authored-by: zyfncg <zhangyunfei07@baidu.com> Co-authored-by: wangruting <wangruting@baidu.com> Co-authored-by: cxxly <chenxx_id@163.com> Co-authored-by: charles-hit <wanghao107@baidu.com> Co-authored-by: xiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
Committed by MarDino
-
Committed by MarDino
-
- 09 Jan, 2023 17 commits
-
-
Committed by chalsliu
-
Committed by chalsliu
-
Committed by Yuanle Liu
-
Committed by wangzhen38
-
Committed by MarDino
* add concat optimization * refine * remove annotation * use alignas instead of aligned_storage
-
Committed by wenbin
* skip_groupnorm * init * preln * add ut * more assert * set timeout * fix windows ci issue
-
Committed by Hulek
* Rewritten * change mkldnn to onednn * fix cmake name
-
Committed by wangzhen38
-
Committed by HappyHeavyRain
* support the drop_empty_grad in backward * change code according to yunfei's review suggestion
-
Committed by QingshuChen
-
Committed by ykkk2333
* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun * fix dlrm throughput problem, test=kunlun * add xpu einsum, fill_diagonal, and diagonal kernels, test=kunlun
-
Committed by gem5
-
Committed by Jiabin Yang
* prototype of composite grad in paddle * prototype of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify ignore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw exception when neither GrapOpMaker nor GradCompOpMaker has been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singleton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove additional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: zyfncg <zhangyunfei07@baidu.com> Co-authored-by: wangruting <wangruting@baidu.com> Co-authored-by: cxxly <chenxx_id@163.com> Co-authored-by: charles-hit <wanghao107@baidu.com> Co-authored-by: xiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
Committed by HongyuJia
-
Committed by wangzhen38
-
Committed by 骑马小猫
* remove duplicate element_size * remove tensor_methon_* method
-
Committed by LiYuRio
* comm_context and static init * refactor: move to phi/core/distributed * refactor: avoid mutable_data usage * fix: windows sock * fix: device without nccl Co-authored-by: Wen Sun <syl1887415157@126.com>
-
- 07 Jan, 2023 1 commit
-
-
Committed by Ruibiao Chen
* Enable standalone executor for fleet training * Update code * Replace use_standalone_executor utils in auto parallel * Update code * Disable standalone executor for test_pass_sharding * Update code * Set sequential run for auto parallel * Fix dist_attr bug * Set sequential run for auto parallel
-