- 13 2月, 2023 1 次提交
-
-
由 RedContritio 提交于
* support size 0 dot input * prevent div 0 in grad * add unittest * remove unnecessary vlog * add unittests
-
- 12 2月, 2023 1 次提交
-
-
由 Xiaoxu Chen 提交于
-
- 11 2月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* init commit * fix tensor operator* * fix compile bug * bug reproduce * update commit * polish codes * fix compile bug * test begin * test begin * compile finish * restore origin composite_backward_api * pass local CI * fix merge error * fix merge error * change py_test from GPU->CPU, test custom op * polish codes, modify prim unittest * modify prim unittest * determine phi_tensor_operants location * polish codes * add header file * solve windows unresolved symbol * fix some CI error * add overload defination * fix CI inference and Windows * polish codes according to reviewers' opinion * polish codes according to reviewers' opinion
-
- 10 2月, 2023 9 次提交
-
-
由 umiswing 提交于
-
由 Leo Guo 提交于
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
-
由 ykkk2333 提交于
-
由 Infinity_lee 提交于
-
由 RedContritio 提交于
* add dim check in scatter * add check in scatter.cu * add unittest * remove unnecessary log and comment --------- Co-authored-by: RedContritio <>
-
由 zhupengyang 提交于
-
由 Huang Jiyi 提交于
* remove AllocatorFacade in phi * fix include * fix bugs
-
由 Huang Jiyi 提交于
* rm gradient_accumulator in phi * update
-
由 wangshengxiang 提交于
-
- 09 2月, 2023 6 次提交
-
-
由 Leo Guo 提交于
-
由 Huang Jiyi 提交于
* decouple strided_memcpy * move strided_memcpy * move strided_memcpy to phi * fix namespace * update * fix gpu compile bugs
-
由 Huang Jiyi 提交于
-
由 yuehuayingxueluo 提交于
* add multi_tenosr_adam * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py * fix adam.py optimizer.py * fix adamw.py * fix test_multi_tensor_adam.py * fix CI bug * fix CI coverage * fix ci bug * fix betapow * fix some bugs * fix test_adamw_op.py * fix CI coverage * fix multi_tensor_adam_kernel.cc * fix CI bug * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py * fix code style * update C++ parts * remove python parts modification temporarily * add C++ ut * update betapow copy code logic * fix ci ut * fix windows ci * fix coverage ci * improve coverage rate --------- Co-authored-by: Nsneaxiy <sneaxiy@126.com>
-
由 zhangyikun02 提交于
-
由 傅剑寒 提交于
-
- 08 2月, 2023 7 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
由 HongyuJia 提交于
-
由 zhangkaihuo 提交于
* fix bn performance degradation
-
由 HongyuJia 提交于
* support unsigned int and bfloat16 * update unit test * update DenseTensor datatype * unsupport more datatype of mutable_data(Place) * fix unittest
-
由 Zhong Hui 提交于
-
由 Huang Jiyi 提交于
-
由 YuanRisheng 提交于
* unify_kernel * fix compile bugs * modify macro name * perfect code according comment * fix compile bugs * fix compile bugs * fix ci bugs * fix ci bug * fix ci bugs * fix ci bugs * modify code according comment * rm conv_fusion_op
-
- 07 2月, 2023 5 次提交
-
-
由 zyfncg 提交于
* remove axis in some elementwise api * fix inplace bug eager-gen * fix bug * revert change for CheckInplace * polish code
-
由 张春乔 提交于
-
由 cyber-pioneer 提交于
move composite test case remove unuseful var add composite op blacklist
-
由 chalsliu 提交于
-
由 Yuang Liu 提交于
-
- 06 2月, 2023 7 次提交
-
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
* remove extra input of conv2d * fix bug * fix unittest bug * adjust conv2d.pbtxt * fix cpu_quantize_pass_tester * revert use_addto of conv2d * fix runtime attribute * fix bug * recover force_fp32_output in conv2d * refine error info * fix bug
-
由 张春乔 提交于
-
由 houj04 提交于
-
由 duanboqiang 提交于
-
由 engineer1109 提交于
-
由 risemeup1 提交于
-
- 03 2月, 2023 3 次提交
-
-
由 RedContritio 提交于
* support negative index in unique_consecutive * add unittest * add unittest
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 RedContritio 提交于
-