- 31 5月, 2021 4 次提交
-
-
由 whs 提交于
-
由 XiangGao 提交于
* improve group norm cpu precision and performance * add unit test to group norm
-
由 Aurelius84 提交于
-
由 wangguanzhong 提交于
* support params groups, test=develop * simplify updating opt attr * update according to review
-
- 29 5月, 2021 1 次提交
-
-
由 Guanghua Yu 提交于
* fix uint8 check in assign * fix assign unittests * fix xpu test_assign
-
- 28 5月, 2021 3 次提交
-
-
由 wenbin 提交于
* add more conditions * dynamic shape * ut * correct contidions * commnent * remove rebandadnt op type * remove rebandant if
-
由 Aurelius84 提交于
* Add GLIBCXX_USE_CXX11_ABI flag * fix typo * fix typo
-
由 levi131 提交于
* run pre-commit * use HOST or DEVICE instead of HOSTDEVICE in implementation of lgamma op * add test for fp32 * add lgamma to op_threshold_white_list * add cuda kernel for lgamma kernel * modify numeric grad delta * fix small English issue * change LaunchElementwiseCudaKernel to LaunchSameDimsElementwiseCudaKernel
-
- 27 5月, 2021 4 次提交
-
-
由 wanghuancoder 提交于
* modify API nn.Bilinear's doc, test=develop * speed up paddle.add paddle.nn.Linear, test=develop * fix bug, test=develop
-
由 Qi Li 提交于
-
由 cc 提交于
-
由 Thunderbrook 提交于
* support ssd in PsCore * remove log * remove bz2 * defalut value * code style * parse table class * code style * add define
-
- 26 5月, 2021 6 次提交
-
-
由 tangwei12 提交于
Change-Id: I2e82dfcee6a1d0512b94cebc32281123fa5bf597 * pretty print for datafeed error Change-Id: I056a8b6f03608e96679a83846c97aed289cef7e6 * fix fleet dist infer ut
-
由 Leo Chen 提交于
* refine ~npuOpRunner * implement destructor and forbid copy * use reference to avoid copy * use const reference * relax adam precision * fix top_k
-
由 Yuang Liu 提交于
-
由 Zhanlue Yang 提交于
Sigmoid: Out = Sigmoid(X) SigmoidGrad: DX = DOut*(1-Out)*Out [This Patch] Out DOut -> SigmoidGradGrad -> DOutNew DDX DDOut DDOut = (1-Out)*Out*DDX DOutNew = (1-2*Out)*DOut*DDX
-
由 JZ-LIANG 提交于
-
由 jakpiase 提交于
* added op cast functionality for fp32/bf16 * added newline * added entries in static mode white list and unity build * fixed failing tests * changes after review * added formatting * upgraded tests file as reviewer suggested * changes after review * minor change
-
- 25 5月, 2021 4 次提交
-
-
由 Ming-Xu Huang 提交于
-
由 ShenLiang 提交于
-
由 tangwei12 提交于
* add async save for sparse table * add load profiler for sparse table * add load info for sparse table
-
由 jakpiase 提交于
-
- 24 5月, 2021 3 次提交
-
-
由 ShenLiang 提交于
* fix pipeline * fix mp pp dp * fix utest of hybrid parallel * add utest for tuple
-
由 wangguanzhong 提交于
-
由 gongweibao 提交于
-
- 22 5月, 2021 2 次提交
-
-
由 wangguanzhong 提交于
-
由 jakpiase 提交于
* added support for most matmul cases * added more functionality * full functionality of matmul op, fp32 only * added bf16 tests and functionality * added formatting * changes after review * minor change * added reviewers suggestions
-
- 21 5月, 2021 3 次提交
-
-
由 chentianyu03 提交于
* replace complex in set tensor from and to numpy * replace complex template in cast op
-
由 Leo Chen 提交于
-
由 pangyoki 提交于
* cast indices and label if their type is not consistent * fix bug * add unittest
-
- 20 5月, 2021 3 次提交
-
-
由 TTerror 提交于
* fix gather op and add logsumexp op on kunlun * update xpu depence * update tests and fix elementwise_add
-
由 Aurelius84 提交于
* Support convert sublayers in Sequential Container * remove paddle.jit.set_code_level
-
由 liym27 提交于
-
- 19 5月, 2021 5 次提交
-
-
由 WeiXin 提交于
* fix test_paddle_save_load and test_paddle_save_load_binary * fix unittest:test_paddle_save_load and test_paddle_save_load_binary * delete *.pyc * add comment for unittest
-
由 zhulei 提交于
* [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op * [Rocm] fix test of random_crop_op & simple_rnn_op * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & simple_rnn_op & logsumexp * [Rocm] fix test of random_crop_op & logsumexp
-
由 GT-Zhang 提交于
* Fix slow data reading, In the old version, one epoch read time of this data set was about 5371 seconds(MacBook Pro Retina, 13-inch, Early 2015 2.7 GHz), and a batch took 211 seconds, It's too painful to use. Now decompress the data in advance (about 10 seconds). Each epoch of reading takes about 3 seconds(MacBook Pro Retina, 13-inch, Early 2015 2.7 GHz), and a batch takes 0.017 seconds more. * Run CI, test=allcase * fix qq group number. test=document_fix fix qq group number. test=document_fix * fix qq group number. test=document_fix fix qq group number. test=document_fix
-
由 Aurelius84 提交于
* BugFix StaticAanlysis with gast.Subscript * remove codes
-
由 Jacek Czaja 提交于
-
- 18 5月, 2021 2 次提交
-
-
由 pangyoki 提交于
* Output indices of top_k npu op change to int64 * fix accuracy npu bug * fix errors * change cast method to FillNpuTensorWithConstant * change cast method to FillNpuTensorWithConstant
-
由 joanna.wozna.intel 提交于
* Update paths to Quant models * Update description
-