- 28 9月, 2020 4 次提交
-
-
由 liym27 提交于
* Support assignment to a Variable in dynamic mode. Note: not deal with backward. * Rewrite VarBase __setitem__ for high-performance. * try to test 3 means to do __setitem__ and test the performance of 3 means. * Retain the means of the highest performance: C++ code and don't trace op.
-
由 lilong12 提交于
* include ncclRecv and ncclSend, test=develop
-
由 lilong12 提交于
* add gloo initializer, test=develop
-
由 furnace 提交于
-
- 27 9月, 2020 7 次提交
-
-
由 Double_V 提交于
* Error message opt, test=develop * solve comments, test=develop * fix typo, test=develop
-
由 Li Fuchen 提交于
* add float64 input to ctc_loss * modified error message of warpctc * update repo and tag of warpctc * add test for warpctc with float64 input * modified warpctc.cmake to make sure build always * resolved sample code bug of warpctc * add core.ops in warpctc dygraph * fix a bug of test
-
由 QingshuChen 提交于
* support elementwise add, activation, matmul on Baidu Kunlun * test=kunlun * minor * test=kunlun * reconstuct the xpu directory * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
由 Jack Zhou 提交于
register log double grad kernel for cpu and cuda
-
由 Chengmo 提交于
* fix sgd/momentum/dpsgd/rmsprop error message
-
由 Leo Chen 提交于
* refine broadcast_op_handle * refine some error messages * refine some files * fix bug * fix bug * fix bug * follow comments * follow comments
-
由 Chengmo 提交于
* fix test_dist_fleet_heter_ctr & peformance update
-
- 26 9月, 2020 2 次提交
-
-
由 Zhong Hui 提交于
fix cpplint error for the autmic max/min
-
由 joanna.wozna.intel 提交于
-
- 25 9月, 2020 13 次提交
-
-
由 Leo Chen 提交于
* refine err msg * follow comments
-
由 Thunderbrook 提交于
* add xpu in heter mode test=develop * BOOST_CONST_GET; PADDLE_THROW test=develop * code style test=develop * code style test=develop * code style test=develop * refine test=develop * refine test=develop * refine test=develop * refine code test=develop
-
由 ceci3 提交于
* fix bn double grad, test=develop * update, test=develop
-
由 ShenLiang 提交于
* add fp16 for matmul
-
由 Zhong Hui 提交于
add abs support double grad for the api 2.0
-
由 GaoWei8 提交于
* refine PADDLE_ENFORCE
-
由 liym27 提交于
-
由 Leo Chen 提交于
-
由 Shang Zhizhou 提交于
* [buf fix]:fix unittest test_activation_op error * split long-time unittests to smaller ones * rename some unittests
-
由 Zhong Hui 提交于
fix cuda atomic for ARCH<350 for the automic_max
-
由 WangXi 提交于
-
由 cc 提交于
* Add pass compatibility check for four mkldnn pass, test=develop
-
由 mapingshuo 提交于
-
- 24 9月, 2020 11 次提交
-
-
由 Wilber 提交于
-
由 Wilber 提交于
-
由 ruri 提交于
-
由 Kaipeng Deng 提交于
* fix error message in conv/conv_transpose. test=develop
-
由 Wilber 提交于
-
由 ruri 提交于
* refine err msg in shuffle channel op
-
由 Aurelius84 提交于
-
由 Shibo Tao 提交于
* fix tensorrt 6 build error. test=develop * fix. test=develop * bug fix * test=develop
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
由 wangchaochaohu 提交于
* refine the error message for bath size like
-
由 Zhong Hui 提交于
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
-
- 23 9月, 2020 3 次提交
-
-
由 YUNSHEN XIE 提交于
-
由 Shang Zhizhou 提交于
* [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
-
由 Zhou Wei 提交于
* Makes the Bind Method more intelligent * Makes the Bind Method more intelligent * fix unittest * fix unittest * fix conflict
-