- 13 Nov, 2020 1 commit
-
-
Committed by Zhou Wei
-
- 04 Nov, 2020 1 commit
-
-
Committed by Chen Weihang
-
- 03 Nov, 2020 4 commits
-
-
Committed by Shang Zhizhou
* fp16 result ok * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS * auto detect special slice op converter for ernie with trt oss * ernie oss only supports fp16 * fix special_slice_plugin serialize bug * matmul in tensorrt ok * ernie unittest ok * add matmul tensorrt unittest * remove demo code
-
Committed by Jacek Czaja
-
Committed by Wilber
-
Committed by Guo Sheng
* Add rnn_op. test=develop * Fix rnn_op grad maker's drop_empty_grad. test=develop
-
- 02 Nov, 2020 2 commits
-
-
Committed by wangchaochaohu
-
Committed by Huihuang Zheng
This PR is a follow-up to #28213. In that PR we tried to decrease GPU usage, but the CI still failed randomly. So I added retry logic for the initialization of nccl and cusolver. If the initialization fails, we can retry to avoid the random failure.
-
- 30 Oct, 2020 1 commit
-
-
Committed by Leo Chen
-
- 28 Oct, 2020 1 commit
-
-
Committed by Jacek Czaja
-
- 27 Oct, 2020 1 commit
-
-
Committed by Chen Weihang
* add multiple exception type * define all exception & polish compile pystack * mapping paddle error to python exception * polish static mode error format * fix failed unittests * fix dytostatic test_error * fix check_nan_inf failed * add unittest for coverage * revert some code try to solve compile error * refactor enforce & error change * polish code & add unittest
-
- 23 Oct, 2020 1 commit
-
-
Committed by Chen Weihang
* add compile limit for paddle enforce * polish elementwise_op_function.cu.h * fix failed unittest * fix windows compile failed * detail polish * revert no type constructor
-
- 21 Oct, 2020 1 commit
-
-
Committed by Zhou Wei
-
- 20 Oct, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 19 Oct, 2020 1 commit
-
-
Committed by Pei Yang
-
- 16 Oct, 2020 1 commit
-
-
Committed by lidanqing
* conv dilated mkldnn support: forward and backward pass * add mkldnn conv_transpose dilation UT test=develop * remove unnecessary PADDLE_ENFORCE * add int8 and bf16 dilated conv UT * update according to reviews
-
- 14 Oct, 2020 1 commit
-
-
Committed by Zhang Ting
* use exhaustive_search for float16 * tune algo only when dtype is float16
-
- 12 Oct, 2020 2 commits
-
-
Committed by Jacek Czaja
-
Committed by chen.zhiyu
-
- 01 Oct, 2020 1 commit
-
-
Committed by Jacek Czaja
* - candidate fix to issue #25537 test=develop * - UT for transpose NHWC test=develop
-
- 30 Sep, 2020 1 commit
-
-
Committed by joanna.wozna.intel
* Add avx instructions check * Small fix * Change function name * Change uint to unsigned int
-
- 29 Sep, 2020 2 commits
- 28 Sep, 2020 4 commits
- 27 Sep, 2020 2 commits
-
-
Committed by Li Fuchen
* add float64 input to ctc_loss * modified error message of warpctc * update repo and tag of warpctc * add test for warpctc with float64 input * modified warpctc.cmake to make sure build always * resolved sample code bug of warpctc * add core.ops in warpctc dygraph * fix a bug of test
-
Committed by QingshuChen
* support elementwise add, activation, matmul on Baidu Kunlun * test=kunlun * minor * test=kunlun * reconstruct the xpu directory * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
- 26 Sep, 2020 1 commit
-
-
Committed by Zhong Hui
fix cpplint error for the atomic max/min
-
- 25 Sep, 2020 1 commit
-
-
Committed by Zhong Hui
fix cuda atomic for ARCH<350 for the atomic_max
-
- 24 Sep, 2020 3 commits
-
-
Committed by Shibo Tao
* fix tensorrt 6 build error. test=develop * fix. test=develop * bug fix * test=develop
-
Committed by wanghuancoder
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
Committed by Zhong Hui
Add GPU kernels for segment ops; supports sum, max, min, mean
-
- 23 Sep, 2020 2 commits
-
-
Committed by Shang Zhizhou
* [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
-
Committed by Chen Weihang
* polish some lost error msg * add some math file to white list * polish details based on reviewer comments
-
- 21 Sep, 2020 1 commit
-
-
Committed by Leo Chen
* support using add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when several sum ops are inserted at the same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
-
- 18 Sep, 2020 1 commit
-
-
Committed by GaoWei8
* fix cudnn dyload error
-
- 17 Sep, 2020 1 commit
-
-
Committed by Jack Zhou
enhance reduce op so that it can reduce tensors of arbitrary rank
-
- 15 Sep, 2020 1 commit
-
-
Committed by GaoWei8
* replace sequence length attr to input
-