- 26 Sep, 2020 1 commit
-
-
Committed by Zhong Hui
fix cpplint error for the atomic max/min
-
- 25 Sep, 2020 1 commit
-
-
Committed by Zhong Hui
fix cuda atomic for ARCH<350 for the atomic_max
-
- 24 Sep, 2020 3 commits
-
-
Committed by Shibo Tao
* fix tensorrt 6 build error. test=develop
* fix. test=develop
* bug fix
* test=develop
-
Committed by wanghuancoder
* use iwyu clean include, test=develop, test=win
* compilation error, test=develop
* fix compilation error2, test=develop
* fix compilation error3, test=develop
* fix compilation error4, test=develop
* fix compilation error5, test=develop
* fix compilation error6, test=develop
* fix compilation error7, test=develop
* fix compilation error8, test=develop
* fix compilation error8, test=develop
* fix compilation error10, test=develop
* fix compilation error11, test=develop
-
Committed by Zhong Hui
Add GPU Kernels of Segment Ops, support sum, max, min, mean
-
- 23 Sep, 2020 2 commits
-
-
Committed by Shang Zhizhou
* [bug fix]: Memory increases after adapting the cudnn version to 8
* [bug fix]: cudnnGetConvolutionForwardAlgorithm not defined
-
Committed by Chen Weihang
* polish some lost error msg
* add some math file to white list
* polish details based on reviewer comment
-
- 21 Sep, 2020 1 commit
-
-
Committed by Leo Chen
* support use add instead of sum to do gradient accumulation
* add inplace addto pass
* add grad_add op and inplace addto pass
* remove debug code
* code refine
* fix bug when several sum ops are inserted at the same op_idx
* fix Flags type
* add addto attribute for conv3d
* fix ut
* code clean
* fix type
-
- 18 Sep, 2020 1 commit
-
-
Committed by GaoWei8
* fix cudnn dynload error
-
- 17 Sep, 2020 1 commit
-
-
Committed by Jack Zhou
enhance reduce op so that it can reduce tensors of arbitrary rank
-
- 15 Sep, 2020 1 commit
-
-
Committed by GaoWei8
* replace sequence length attr with input
-
- 14 Sep, 2020 1 commit
-
-
Committed by joanna.wozna.intel
-
- 07 Sep, 2020 1 commit
-
-
Committed by GaoWei8
* add lstm cudnn of padding data and refine cudnn codes
-
- 03 Sep, 2020 2 commits
-
-
Committed by wangchaochaohu
-
Committed by joanna.wozna.intel
-
- 28 Aug, 2020 1 commit
-
-
Committed by Zhen Wang
* update the demo code and the doc of varbase.backward.
* update the doc of the fake interface `paddle.fluid.Variable`.
* remove BackwardStrategy.
-
- 27 Aug, 2020 1 commit
-
-
Committed by lilong12
add collective op for cpu using gloo and paddle.distributed.* apis
-
- 26 Aug, 2020 1 commit
-
-
Committed by joanna.wozna.intel
-
- 21 Aug, 2020 2 commits
-
-
Committed by Adam
* Add mechanism for blocking oneDNN cache clearing
* Review changes and Add thread guards
-
Committed by QingshuChen
* support Baidu AI Accelerator
* test=kunlun
* minor
* test=kunlun
* support xpu op in separate file
* test=kunlun
* update XPU error message and remove duplicated code
* test=kunlun
* minor
* test=kunlun
* minor
* test=kunlun
-
- 19 Aug, 2020 1 commit
-
-
Committed by GaoWei8
-
- 17 Aug, 2020 1 commit
-
-
Committed by Leo Chen
* print user-friendly error message
* adjust error summary
-
- 16 Aug, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 08 Aug, 2020 1 commit
-
-
Committed by joanna.wozna.intel
* Change use_quantizer attribute name and data type
* Fix problem with setting attribute
* Add changes due to review
* Small change in function
* Restore use_quantizer attr for compatibility
-
- 07 Aug, 2020 2 commits
- 06 Aug, 2020 1 commit
-
-
Committed by Adam
* Add oneDNN fusion_gru kernel and fix fc+gru pass test=develop
* Formatting changes test=develop
* Lint fixes test=develop
* Add memory::format_tag::any to GRU weights test=develop
* Fix build with CUDA
* Fix build with CUDA v2
-
- 05 Aug, 2020 2 commits
-
-
Committed by Zhaolong Xing
* cudnn8 support test=develop
* fix ci error test=develop
-
Committed by Pei Yang
* develop dynamic shape serialization
* add test param for gelu
* fix bugs
* delete redundant comments
* debug
* fix conflict. test=develop
* fix bug. test=develop
* add trt dynamic shape serialized support
* fix ernie serialized bug test=develop
* fix codestyle test=develop
* fix bug test=develop
* fix bug. test=develop
* modify cmakelist test=develop
* fix bug test=develop
* fix error message. test=develop
* fix trt register plugin based on pr#25003
* add trt dynload
* fix deserialization bug of not finding plugin registration
* refine code style
* recover engine key in tensorrt_subgraph_pass
* for ci coverage
* add unittest for deserialization
Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
-
- 04 Aug, 2020 1 commit
-
-
Committed by Chen Weihang
-
- 30 Jul, 2020 1 commit
-
-
Committed by Chen Weihang
* refine error stack format
* polish compile traceback format
* polish detail format
-
- 29 Jul, 2020 2 commits
-
-
Committed by Chen Weihang
* unified signal error format
* refine signal error message
-
Committed by Chen Weihang
* simplify buffered reader to improve DataLoader performance
* fix 22 failed unittests
* fix cuda pinned context condition
* fix test_reader_reset failed
* fix two failed unittests
* change unittest place
* polish error message
* polish cast op GetExpectedKernelType
* remove debug info in unittest
-
- 28 Jul, 2020 1 commit
-
-
Committed by arlesniak
* Added DNNL cache management for DyGraph
* move FLAGS_use_mkldnn to more general CMakeLists, get use of the flag in ClearGradients
* missing file
* Fixes after review
* Bringing back original idea of place for 'use_mkldnn' flag to be accessible from platform and imperative.
* Removed duplicate and added docs
* Fixes for CI
-
- 23 Jul, 2020 1 commit
-
-
Committed by joanna.wozna.intel
test=develop
-
- 20 Jul, 2020 1 commit
-
-
Committed by Chen Weihang
* polish install error hint msg, test=develop
* fix variable error, test=develop
* polish hint message again
-
- 17 Jul, 2020 1 commit
-
-
Committed by Jacek Czaja
-
- 15 Jul, 2020 2 commits
-
-
Committed by LielinJiang
* add bilateral slice op
-
Committed by GaoWei8
* Refine PADDLE_ENFORCE in paddle/fluid/platform test=develop
-
- 11 Jul, 2020 1 commit
-
-
Committed by Chen Weihang
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop
* replace old macro & for condition, test=develop
* polish details, test=develop
-