- 19 3月, 2021 1 次提交
-
-
由 ronnywang 提交于
-
- 22 2月, 2021 2 次提交
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 28 1月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] update fluid platform for rocm35 (part1), test=develop * address review comments, test=develop
-
- 20 1月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* delete empty line of pybing.cc, test=develop * use nvtx push pop in timeline, test=develop * change year, test=develop * add #ifdef PADDLE_WITH_CUDA, test=develop * add #ifndef WIN32, test=develop * is_pushed to is_pushed_, test=develop
-
- 06 1月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
* Polish and Optimize the print/repr message of all layer * fix some code format
-
- 25 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add support for complex grad accumulated * add unittest for coverage * update test dtype * remove useless blank line
-
- 16 12月, 2020 1 次提交
-
-
由 Y_Xuan 提交于
* 添加rocm平台支持代码 * 修改一些问题 * 修改一些歧义并添加备注 * 修改代码格式 * 解决冲突后的代码修改 * 修改operators.cmake * 修改格式 * 修正错误 * 统一接口 * 修改日期
-
- 01 12月, 2020 1 次提交
-
-
由 chentianyu03 提交于
* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types * add test cases for complex elementwise, matmul and getitem unittest * add test cases for complex types * add test cases for complex matmul unittest
-
- 27 11月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 23 11月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* change avg pooling and global pooling to trt layer * add support for static shape global pooling * modify trt errmsg
-
- 17 11月, 2020 1 次提交
-
-
由 lilong12 提交于
-
- 03 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* fp16 result ok * change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS * auto detect special slice op converter for ernie with trt oss * ernie oss only support fp16 * fix special_slice_plugin serialize bug * matmul in tensorrt ok * ernie unittest ok * add matmul tensorrt unittest * remove demo code
-
- 21 10月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 19 10月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 14 10月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
* use exhaustive_search for float16 * tune algo only when dtype is float16
-
- 28 9月, 2020 1 次提交
-
-
由 lilong12 提交于
* include ncclRecv and ncclSend, test=develop
-
- 27 9月, 2020 1 次提交
-
-
由 Li Fuchen 提交于
* add float64 input to ctc_loss * modified error message of warpctc * update repo and tag of warpctc * add test for warpctc with float64 input * modified warpctc.cmake to make sure build always * resolved sample code bug of warpctc * add core.ops in warpctc dygraph * fix a bug of test
-
- 24 9月, 2020 2 次提交
-
-
由 Shibo Tao 提交于
* fix tensorrt 6 build error. test=develop * fix. test=develop * bug fix * test=develop
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 23 9月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
-
- 18 9月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* fix cudnn dyload error
-
- 07 9月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* add lstm cudnn of padding data and refine cudnn codes
-
- 03 9月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 19 8月, 2020 1 次提交
-
-
由 GaoWei8 提交于
-
- 07 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* fix trt plugin registry without trt lib * support trt4 * refine code style
-
- 05 8月, 2020 2 次提交
-
-
由 Zhaolong Xing 提交于
* cunn8 support test=develop * fix ci error test=develop
-
由 Pei Yang 提交于
* develop dynamic shape serilization * add test param for gelu * fix bugs * delete redundant comments * debug * fix conflict. test=develop * fix bug. test=develop * add trt dynamic shape serialized support * fix ernie serialized bug test=develop * fix codestyle test=develop * fix bug test=develop * fix bug.test=develop * modify cmakelist test=develop * fix bug test=develop * fix error message. test=develop * fix trt register plugin based on pr#25003 * add trt dynload * fix deserialization bug of not finding plugin registration * refine code style * recover engine key in tensorrt_subgraph_pass * for ci coverage * add unittest for deserialization Co-authored-by: Nhaozech <chenhaoze94@gmail.com>
-
- 20 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish install error hint msg, test=develop * fix variable error, test=develop * polish hint messgae again
-
- 15 7月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Refine PADDLE_ENFORCE in paddle/fluid/platform test=develop
-
- 09 7月, 2020 2 次提交
-
-
由 Chen Weihang 提交于
-
由 Zhen Wang 提交于
-
- 07 7月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* refine PADDLE_ENFORCE test=develop
-
- 03 7月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* fix PADDLE_ENFORCE and refine the description test=develop
-
- 02 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* refactor dynamic dso search func, test=develop * polish details, test=develop * polish detail based review comments, test=develop * revert string type change, test=develop
-
- 24 6月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add default cudnn lib path, test=develop * change default path in func, test=develop * move to linux branch, test=develop * fix var error in other plat, test=develop
-
- 05 6月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* support selectedrows allreduce in multi-cards dygraph, test=develop * remove useless import modules in unittests, test=develop * add nccl cmake to get nccl version, test=develop * add if-condition to compiled correctly, test=develop * add detail version parseing for old nccl, test=develop * polish camke details, test=develop * fix remove test cmake error, test=develop * fix cmake condition, test=develop * change unittest camke list, test=develop * fix unittest cmake rule, test=develop, test=framep0
-
- 18 5月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
* Add the check for whether CUDA Driver and NVRTC is available for the runtime system. * Call cuInit to initialize the CUDA Driver API before all CUDA callings. test=develop * Change the behavior when libnvrtc.so can not be found, printing a warning instead of exiting. test=develop * Do not initialize CUDA Driver API for windows and macos. test=develop * Remove the call of cuInit when entering paddle and enable the test_code_generator. test=develop * Add some built-in functions for __half. test=develop * Change save_intermediate_out to false in unittest. test=develop * Fix error reference to tempropary variable when seting including path for device_code. test=develop
-
- 08 5月, 2020 1 次提交
-
-
由 Guo Sheng 提交于
test=develop test=win_gpu
-