- 23 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* optimize slice TRT plugin This patch removes unnecessary barrier for data transfer of needed offset, so data transfer can be overlap with GPU kernel execution. This patch also fixes incorrect name of slice plugin. That is, replaces "layernorm" with "slice" test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op convertor dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: NRyan Jeng <rjeng@nvidia.com> Co-authored-by: NShang Zhizhou <shangzhizhou@baidu.com> Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
- 22 9月, 2020 2 次提交
-
-
由 tianshuo78520a 提交于
-
由 guofei 提交于
* Fix test_gast_with_compatibility.py due to the problem of gast in python3.8 (#27433) test=develop * fix dll load bug on windows from python3.8 (#27324) * Support python3.8 (#26850) * Support python3.8 test=notest * Replace the 'spawn' start method with 'fork' start method for multiprocessing, on MacOS with python>=3.8 (#27317) * Replace the 'spawn' start method with 'fork' start method for multiprocessing, on MacOs when python>=3.8 test=develop * Correct the error in decorator.py (#27409) test=develop Co-authored-by: NZhou Wei <52485244+zhouwei25@users.noreply.github.com>
-
- 21 9月, 2020 2 次提交
-
-
由 Pei Yang 提交于
-
由 yaoxuefeng 提交于
-
- 18 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* [Paddle-TRT] Stack op plugin (#25605) * add stack_op to CMakeLists * add dim=3 support for scale op * add trt stack op, test=develop * remove debug message * add stack plugin serialize * remove slice, scale op, will add later * enhence error message * revise trt ernie test to conver the stack op CI testi, test=develop * add stack op serialization * fix test shape after adding stack op * remove slice op, will add after implementing serialization * roll back to min_graph=5 to avoid using slice op * fix scale op output layer * implement stack op createPlugin * use workspace and move the defination to .cu * move stack plugin creator definition to .cu, test=develop * sync ut with develop Co-authored-by: Nzlsh80826 <zlsh80826@gmail.com>
-
- 16 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* fix multihead matmul shared params (#27121) * fix multihead matmul shared params
-
- 14 9月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
test=develop
-
- 11 9月, 2020 1 次提交
-
-
由 lilong12 提交于
* add double grad for expand, test=develop
-
- 10 9月, 2020 1 次提交
-
-
由 Qinghe JING 提交于
* add double grad to reduce sum
-
- 03 9月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 02 9月, 2020 2 次提交
-
-
由 Thunderbrook 提交于
cherry-pick fix cvm check test=develop Co-authored-by: N123malin <malin10@baidu.com>
-
由 Thunderbrook 提交于
* fix eigen in push sparse; fix hadoop command test=develop * add log in load_combine_op test=develop
-
- 01 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
This commit fixs the compiling bug regarding unique_ptr of IOptimizationProfile. IOptimizationProfile has protected dtor and is controlled by TensorRT internally. Application shouldn't delete the pointer of IOptimizationProfile. See TensorRT document: https://docs.nvidia.com/deeplearning/sdk/tensorrt-api/c_api/classnvinfer1_1_1_i_builder.html#a9ac47e100454151d8206ac91d543299a test=develop Co-authored-by: NJeng Bai-Cheng <jeng1220@users.noreply.github.com>
-
- 24 8月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
* mod cvm test=develop * mod code format test=develop
-
- 20 8月, 2020 1 次提交
-
-
由 tianshuo78520a 提交于
-
- 19 8月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 18 8月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* add mock barrier all (#24786) * add mock barrier all test=develop * fix test=develop * fix test=develop * fix test=develop * fix gloo error test=develop Co-authored-by: Nxujiaqi01 <173596896@qq.com>
-
- 12 8月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* fix dataset py3 (#25012) * fix dataset py3 error * test=develop * fix logger (#24682) * fix logger of FetchHandler,which may print log twice * test=develop * add timeout and http store in communication (#23436) * add timeout and http store in communication, add revert and confirm in fleet * test=develop * modify datanorm op test=develop (#23030) Co-authored-by: Nxujiaqi01 <173596896@qq.com> Co-authored-by: Nyaoxuefeng <yaoxuefeng@baidu.com>
-
- 11 8月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* add macro check for using TRT api dynamicRangeIsSet() (#25694) * adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
-
- 10 8月, 2020 1 次提交
-
-
由 MRXLT 提交于
* fix bug bug fix
-
- 07 8月, 2020 3 次提交
-
-
由 iducn 提交于
Co-authored-by: NTao Luo <luotao02@baidu.com>
-
由 Pei Yang 提交于
* fix cpuid.h not found * fix-jetson-compile-pyramid_hash * add more info to version.txt, test=develop (#24551)
-
由 iducn 提交于
Co-authored-by: NWilber <jiweibo@baidu.com>
-
- 06 8月, 2020 5 次提交
-
-
由 iducn 提交于
Co-authored-by: NWilber <jiweibo@baidu.com>
-
由 Zhen Wang 提交于
-
由 Pei Yang 提交于
* solve conflict * fix crash when trt not found in python; update unittest model path
-
由 Chen Weihang 提交于
* simply C++ error stack once again, test=develop * refactor code remove string pointer and recursive, test=develop
-
由 Pei Yang 提交于
* fix multhead matmul's instable test=develop * fix multihead matmul bug test=develop * fix converage problem test=develop Co-authored-by: NZhaolong Xing <nhzlx.dragon@gmail.com>
-
- 05 8月, 2020 3 次提交
-
-
由 hong 提交于
* Fix dygraph grad bugs (#25781) * fix double grad visitid unit; test=develop * change name hash_pair to HashPair; test=develop * follow comment; test=develop * remove manual seed; test=develop * change create_graph from True to False; test=develop
-
由 Zhen Wang 提交于
* Fix the double grad bug for the star gan. (#25655) * update the retain_graph parameter doc. test=develop
-
由 Zhen Wang 提交于
* Add some error messages for the op without double grads. * fix the test_imperative_double_grad UT.
-
- 04 8月, 2020 4 次提交
-
-
由 MRXLT 提交于
* fix conflict * fix conflict * fix code cherry pick encryption api Co-authored-by: NYanghello <915769235@qq.com> Co-authored-by: NYanghello <yangqingyou@baidu.com>
-
由 zhangchunle 提交于
-
由 GaoWei8 提交于
Fix the condition of concat dimension judgment.
-
由 Pei Yang 提交于
-
- 30 7月, 2020 2 次提交
-
-
由 石晓伟 提交于
* ignore warnings of external libraries, test=develop (#24193) * fix repeat definitions in liengine.cc, test=develop (#25020) * remove paddle_use_kernel and paddle_use_op. test=develop (#25189) * fix compile for lite subgraph. test=develop (#25285) * [CI] [Lite-Subgraph] CI add lite subgraph check. (#25346) * supports xpu runtime, test=develop (#25554) * fix cmake of lite, test=develop (#25680) * change commit files, test=release/1.8 Co-authored-by: NWilber <jiweibo@baidu.com>
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 29 7月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 27 7月, 2020 1 次提交
-
-
由 Adam 提交于
-