- 17 8月, 2021 1 次提交
-
-
由 Fan Zhang 提交于
* [CPU-PSLIB] Fix bug for consistency insepection of op's embedding name and sparse table name in config_fleet.py (#34441) * [CPU-PSLIB] Add config for scale_sparse_grad in config_fleet.py
-
- 12 8月, 2021 1 次提交
-
-
由 Fan Zhang 提交于
[CPU-PSLIB] Fix bug for consistency insepection of op's embedding name and sparse table name in config_fleet.py (#34441) (#34454)
-
- 26 7月, 2021 1 次提交
-
-
由 Fan Zhang 提交于
[CPU-PSLIB] Add consistency insepection of op's embedding name and sparse table name in config_fleet.py (#34287)
-
- 18 5月, 2021 2 次提交
-
-
由 Thunderbrook 提交于
-
由 Thunderbrook 提交于
* save multi table one path * format
-
- 30 3月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* add n-d input support for trt scale converter (#31316) * add n-d input support for trt scale converter * add flatten for ut * fix dims * fix batchnorm when inpu dims < 3 (#31933) * fix batchnorm when inpu dims < 3 * add unittest for batchnorm dims = 2 * fix unittest Co-authored-by: NPei Yang <peiyang@baidu.com>
-
- 24 3月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* fix conflict * fix compile error * cherry-pick #31316 * Refine cudnn softmax (#25757) * refine cudnn softmax * Trt elementwise plugin serialize (#31587) * add serialize unittest * fix element_op trt plugin serialize bug * remove PassVersionChecker.IsCompatible * fix unittest Co-authored-by: NPei Yang <peiyang@baidu.com> Co-authored-by: NGaoWei8 <53294385+GaoWei8@users.noreply.github.com>
-
- 07 12月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* cherry-pick PR #27933 * fix: cuda version is in varibale CUDA_VERSION in 1.8 cuda.cmake * close unittest failed temporarily * cherry-pick PR #27544, fix layer_norm and softmax bug in tensorRT
-
- 01 12月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 13 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* 裁剪transformer模型trt支持;修复tensorRT不支持DeletePass的bug (#28517) * skip_layernorm_op done * add unittest * slice op convertor support trt < 6 * skip_layernorm only work in ernie * fix unittest * fix unittest
-
- 05 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* Fix TRT plugin registry without TRT lib (#25982) * fix trt plugin registry without trt lib * support trt4 * refine code style * pick ea851796 from develop * cherry-pick develop PR #26273 && #27796 * fix unittest error * fix unittest error * remove const_cast Co-authored-by: NPei Yang <peiyang@baidu.com>
-
- 13 10月, 2020 1 次提交
-
-
由 wangxinxin08 提交于
-
- 12 10月, 2020 1 次提交
-
-
由 TeslaZhao 提交于
* cherry-pick:Add double grad in Squeeze and Unsqueeze to release/1.8, test=develop Signed-off-by: NTeslaZhao <zhaolisoftware@163.com> * cherry-pick:Add double grad in Squeeze and Unsqueeze to release/1.8, test=develop
-
- 10 10月, 2020 2 次提交
-
-
由 wanghuancoder 提交于
-
由 wangxinxin08 提交于
* add matmul doublegrad op * fix compile errors * modify code according to review * delete float16 * delete GetDimForInput to be consitent with release/1.8
-
- 28 9月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
[Cherry-Pick] Add Log double grad kernel (#27604)
-
- 27 9月, 2020 2 次提交
- 23 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
* optimize slice TRT plugin This patch removes unnecessary barrier for data transfer of needed offset, so data transfer can be overlap with GPU kernel execution. This patch also fixes incorrect name of slice plugin. That is, replaces "layernorm" with "slice" test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op convertor dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: NRyan Jeng <rjeng@nvidia.com> Co-authored-by: NShang Zhizhou <shangzhizhou@baidu.com> Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
- 22 9月, 2020 2 次提交
-
-
由 guofei 提交于
test=release/1.8
-
由 guofei 提交于
* Fix test_gast_with_compatibility.py due to the problem of gast in python3.8 (#27433) test=develop * fix dll load bug on windows from python3.8 (#27324) * Support python3.8 (#26850) * Support python3.8 test=notest * Replace the 'spawn' start method with 'fork' start method for multiprocessing, on MacOS with python>=3.8 (#27317) * Replace the 'spawn' start method with 'fork' start method for multiprocessing, on MacOs when python>=3.8 test=develop * Correct the error in decorator.py (#27409) test=develop Co-authored-by: NZhou Wei <52485244+zhouwei25@users.noreply.github.com>
-
- 17 9月, 2020 1 次提交
-
-
由 Bai Yifan 提交于
-
- 15 9月, 2020 1 次提交
-
-
由 danleifeng 提交于
* add DISTRIBUTED_TRAINER_ENDPOINTS env for cloud
-
- 11 9月, 2020 1 次提交
-
-
由 lilong12 提交于
* add double grad for expand, test=develop
-
- 10 9月, 2020 1 次提交
-
-
由 Qinghe JING 提交于
* add double grad to reduce sum
-
- 08 9月, 2020 1 次提交
-
-
由 yinhaofeng 提交于
* delate some wrong message test=develop, test=document_fix * delate some wrong message
-
- 03 9月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 27 8月, 2020 1 次提交
-
-
由 yaoxuefeng 提交于
-
- 21 8月, 2020 1 次提交
-
-
由 liu zhengxi 提交于
-
- 19 8月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 18 8月, 2020 1 次提交
-
-
由 Thunderbrook 提交于
* add mock barrier all (#24786) * add mock barrier all test=develop * fix test=develop * fix test=develop * fix test=develop * fix gloo error test=develop Co-authored-by: Nxujiaqi01 <173596896@qq.com>
-
- 17 8月, 2020 2 次提交
-
-
由 MRXLT 提交于
-
由 YUNSHEN XIE 提交于
-
- 12 8月, 2020 2 次提交
-
-
由 Thunderbrook 提交于
* fix dataset py3 (#25012) * fix dataset py3 error * test=develop * fix logger (#24682) * fix logger of FetchHandler,which may print log twice * test=develop * add timeout and http store in communication (#23436) * add timeout and http store in communication, add revert and confirm in fleet * test=develop * modify datanorm op test=develop (#23030) Co-authored-by: Nxujiaqi01 <173596896@qq.com> Co-authored-by: Nyaoxuefeng <yaoxuefeng@baidu.com>
-
由 Leo Chen 提交于
* fix prelu for compatibility with saved model of old version * reshape alpha
-
- 07 8月, 2020 2 次提交
-
-
由 YUNSHEN XIE 提交于
test=develop Co-authored-by: Nchalsliu <45041955+chalsliu@users.noreply.github.com>
-
由 Pei Yang 提交于
* fix cpuid.h not found * fix-jetson-compile-pyramid_hash * add more info to version.txt, test=develop (#24551)
-
- 06 8月, 2020 1 次提交
-
-
由 cc 提交于
-
- 05 8月, 2020 2 次提交
-
-
由 hong 提交于
* Fix dygraph grad bugs (#25781) * fix double grad visitid unit; test=develop * change name hash_pair to HashPair; test=develop * follow comment; test=develop * remove manual seed; test=develop * change create_graph from True to False; test=develop
-
由 Zhen Wang 提交于
* Fix the double grad bug for the star gan. (#25655) * update the retain_graph parameter doc. test=develop
-