- 06 11月, 2020 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add bfloat16 softmax and gelu * Add pass attr bfloat16_enabled_op_types * Changes from review
-
由 iducn 提交于
-
- 03 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* fp16 result ok * change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS * auto detect special slice op converter for ernie with trt oss * ernie oss only support fp16 * fix special_slice_plugin serialize bug * matmul in tensorrt ok * ernie unittest ok * add matmul tensorrt unittest * remove demo code
-
- 29 10月, 2020 3 次提交
- 26 10月, 2020 1 次提交
-
-
由 Adam Osewski 提交于
-
- 21 10月, 2020 2 次提交
- 20 10月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add AVX512 instruction check for C-API * Fix formatting
-
- 15 10月, 2020 1 次提交
-
-
由 joanna.wozna.intel 提交于
-
- 13 10月, 2020 3 次提交
-
-
由 joanna.wozna.intel 提交于
-
由 Wilber 提交于
-
由 Shang Zhizhou 提交于
* add info log for trt input dynamic shape check * fix error msg error
-
- 12 10月, 2020 2 次提交
-
-
由 Wilber 提交于
-
由 chen.zhiyu 提交于
-
- 10 10月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 01 10月, 2020 1 次提交
-
-
由 Wojciech Uss 提交于
-
- 28 9月, 2020 2 次提交
- 25 9月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* [buf fix]:fix unittest test_activation_op error * split long-time unittests to smaller ones * rename some unittests
-
- 24 9月, 2020 3 次提交
-
-
由 Wilber 提交于
-
由 Wilber 提交于
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 23 9月, 2020 4 次提交
-
-
由 YUNSHEN XIE 提交于
-
由 Zhou Wei 提交于
-
由 Wilber 提交于
-
由 Chen Weihang 提交于
* polish some lost error msg * add some math file to white list * polish detail based reviewer commnet
-
- 22 9月, 2020 3 次提交
- 21 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 18 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 17 9月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 16 9月, 2020 2 次提交
- 15 9月, 2020 4 次提交
-
-
由 Shang Zhizhou 提交于
* optimize slice TRT plugin This patch removes unnecessary barrier for data transfer of needed offset, so data transfer can be overlap with GPU kernel execution. This patch also fixes incorrect name of slice plugin. That is, replaces "layernorm" with "slice" test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op convertor dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
由 石晓伟 提交于
-
由 Shang Zhizhou 提交于
* optimize errror report * add test case for pad op converter * fix some spelling mistake commented by peiyang
-
由 Pei Yang 提交于
* fix trt_dynamic_shape_ernie_deserialize_test * support when opt cache dir does not exist
-