- 02 3月, 2021 5 次提交
-
-
由 Pei Yang 提交于
* add n-d input support for trt scale converter * add flatten for ut * fix dims
-
由 Shang Zhizhou 提交于
* support trt serialize when load model from memory * delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable * Revert "delete conv_bn_fuse_pass before tensorrt, with which trt serialize engine id is not stable" performance degradation, fix in the future This reverts commit fa6cd17e60b15df351efda379ddd00e9e9c1fea9. * add delete conv_bn * delete path when delete_cache_files
-
由 Gradie 提交于
* lamb_op_xpu;test=kunlun * modify lamb_op_xpu.cc;test=kunlun * delete atol lamb_op_xpu; test=kunlun * update xpu.cmake;test=kunlun * test_error 1e-5,lamb_op_xpu;test=kunlun * error1e-5,lamb_op_xpu,test=kunlun * delete atol lamb_xpu;test=kunlun * modify atol,lamb_op_xpy;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu, XPUOptest;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu,modify xpu_cmake; test=kunlun * lamb_op_xpu;test=kunlun * lamb_op_xpu,modify xpucmake;test=kunlun
-
由 danleifeng 提交于
* topo and memory performance for heterps; test=develop * add trainwithprofiler in heter trainier; test=develop
-
由 Qi Li 提交于
-
- 01 3月, 2021 10 次提交
-
-
由 cucuzg 提交于
* add clip_by_norm on kunlun, *test=kunlun * opt matmul and matmul_v2 on kunlun, *test=kunlun
-
由 Wilber 提交于
-
由 wuhuanzhou 提交于
-
由 石晓伟 提交于
-
由 wuhuanzhou 提交于
* optimize unity build, test=develop * fix compilation error on Windows, test=develop * fix compilation error, test=develop * fix code style error, test=develop
-
由 jiangcheng 提交于
-
由 alncat 提交于
-
由 Chen Weihang 提交于
-
由 Qi Li 提交于
-
由 niuliling123 提交于
* Optimized the adaptive_avg_pool2d op when output_size == 1
-
- 28 2月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 27 2月, 2021 2 次提交
- 26 2月, 2021 7 次提交
-
-
由 Jiabin Yang 提交于
-
由 Jiabin Yang 提交于
* remove remove_unsupport_dtype * remove remove_unsupport_dtype * remove test dtype * add more include * change dtype.h's enum as enum class to avoid conflict with inference lib * make enum as enum class * remove additional test * merge develop * polish code
-
由 WangXi 提交于
-
由 Chen Weihang 提交于
* split build op marco & polish details * revert register api del * fix other unittest
-
由 tangwei12 提交于
Change-Id: I6210ce9c60bed48f3323c47b16500302b66cedf2
-
由 Qi Li 提交于
-
由 Qi Li 提交于
-
- 25 2月, 2021 8 次提交
-
-
由 Qi Li 提交于
-
由 Wilber 提交于
-
由 Guanghua Yu 提交于
-
由 chentianyu03 提交于
* add cache for VariableWrapper * modify args names and vlog level * format code style * add log when set cache to variable_wrapper * add log when set cache to variable_wrapper * add comment to variableWrapper cache * format code style
-
由 wangchaochaohu 提交于
-
由 joanna.wozna.intel 提交于
-
由 jakpiase 提交于
-
由 Chen Weihang 提交于
* add simple attr support and test * add int, float attr support * support other attribute * add custom attrs test in cmake * polish details * fix test failed * add backward test * update test flags
-
- 24 2月, 2021 7 次提交
-
-
由 Zhou Wei 提交于
-
由 Leo Chen 提交于
* revert the modification of set_expected_place * set device before op run * add ut
-
由 lilong12 提交于
* update, test=develop
-
由 lilong12 提交于
* update, test=develop
-
由 Thunderbrook 提交于
* push multi node * multi node * MultiThread * remove log * solve bug in 30829
-
由 liu zhengxi 提交于
* add get_cublas_handle() api * update format * add unittests * alter function name
-
由 Pei Yang 提交于
* add group norm plugin * fix compile problems * move concat axis check to trt op teller * add nbDims for scale and bias nv dims * add group norm unit test * fix unittest * add trt version restriction for group norm op teller * fix unittest
-