- 30 11月, 2020 1 次提交
-
-
由 Jack Zhou 提交于
fix gru gcc7.4 bug for the gru compile
-
- 28 11月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 27 11月, 2020 8 次提交
-
-
由 ShenLiang 提交于
* add reducer * refine envent for memorycopy * add concat&split for allreduce * apply concat & split for fuse tensor * fix nccl dep * fix the untest, compile problem and ddp initialize problem * fix untest for mac & add some comments & solve the repeated param in sublayers * fix untest for windows & fix document
-
由 lilong12 提交于
update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020) * update, test=develop
-
由 Zhou Wei 提交于
-
由 Jack Zhou 提交于
Add eigen gru and fix the dropout bug in the rnn
-
由 yaoxuefeng 提交于
-
由 arlesniak 提交于
-
由 Shang Zhizhou 提交于
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake * comile with cuda9 * add some unittest * notest;test=coverage * add unittest for trt plugin swish && split * update ernie unittest * fix some error message * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter * fix comile errror when CUDA_ARCH_NAME < Pascal" * fix comile error * update unittest timeout * compile with cuda9 * update error msg * fix code style * add some comments * add define IF_CUDA_ARCH_SUPPORT_FP16 * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
-
由 Leo Chen 提交于
-
- 26 11月, 2020 9 次提交
-
-
由 Noel 提交于
Fix ops doc for some ops
-
由 Leo Chen 提交于
* split train_mode and has_grad * fix format * fix ci problems * fix sample code
-
由 Aurelius84 提交于
-
由 WangXi 提交于
-
由 Shang Zhizhou 提交于
-
由 Shibo Tao 提交于
add API serialize_program, serialize_persistables, save_to_file, deserialize_program, deserialize_persistables, load_from_file. (#29034)
-
由 joanna.wozna.intel 提交于
* Add bf16 pool2d and unify bf16 unit tests * Add change default ops test
-
由 joanna.wozna.intel 提交于
* Fix cpu_bfloat16_pass * Add output_format * Fix incorrect SetOutput * Change fromating
-
由 Qi Li 提交于
* fix win ci failure, test=develop * add ci test, test=develop
-
- 25 11月, 2020 7 次提交
-
-
由 Chen Weihang 提交于
* default not show cpp statck & add hint * fix failed unittest * fix failed unittests
-
由 joejiong 提交于
add uint8 for reshape operator
-
由 Zhou Wei 提交于
* fix tensor detach to zero copy * fix tensor detach to zero copy
-
由 taixiurong 提交于
-
由 joejiong 提交于
Simple code clean up
-
由 wawltor 提交于
remove eigen threadpool for the speed up
-
由 Wojciech Uss 提交于
* Add multi_gru_fuse_pass and tests * fix date * cleaned up headers
-
- 24 11月, 2020 3 次提交
-
-
由 lilong12 提交于
-
由 Wojciech Uss 提交于
* Add multi_gru_seq_fuse_pass and tests * fix date * removed unused functions
-
由 123malin 提交于
* test=develop, optimize global_step
-
- 23 11月, 2020 8 次提交
-
-
由 lilong12 提交于
* update, test=develop
-
由 Thunderbrook 提交于
* ps gpu transpile * ps gpu * remove op * gps trainer * local ps * add macro * HeterBox * def cuda * tab * code style * style Co-authored-by: Thunderbrook <a754913769#163.com>
-
由 Chen Weihang 提交于
-
由 furnace 提交于
* refactor momentum op to combine weight_decay (scale op and sum op)
-
由 Jacek Czaja 提交于
-
由 Pei Yang 提交于
* change avg pooling and global pooling to trt layer * add support for static shape global pooling * modify trt errmsg
-
由 yaoxuefeng 提交于
-
由 HappyAngel 提交于
-
- 21 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 20 11月, 2020 2 次提交
-
-
由 gongweibao 提交于
-
由 Chen Weihang 提交于
-