- 15 12月, 2020 2 次提交
-
-
由 ShenLiang 提交于
* fix bug of matmul_v2 for broadcast
-
由 qingqing01 提交于
-
- 14 12月, 2020 8 次提交
-
-
由 wangchaochaohu 提交于
-
由 arlesniak 提交于
-
由 lilong12 提交于
-
由 WangXi 提交于
-
由 tangwei12 提交于
* add service, remove ut on mac * fix heter_profiler & add heter stop method * fix code style
-
由 Leo Chen 提交于
* fix compile problem when cuda_arch < 6000 * refine code * refine code
-
由 QingshuChen 提交于
* support roi_align & affine_channel for kunlun * minor
-
由 Jacek Czaja 提交于
-
- 11 12月, 2020 9 次提交
-
-
由 Wilber 提交于
-
由 JZ-LIANG 提交于
* Sharding add hybrid-dp feature * update sharding in distributed_strategy * update sharding unitest * revise code format for sharding
-
由 Leo Chen 提交于
-
由 Zhang Ting 提交于
* improve drop out * add VectorizedRandomGeneratorWithGenerator * fix bug * modify according to comments
-
由 Zhang Ting 提交于
-
由 LoveAn 提交于
* Add the strategy of skipping cc/cu test compilation and execution in CI, test=develop * fix if error with CI_SKIP_TEST, test=develop * fix add properties to test error on Linux/MAC, test=develop * fix set test properties of test_code_generator error, test=develop * remove test codes and advance judgment of file modification on Linux, test=develop * rename CI_SKIP_TEST to CI_SKIP_CPP_TEST, test=document_fix * Add branch judgement on Linux, test=develop
-
由 Aurelius84 提交于
* Add more value to calculate hash key * fix size_t * polish code
-
由 taixiurong 提交于
* 1.fix matmul bug 2. add one hot * add xpu error msg
-
由 Jacek Czaja 提交于
* - Added infrastructre for new test - Added UT for Multiple models prediction - cosmetic fixes - lint - lint fixes * - Removed timeout for MMP test
-
- 10 12月, 2020 5 次提交
-
-
由 Zhong Hui 提交于
fix p_norm with empty shape (#29500)
-
由 Leo Chen 提交于
* layernorm fw opt * layernorm bw opt * fix typo, test=develop * remove const dim3 for windows CI compatibility * merge develop Co-authored-by: Nzlsh80826 <zlsh80826@gmail.com>
-
由 tangwei12 提交于
* add ps table Change-Id: I468a04bd071d21ff52654926fcf4d5f3da19e178
-
由 ShenLiang 提交于
-
由 Zhen Wang 提交于
* remove tensor copy in the update_loss_scaling op * not use thrust. * fix some cuda memory access error.
-
- 09 12月, 2020 7 次提交
-
-
由 Zhou Wei 提交于
* support deepcopy for Layer/Tensor/Paramerbase * fix some code
-
由 joejiong 提交于
As the title
-
由 zlsh80826 提交于
* vec softmax fw * vec softmax bw * add a message argument for compiler compatibility
-
由 ShenLiang 提交于
* add tensor_indices in AssignGroupBySize * add rebuild group in reducer
-
由 procr 提交于
-
由 Huihuang Zheng 提交于
Add Sleep Time for CUDA Retry, which is similar to our GPU retry logic. This is a try to avoid init GPU allocation random failure in unit test.
-
由 Leo Chen 提交于
-
- 08 12月, 2020 7 次提交
-
-
由 Zhang Ting 提交于
This reverts commit befd6d53.
-
由 jakpiase 提交于
* added external reorder to profiler * added external and internal reorders to profiler * added internal and external reorder to profiler * added formatting to int/ext reorder commit * removed unnecessary comment
-
由 Pei Yang 提交于
* change hard_swish from plugin to layer * add ut when threshold != scale
-
由 taixiurong 提交于
Co-authored-by: Nroot <root@bjhw-sys-rpm0223.bjhw.baidu.com>
-
由 LoveAn 提交于
-
由 TTerror 提交于
* update reduce_sum op on xpu * update reduce_sum op on xpu * support running on xpu
-
由 Jack Zhou 提交于
-
- 07 12月, 2020 2 次提交
-
-
由 Zhang Ting 提交于
-
由 Leo Chen 提交于
-