- 15 Jul, 2021 5 commits
-
-
Committed by ceci3
* fix bn * fix * add unittest * fix cpu
-
Committed by Qi Li
-
Committed by Qi Li
-
Committed by 王明冬
[pass enhance] make the attribute check only object to which defined in op proto. test=develop (#34146)
-
Committed by Aurelius84
* Refine Constructor logic of ParallelExecutor * Replace executor into ParallelExecutor in run_program_op
-
- 14 Jul, 2021 5 commits
-
-
Committed by ShenLiang
* add scatter-gather opt * fix topo for pp * rename function
-
Committed by Leo Chen
* adam add input SkipUpdate * add unittest * add npu unittest * fix xpu compile * remove param stream
-
Committed by zhouweiwei2014
* Support sccache to speed up compilation on Windows
-
Committed by Haohongxiang
-
Committed by Yuang Liu
-
- 13 Jul, 2021 5 commits
-
-
Committed by LiuWei
-
Committed by WangXi
-
Committed by ronnywang
-
Committed by ronnywang
-
Committed by Haohongxiang
* Fix gather_op by adding OurOfRangeCheck for param[Index] * Code Optimization
-
- 12 Jul, 2021 9 commits
-
-
Committed by WangXi
-
Committed by houj04
* add npu operator for gaussian random. * bugfix: add wait after memory copy. * update gaussian random op: use TensorCopy.
-
Committed by Qi Li
-
Committed by Zhang Zheng
-
Committed by WeiXin
-
Committed by pangyoki
* add dropout npu op * fix bugs * add unittest * fix bugs * support 1-D input
-
Committed by pangyoki
* change ScatterAdd to EmbeddingDenseGrad in lookup_table NPU op * EmbeddingDenseGrad only supports dim 32 * fix shape error
-
Committed by pangyoki
-
Committed by Yuang Liu
* softmax mask fuse upper triangle * cover not implemented cpu code
-
- 09 Jul, 2021 2 commits
-
-
Committed by arlesniak
* Use CBLAS for SelectedRows elementwise add operation. It's faster. * template compilation fix * reverted template compilation fix * slimmed template compilation fix Co-authored-by: Adam Osewski <adam.osewski@intel.com>
-
Committed by Leo Chen
* fix vector overflow * refine code * refine ut
-
- 08 Jul, 2021 1 commit
-
-
Committed by 王明冬
-
- 07 Jul, 2021 8 commits
-
-
Committed by 王明冬
-
Committed by pangyoki
-
Committed by Leo Chen
* NpuOpRunner supports host tensor as input * fix compile issue
-
Committed by xiayanming
-
Committed by jakpiase
* added prelu bf16/fp32 fwd/bwd kernel
-
Committed by jakpiase
* added checking if md uses blocking format * minor change * removed unnecessary line
-
Committed by taixiurong
-
Committed by sunli
-
- 06 Jul, 2021 5 commits
-
-
Committed by Zeng Jinle
* add gpu implementation of shuffle batch test=develop * add thrust cuda patches test=develop * fix macro guard * fix shuffle batch compile on windows/hip * fix hip compilation error * refine CMakeLists.txt * fix windows compile error * try to fix windows CI compilation error * fix windows compilation again * fix shuffle_batch op test on Windows
-
Committed by Wangzheee
-
Committed by xiaoting
* fix interpolate for shape[i]=0, test=develop * fix test_trilinear_interp_v2 random failure, test=develop
-
Committed by Lijunhui
-
Committed by Qi Li
-