- 11 1月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 10 1月, 2021 1 次提交
-
-
由 wangchaochaohu 提交于
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
-
- 08 1月, 2021 4 次提交
-
-
由 Zhen Wang 提交于
* add cast ops before and after unsupported fp16 ops. * Keep partial net in FP32 pattern. * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode. * Add fp16 support for adam op. * add multi precision attr for adam. * Fix the bug of test_multi_precision_fp16_train UT. * Code format for CI. * Fix the redefine error about MPTypeTrait on windows. * fix bugs of the _create_accumulators func in Momentum. * fix bug when inserting post cast op. * Add the update_loss_scaling op in allow_set of UnusedVarCheck. * Update for ci coverage. * Add some doc for OptimizerWithMixedPrecision. * Fix the code style. * Imporve the doc of `amp_init`. * Change for fp16 testing if users have the infer program defined in separate way.
-
由 Leo Chen 提交于
-
由 Leo Chen 提交于
* change to tensor copy sync * change to tensor copy sync * make copy_to safe when use TensorCopy * refine code * add ut * add cudapinned garbagecollector * add testcase: cpu place -> cuda pinned place
-
由 Chengmo 提交于
* add tensor table
-
- 07 1月, 2021 3 次提交
-
-
由 Huihuang Zheng 提交于
Improve some error messages in parallel_executor.cc, conditional_block_op.cc, recurrent_op.cc
-
由 Chen Weihang 提交于
* simplify prepared op impl to improve performance * fix kunlun compile error * continue fix kunlun compile error * only transform diff place when dtype diff * fix failed unittests * remove useless file * polish impl by review comment
-
由 liuyuhui 提交于
-
- 06 1月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 05 1月, 2021 2 次提交
-
-
由 liuyuhui 提交于
-
由 Thunderbrook 提交于
* add topo aware * resource.h * topo aware * format
-
- 04 1月, 2021 2 次提交
-
-
由 WangXi 提交于
-
由 Shang Zhizhou 提交于
* fix op version checker of pass bug * fix code style * update pass version
-
- 31 12月, 2020 1 次提交
-
-
由 cc 提交于
* Add mkldnn nearest_interp and bilinear_interp op * don't run mkldnn interpolate in default * add interpolate_mkldnn_pass
-
- 30 12月, 2020 3 次提交
-
-
由 wawltor 提交于
* add the support the op version check for matmul, test=op_version
-
由 wawltor 提交于
* add the op version check for the elementwise ops, test=op_version * add the support check for elementwise_ops, test=op_version
-
由 Leo Chen 提交于
* add debug code * add place info * fix compile problem * add place for output
-
- 29 12月, 2020 3 次提交
-
-
由 wawltor 提交于
change the elementwise ops version check, test=op_version
-
由 Thunderbrook 提交于
-
由 cc 提交于
* map matmul/squeeze2+matmul/reshape2+matmul to mul
-
- 28 12月, 2020 4 次提交
- 27 12月, 2020 1 次提交
-
- 26 12月, 2020 1 次提交
-
-
由 liuyuhui 提交于
-
- 25 12月, 2020 4 次提交
-
-
由 YUNSHEN XIE 提交于
-
由 Chen Weihang 提交于
* try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments
-
由 Leo Chen 提交于
-
由 QingshuChen 提交于
* feat: support check_nan_inf for kunlun device * support kunlun stack * minor
-
- 24 12月, 2020 2 次提交
-
-
由 tangwei12 提交于
* oneps (3/4) Co-authored-by: NMrChengmo <cmchengmo@163.com> Co-authored-by: Nmalin10 <malin10@baidu.com> Co-authored-by: Nchengmo <chengmo@baidu.com>
-
由 jakpiase 提交于
-
- 23 12月, 2020 2 次提交
-
-
由 YUNSHEN XIE 提交于
* remove duplicate ut reload * remove duplicate ut define in cmakelist
-
由 Thunderbrook 提交于
* add heter box * add trainer, worker, wrapper... * format * for ci * format * remove boost get * boost & copyright * rename * rename * format * format * format Co-authored-by: Nyaoxuefeng6 <yaoxuefeng@baidu.com>
-
- 22 12月, 2020 1 次提交
-
-
由 Jacek Czaja 提交于
* - Tensor copy fix to oneDNN tensors * - Fixes after review
-
- 21 12月, 2020 1 次提交
-
-
由 Leo Chen 提交于
-
- 18 12月, 2020 1 次提交
-
-
由 石晓伟 提交于
-
- 17 12月, 2020 2 次提交
-
-
由 WangXi 提交于
-
由 Chen Weihang 提交于
* add complex real op & api & unittest * add imag op & api & unittest * refactor op impl * revert simplify writing due to complile failed * polish details * polish grad op code
-