- 16 8月, 2021 4 次提交
-
-
由 zyfncg 提交于
* Support NPU OP hard_swish and hard_swish_grad * Support NPU OP hard_swish and hard_swish_grad * add the unittest to compare the result between npu ans cpu * format the prompt of exception * replace Min and Max op by ClipByValue op * fix the precision problem for fp16 * Using HardtanhGrad to improve performace
-
由 shangliang Xu 提交于
* fix dice_loss bug
-
由 Zhanlue Yang 提交于
-
由 ronnywang 提交于
* add p_norm_op_npu * remove p_norm_grad op * update
-
- 14 8月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 13 8月, 2021 6 次提交
-
-
由 Tongxin Bai 提交于
* OP dot: refactor CPU kernels and get better loop performance. * Minor fix on code format. * Fixed minor errors. * Add new API: einsum * Update the Einsum unit test. One case failed with matmul_v2, where the dtype is int64: a = np.arange(2 * 3 * 1).reshape(2, 3, 1) b = np.arange(1) paddle.einsum("...i, ...i", a, b) * Test cases in test_einsum test floating point dtypes only. As of now Paddle only supports float/double dtypes in matmul, which is one of building blocks of this Einsum implementation. We decide not to test einsum against other dtypes. * Polish format. * More formatting. * Format... * Einsum: improve test coverage. * Einsum: bug fixes and more testcases for testing error messages * Einsum: fix format.. * Einsum: fixed typo and format. * Einsum: format again... * Einsum: applied suggested changes. * Einsum API: improve API documentation. * Einsum API: apply suggested changes. * Einsum API: Add dygraph only note. * Einsum API: Add dygraph only note. * Einsum API: fixed unittest.
-
由 zyfncg 提交于
-
由 zyfncg 提交于
* Fix a bug : can't load more than one custom op module * Fix a bug : can't load more than one custom op module * add test for load multiple modules of custom c++ op * add config for Coverage CI
-
由 Qi Li 提交于
-
由 Baibaifan 提交于
-
由 andyjpaddle 提交于
-
- 12 8月, 2021 6 次提交
-
-
由 Qi Li 提交于
-
由 zhouweiwei2014 提交于
-
由 zhulei 提交于
* Fix safety-bug of functional.linear * Fix safety-bug of functional.linear * Fix safety-bug of functional.linear * Fix safety-bug of functional.linear
-
由 ShenLiang 提交于
* add recompute for pp * add recompute offload * add recompute partition
-
由 wuhuachaocoding 提交于
-
由 Fan Zhang 提交于
* [NPU] Support npu op expand_v2 and expand_v2_grad * [NPU] Support npu op expand_v2 and expand_v2_grad * [NPU] Support npu op expand_v2 and expand_v2_grad * update test_expand_v2_op_npu.py * update test_expand_v2_op_npu.py * modify expand_v2_op_npu.cc * modify expand_v2_op_npu.cc
-
- 11 8月, 2021 14 次提交
-
-
由 zhangbo9674 提交于
* add state_dict and load_state_dict and unittest for class GradScaler * refine unittest for coverage of load_state_dict * refine comments of code-block * refine some comments * refine state_dict code and unittest * add #require gpu, xpu for GradScaler get/set example code * add #require gpu, xpu for GradScaler get/set example code * refine example code * refine unittest for state_dict * refine unittest for state_dict * fix bug of DataLoader in TestGradScalerStateDict * add flag FLAGS_cudnn_deterministic
-
由 WeiXin 提交于
* add set_value_grad op * add unittest. * polish unittest. * polish code. * support cuda kernel * polish code according to CI * polish code. * polish code * remove *.pyc * polish code. * add unittest to improve coverage. * polish code.
-
由 Fan Zhang 提交于
-
由 pangyoki 提交于
* add while read_from_array write_to_array npu op * optimize unittest
-
由 Roc 提交于
-
由 ronnywang 提交于
* add momentum_op_npu and test * update * fix hang
-
由 ronnywang 提交于
* add reduce_mean_op_npu and test * remove skip.If * update
-
由 ronnywang 提交于
* add batch_norm_op_npu and tests * remove skip.If * fix bug
-
由 WangXi 提交于
-
由 lilong12 提交于
* add auto_parallel apis
-
由 ShenLiang 提交于
* add save/load for pipelineparallel * add save/load
-
由 0x45f 提交于
* add exp and exp_grad npu op * modify support register type * remove empty line and remove exp_grad support data type int/int64 * move exp and epx_grad kernel to activation_op_npu.cc, delete attrs * move code to activation_op_npu.cc
-
由 andyjpaddle 提交于
-
由 From00 提交于
* Add NPU kernel for TopKV2 op * deleted unnecessary cache file static_mode_white_list.cpython-37.pyc * A draft for error checking * A commit with accuracy error for float32 data * Modify codes according to the review comments * Modify codes according to the review comments
-
- 10 8月, 2021 8 次提交
-
-
由 Liu-xiandong 提交于
* fix npu compile error, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * [NPU] Support npu kernel for flatten_contiguous_range op, test=develop * Update flatten_op_npu.cc * Update flatten_op_npu.cc Co-authored-by: Nqili93 <qili93@qq.com>
-
由 Aganlengzi 提交于
* [NPU] add squared_l2_norm squared_l2_norm and tests * [NPU] replace Square&ReduceSumD with SquareSumV1
-
由 zyfncg 提交于
* Support npu kernel for fill_any_like op * modify the description of exception * remove useless template element * remove useless decorator * fix the code format error
-
由 andyjpaddle 提交于
* fix npu compile error, test=develop * add fill constant batch size lilke op npu,test=develop Co-authored-by: Nqili93 <qili93@qq.com>
-
由 XGZhang 提交于
-
由 chenjian 提交于
* Support npu kernel for tile op * modify according to the comments * fix compute function
-
由 chenjian 提交于
* Support npu kernel for expand_as_v2 op * mofify the registry data type name * fix test unit * fix npu compile error, test=develop * fix compute function Co-authored-by: Nqili93 <qili93@qq.com>
-
由 Linjie Chen 提交于
* Fix error of HSigmoidLoss * update unittest * update unittest
-
- 09 8月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
-