- 31 8月, 2021 8 次提交
-
-
由 Yuang Liu 提交于
[cherry-pick][hybrid performance] optim the grad fuse for pipeline mode by sorting the grad by dtype (#35070) (#35300)
-
由 Yuang Liu 提交于
[cherry-pick][hybrid performance] Grad fuse for gradient merge under pipeline mode (#35004) (#35299)
-
由 Roc 提交于
Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
-
由 Roc 提交于
Co-authored-by: NWangXi <wangxi16@baidu.com>
-
由 Roc 提交于
Co-authored-by: NWangXi <wangxi16@baidu.com>
-
由 Roc 提交于
Co-authored-by: NWangXi <wangxi16@baidu.com>
-
由 Yuang Liu 提交于
[cherry-pick][Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16 param to the optimizer (#34965) (#35296) Co-authored-by: NWangXi <wangxi16@baidu.com>
-
由 Yuang Liu 提交于
Co-authored-by: NWangXi <wangxi16@baidu.com>
-
- 18 8月, 2021 2 次提交
-
-
由 Guoxia Wang 提交于
* support class center sample of PartialFC
-
由 Wangzheee 提交于
* unitest_quant_dequant * fix * fix * deleted: test_trt_quant_conv2d_dequant_fuse_pass.py * fix
-
- 17 8月, 2021 8 次提交
-
-
由 Roc 提交于
-
由 Aganlengzi 提交于
-
由 WeiXin 提交于
* polish unittest. * polish code * polish code
-
由 shangliang Xu 提交于
* [bug fix] fix unfold negative_size_param
-
由 Hui Zhang 提交于
* dygraph support more ctc grad scale * scale for 1.x * fix unitest * fix unitest * format code * fix unittest * fix log info * unittest cov * fix format;notest,test=cpu,coverage * skip ctc_loss egs;test=cpu * warpctc grad cov;test=coverage * add dygraph test;test=coverage * format;test=cpu,coverage * format;test=cpu * add api compat;test=cpu * add cpu test * rename * rename * fix * fix test * format * eigen cpu * eigen gpu grad pass * cuda gpu pass * format * fix ci
-
由 Zeng Jinle 提交于
* add inplace passes and tests * update * fix use_cuda undefined fix compile error of op compat * add more ut * fix CPU CI error * check adam unique * fix mac/windows ci, improve coverage * fix ci error * follow weihang's comment * fix BlockDesc::MoveFrom * follow qiuliang's comment * update * follow huihuang's comments
-
由 zhiboniu 提交于
-
由 Kaipeng Deng 提交于
* fix drop_last not work in IterableDataset. test=develop
-
- 16 8月, 2021 14 次提交
-
-
由 Li Min 提交于
* Fix typos in english docs for diag and diagflat.
-
由 veyron95 提交于
* [NPU] Support npu op:(1)arg_min (2)arg_max * Modify and add unit test cases * Modify unit test cases
-
由 0x45f 提交于
* add size npu op * modify support data type * no longer use NPU size OP * remove useless comments, add test case * fix copyright, remove useless include
-
由 Fan Zhang 提交于
-
由 zhangchunle 提交于
-
由 Qi Li 提交于
-
由 From00 提交于
* Add NPU kernel for nearest_interp op * Add grad op * Modify codes according to the review comments * Modify codes according to the review comments
-
由 duanboqiang 提交于
* add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * add unique_consecutive_op * remove unity build * add unique_consecutive op * add unique_consecutive op * add enable static * add noqa * add space line * add default case. * add comma * add space line * modify unique_consecutive unittest * optimize ut coverage * rebase develop * improve coverage * update en docs * update en docs * update en docs * update en docs * update en docs * update en doc
-
由 Leo Chen 提交于
* dygraph amp support param_group * remove unused code * fix doc
-
由 Guoxia Wang 提交于
* support margin loss (arcface, cosface, sphereface)
-
由 zyfncg 提交于
* Support NPU OP hard_swish and hard_swish_grad * Support NPU OP hard_swish and hard_swish_grad * add the unittest to compare the result between npu ans cpu * format the prompt of exception * replace Min and Max op by ClipByValue op * fix the precision problem for fp16 * Using HardtanhGrad to improve performace
-
由 shangliang Xu 提交于
* fix dice_loss bug
-
由 Zhanlue Yang 提交于
-
由 ronnywang 提交于
* add p_norm_op_npu * remove p_norm_grad op * update
-
- 14 8月, 2021 1 次提交
-
-
由 WangXi 提交于
-
- 13 8月, 2021 7 次提交
-
-
由 Tongxin Bai 提交于
* OP dot: refactor CPU kernels and get better loop performance. * Minor fix on code format. * Fixed minor errors. * Add new API: einsum * Update the Einsum unit test. One case failed with matmul_v2, where the dtype is int64: a = np.arange(2 * 3 * 1).reshape(2, 3, 1) b = np.arange(1) paddle.einsum("...i, ...i", a, b) * Test cases in test_einsum test floating point dtypes only. As of now Paddle only supports float/double dtypes in matmul, which is one of building blocks of this Einsum implementation. We decide not to test einsum against other dtypes. * Polish format. * More formatting. * Format... * Einsum: improve test coverage. * Einsum: bug fixes and more testcases for testing error messages * Einsum: fix format.. * Einsum: fixed typo and format. * Einsum: format again... * Einsum: applied suggested changes. * Einsum API: improve API documentation. * Einsum API: apply suggested changes. * Einsum API: Add dygraph only note. * Einsum API: Add dygraph only note. * Einsum API: fixed unittest.
-
由 zyfncg 提交于
-
由 zyfncg 提交于
* Fix a bug : can't load more than one custom op module * Fix a bug : can't load more than one custom op module * add test for load multiple modules of custom c++ op * add config for Coverage CI
-
由 Qi Li 提交于
-
由 ShenLiang 提交于
* fix bug of py36 import
-
由 Baibaifan 提交于
-
由 andyjpaddle 提交于
-