- 09 May, 2023: 1 commit
Committed by zhangkaihuo
- 08 May, 2023: 1 commit
Committed by 张春乔
- 27 April, 2023: 1 commit
Committed by Zhang Ting
* support OD level and skip dynamic loss scaling for bf16
- 24 April, 2023: 3 commits
Committed by niuliling123
Committed by 张春乔
Committed by Zhang Ting
- 18 April, 2023: 1 commit
Committed by Zhang Ting
- 14 April, 2023: 1 commit
Committed by Yiqun Liu
* Unify the static amp codes of fp16 and bf16.
* Polish apis and add unittest.
* Add operator stats collecting tools for program.
* Add the check of number of bfloat16 operators in unittest.
* Add warning for operator not supported for amp.
* Add testing of BF16 O1 and O2.
- 13 April, 2023: 1 commit
Committed by niuliling123
- 12 April, 2023: 2 commits
- 10 April, 2023: 1 commit
Committed by Zhang Ting
* support set master_grad
* move register_hook to auto_cast
* update unittest
* fix fp16 test
* update for review comments
- 06 April, 2023: 1 commit
Committed by Kim Yann
* rem is_compiled_with_npu
* rem npu related code
* make lint happy
* rem test
* remove some tests
* Update grad_scaler.py
* fix an error
- 03 April, 2023: 1 commit
Committed by Kim Yann
* rem is_compiled_with_mlu
* fix some mlu_place and mlu_device_count
* make lint happy
- 30 March, 2023: 3 commits
Committed by Yiqun Liu
* [AMP] Add python API for collecting operator stats.
* Fix import and polish codes.
* Add more unittest.
* Add doc for the new APIs.
Committed by Zhang Ting
Committed by wanghuancoder
* delete old dygraph op test
- 15 March, 2023: 1 commit
Committed by wanghuancoder
* refine _found_inf
- 09 March, 2023: 1 commit
Committed by Ghost Screaming
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong.
* Remove climits.
* Fix bug of hybrid parallel strategy with recompute using bf16.
* Fix bug of recompute_hybrid ctx.amp_dtype
* Fix bug of amp_dtype.
* Fix bug of auto_cast.
- 24 February, 2023: 1 commit
Committed by Weilong Wu
* Revert "fixoptminizer _set_auxiliary_var bug (#50335)" This reverts commit c44005f0.
* Revert "refine optimizer create accumulators (#50188)" This reverts commit 244e7546.
* Revert "fix found_inf bug for custom optimizer (#50158)" This reverts commit 64573f9f.
* Revert "refine amp scaler found_inf (#49864)" This reverts commit 382e9a06.
* fix code format
* fix conflict
- 13 February, 2023: 1 commit
Committed by Ryan
test=docoument_fix
- 03 February, 2023: 1 commit
Committed by wanghuancoder
- 30 January, 2023: 1 commit
Committed by wanghuancoder
* refine _found_inf
- 19 January, 2023: 1 commit
Committed by jameszhang
* [KUNLUN] add op: maxpool_with_index
* use DeviceContext::Alloc() instead of DenseTensor::mutable_data()
* fix file format
* solve clip unittest failure
* minor fix
* Revert "solve clip unittest failure" since the issue is fixed in #49535 This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.
* align with xdnn on the definition of mask in max_pool_with_index
* minor
- 12 January, 2023: 1 commit
Committed by zhangkaihuo
- 11 January, 2023: 1 commit
Committed by niuliling123
- 06 January, 2023: 1 commit
Committed by niuliling123
- 05 January, 2023: 1 commit
Committed by zhangkaihuo
- 15 December, 2022: 1 commit
Committed by mjxs
* APIs involved: paddle.amp.decorate, paddle.static.npu_places, paddle.signal.istft, paddle.signal.stft, paddle.linalg.eigvalsh, paddle.randint_like
* change signal.stft
* mark the low parameter of randint_like as optional
* ; test=docs_preview
* revise the annotation format; test=docs_preview
* revise the formula format
* revise models, etc. in decorate
* test=document_fix
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
- 29 November, 2022: 1 commit
Committed by Nyakku Shigure
* isort all files
* revert conflicting files
* revert conflicting files
* revert conflicting files
- 03 November, 2022: 1 commit
Committed by Nyakku Shigure
* [CodeStyle][py2][U008] remove unnecessary args in `super()`
* remove remained args
* revert changes in test_pylayer_op
* Revert "revert changes in test_pylayer_op" This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4.
* revert some changes in example code
- 23 October, 2022: 1 commit
Committed by Nyakku Shigure
* update config
* re-blacken python code
* temporarily disable date and diff_py_file
* skip a format
- 14 September, 2022: 2 commits
Committed by Nyakku Shigure
* trim trailing whitespace
* fix `.cmake-format.py`
* revert npu ut changes, avoid npu ci error
Committed by zhangbo9674
* support bfloat16 for amp_decorate
* add check_finite for bf16
* fix bug
* add ut
* add ut
* refine code
- 05 June, 2022: 1 commit
Committed by Sing_chan
* use yapf to format all python file
* yapf exclude two unittests file for they rely on writing and reading file, and format will break them
* disable diff_py_file because too many diff files cause command following failed
- 09 May, 2022: 1 commit
Committed by Liyulingyue
* auto_cast; test=document_fix
* static.save; test=document_fix
* cuda_places; test=document_fix
- 07 March, 2022: 1 commit
Committed by zhangbo9674
* refine amp.decorate code example
* refine code
- 18 February, 2022: 1 commit
Committed by zhangbo9674
* support dtype param for auto_cast
* add amp_dtype for tracer
* add unsupported bf16 list
* support bf16 amp for O2
* refine python interface for bfloat16
* refine code
* refine code
* refine unittest
* refine code
* refine code
* add bf16 o1
* refine code by comment
* add gradient accumulator
* add recompute
- 29 November, 2021: 1 commit
Committed by zhangbo9674
* amp.decorate optimizers set to None is ok
* refine unittest
* add unittest and refine example code
* refine unittest
- 21 October, 2021: 1 commit
Committed by zhangbo9674