- 25 Apr, 2023 1 commit
-
-
Committed by niuliling123
Add enable_tensor_checker, disable_tensor_checker API (#52936)
-
- 14 Apr, 2023 1 commit
-
-
Committed by Yiqun Liu
* Unify the static amp codes of fp16 and bf16. * Polish apis and add unittest. * Add operator stats collecting tools for program. * Add the check of number of bfloat16 operators in unittest. * Add warning for operator not supported for amp. * Add testing of BF16 O1 and O2.
-
- 13 Apr, 2023 1 commit
-
-
Committed by niuliling123
-
- 12 Apr, 2023 2 commits
- 10 Apr, 2023 1 commit
-
-
Committed by Zhang Ting
* support set master_grad * move register_hook to auto_cast * update unittest * fix fp16 test * update for review comments
-
- 06 Apr, 2023 1 commit
-
-
Committed by Kim Yann
* rem is_compiled_with_npu * rem npu related code * make lint happy * rem test * remove some tests * Update grad_scaler.py * fix an error
-
- 03 Apr, 2023 1 commit
-
-
Committed by Kim Yann
* rem is_compiled_with_mlu * fix some mlu_place and mlu_device_count * make lint happy
-
- 30 Mar, 2023 3 commits
-
-
Committed by Yiqun Liu
* [AMP] Add python API for collecting operator stats. * Fix import and polish codes. * Add more unittest. * Add doc for the new APIs.
-
Committed by Zhang Ting
-
Committed by wanghuancoder
* delete old dygraph op test
-
- 15 Mar, 2023 1 commit
-
-
Committed by wanghuancoder
* refine _found_inf
-
- 09 Mar, 2023 1 commit
-
-
Committed by Ghost Screaming
* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result is wrong. * Remove climits. * Fix bug of hybrid parallel strategy with recompute using bf16. * Fix bug of recompute_hybrid ctx.amp_dtype * Fix bug of amp_dtype. * Fix bug of auto_cast.
-
- 24 Feb, 2023 1 commit
-
-
Committed by Weilong Wu
* Revert "fixoptminizer _set_auxiliary_var bug (#50335)" This reverts commit c44005f0. * Revert "refine optimizer create accumulators (#50188)" This reverts commit 244e7546. * Revert "fix found_inf bug for custom optimizer (#50158)" This reverts commit 64573f9f. * Revert "refine amp scaler found_inf (#49864)" This reverts commit 382e9a06. * fix code format * fix conflict
-
- 13 Feb, 2023 1 commit
-
-
Committed by Ryan
test=document_fix
-
- 03 Feb, 2023 1 commit
-
-
Committed by wanghuancoder
-
- 30 Jan, 2023 1 commit
-
-
Committed by wanghuancoder
* refine _found_inf
-
- 19 Jan, 2023 1 commit
-
-
Committed by jameszhang
* [KUNLUN] add op: maxpool_with_index * use DeviceContext::Alloc() instead of DenseTensor::mutable_data() * fix file format * solve clip unittest failure * minor fix * Revert "solve clip unittest failure" since the issue is fixed in #49535 This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b. * align with xdnn on the definition of mask in max_pool_with_index * minor
-
- 12 Jan, 2023 1 commit
-
-
Committed by zhangkaihuo
-
- 11 Jan, 2023 1 commit
-
-
Committed by niuliling123
-
- 06 Jan, 2023 1 commit
-
-
Committed by niuliling123
-
- 05 Jan, 2023 1 commit
-
-
Committed by zhangkaihuo
-
- 15 Dec, 2022 1 commit
-
-
Committed by mjxs
* APIs involved: paddle.amp.decorate paddle.static.npu_places paddle.signal.istft paddle.signal.stft paddle.linalg.eigvalsh paddle.randint_like * change signal.stft * make low optional in randint_like * ; test=docs_preview * revise the annotation format; test=docs_preview * revise the formula format * revise models etc. in decorate * test=document_fix Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
-
- 29 Nov, 2022 1 commit
-
-
Committed by Nyakku Shigure
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
- 03 Nov, 2022 1 commit
-
-
Committed by Nyakku Shigure
* [CodeStyle][py2][U008] remove unnecessary args in `super()` * remove remained args * revert changes in test_pylayer_op * Revert "revert changes in test_pylayer_op" This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4. * revert some changes in example code
-
- 23 Oct, 2022 1 commit
-
-
Committed by Nyakku Shigure
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 14 Sep, 2022 2 commits
-
-
Committed by Nyakku Shigure
* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error
-
Committed by zhangbo9674
* support bfloat16 for amp_decorate * add check_finite for bf16 * fix bug * add ut * add ut * refine code
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
* use yapf to format all python files * exclude two unittest files from yapf because they rely on writing and reading files, and formatting would break them * disable diff_py_file because too many diff files cause the following command to fail
-
- 09 May, 2022 1 commit
-
-
Committed by Liyulingyue
* auto_cast; test=document_fix * static.save; test=document_fix * cuda_places; test=document_fix
-
- 07 Mar, 2022 1 commit
-
-
Committed by zhangbo9674
* refine amp.decorate code example * refine code
-
- 18 Feb, 2022 1 commit
-
-
Committed by zhangbo9674
* support dtype param for auto_cast * add amp_dtype for tracer * add unsupported bf16 list * support bf16 amp for O2 * refine python interface for bfloat16 * refine code * refine code * refine unittest * refine code * refine code * add bf16 o1 * refine code by comment * add gradient accumulator * add recompute
-
- 29 Nov, 2021 1 commit
-
-
Committed by zhangbo9674
* amp.decorate optimizers set to None is ok * refine unittest * add unittest and refine example code * refine unittest
-
- 21 Oct, 2021 1 commit
-
-
Committed by zhangbo9674
-
- 22 Sep, 2021 1 commit
-
-
Committed by zhangbo9674
* split minimize() to step() + update() * add unscale and step for grad_scaler * add unittest * refine code in minimize * delete step in loss_scaler * fix example bug * refine comment * refine unittest * add unittest
-
- 17 Sep, 2021 1 commit
-
-
Committed by zhangbo9674
* add pure fp16 major function in auto_cast & tracer * support master weight in dygraph for pure fp16 * check mix dtype of fp16&fp32 for check_finite_and_unscale op * change pure fp16 function name * refine some bug in auto_cast * refine auto_cast interface logic * add param _casted_by_pure_fp16 for class Layer * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator * refine pure_fp16_decorator as decorator * add unittest * add comment * add comment * support recompute * add comment for auto_cast and decorator * support to_static_state_dict for paddle.jit.save * unlimit models num and optimizers num * add lookup_table in black_list * fix momentum and layer state_dict * fix bug in layer state_dict * fix bug in layer state_dict_helper * refine unittest * refine test_momentun_op * refine interface and some code * refine amp_decorator interface * refine pure fp16 interface * refine master weight interface
-
- 16 Aug, 2021 1 commit
-
-
Committed by Leo Chen
* dygraph amp support param_group * remove unused code * fix doc
-
- 11 Aug, 2021 1 commit
-
-
Committed by zhangbo9674
* add state_dict and load_state_dict and unittest for class GradScaler * refine unittest for coverage of load_state_dict * refine comments of code-block * refine some comments * refine state_dict code and unittest * add #require gpu, xpu for GradScaler get/set example code * add #require gpu, xpu for GradScaler get/set example code * refine example code * refine unittest for state_dict * refine unittest for state_dict * fix bug of DataLoader in TestGradScalerStateDict * add flag FLAGS_cudnn_deterministic
-
- 01 Jul, 2021 1 commit
-
-
Committed by zhangbo9674
* add get and set for Grad_scaler * refine some API name and comments * refine API name and comments * refine some comments
-
- 11 Jun, 2021 1 commit
-
-
Committed by zhiboniu
* update 2.0 public api in all left files * reverse device.py all list; fix some flake8 errors
-