1. 16 Jun 2022, 1 commit
    • [Cherry-pick] Fix numpy 1.20+ deprecation warnings (#43513) · 689e0999
      Authored by Qi Li
      * Fix numpy 1.20+ deprecation warnings (#42929)
      
      * Replace np.bool/np.bool8 with np.bool_
      
      * Replace np.object with np.object_
      
      * Replace np.complex with np.complex128
      
      * Replace np.float with np.float64
      
      * Replace np.int with np.int_
      
      * Rerun pre-commit for newer pre-commit configuration
      
      * Use builtin bool instead of np.bool_ based on the context
      
      * fix mode dtype
      Co-authored-by: zlsh80826 <rewang@nvidia.com>
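      The alias replacements listed above, in one place; a minimal sketch assuming NumPy >= 1.20, where the deprecated top-level aliases warn (and are removed in later releases):

          import numpy as np

          # Deprecated aliases and the replacements this PR applies:
          a = np.array([True, False], dtype=np.bool_)   # was np.bool / np.bool8
          b = np.array(["x", 1], dtype=np.object_)      # was np.object
          c = np.zeros(2, dtype=np.complex128)          # was np.complex
          d = np.zeros(2, dtype=np.float64)             # was np.float
          e = np.zeros(2, dtype=np.int_)                # was np.int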
  2. 25 Mar 2022, 2 commits
    • fix sync_bn error in fp16 amp-o2 (#40943) · 9ab3c76b
      Authored by zhangbo9674
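      A hedged sketch of the scenario this fix targets: a SyncBatchNorm layer decorated for AMP level O2 (pure fp16). The model and optimizer here are illustrative; a GPU is assumed, and actually syncing statistics additionally requires an initialized distributed environment:

          import paddle

          # Illustrative model containing SyncBatchNorm.
          model = paddle.nn.Sequential(
              paddle.nn.Conv2D(3, 8, 3),
              paddle.nn.SyncBatchNorm(8),
          )
          opt = paddle.optimizer.Momentum(parameters=model.parameters())
          # Under level='O2', decorate() casts parameters to fp16; the fix
          # makes SyncBatchNorm handle the resulting fp16/fp32 mix correctly.
          model, opt = paddle.amp.decorate(models=model, optimizers=opt, level='O2')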
    • Refactor Dygraph Flags (#40786) · 3085d5e4
      Authored by Jiabin Yang
      * refactor eager flags
      
      * fix flags error when we switch from eager to dygraph
      
      * fix ci problem
      
      * fix ci
      
      * fix ci
      
      * merge develop and fix code style
      
      * merge develop and fix code style
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * merge develop
  3. 16 Mar 2022, 1 commit
  4. 15 Mar 2022, 1 commit
  5. 07 Mar 2022, 1 commit
  6. 28 Feb 2022, 1 commit
  7. 27 Feb 2022, 1 commit
  8. 23 Feb 2022, 1 commit
  9. 22 Feb 2022, 1 commit
  10. 18 Feb 2022, 1 commit
    • [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
      Authored by zhangbo9674
      * support dtype param for auto_cast
      
      * add amp_dtype for tracer
      
      * add unsupported bf16 list
      
      * support bf16 amp for O2
      
      * refine python interface for bfloat16
      
      * refine code
      
      * refine code
      
      * refine unittest
      
      * refine code
      
      * refine code
      
      * add bf16 o1
      
      * refine code by comment
      
      * add gradient accumulator
      
      * add recompute
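      What the new interface looks like; a minimal sketch assuming a GPU with bfloat16 support (Ampere or newer), with an illustrative model:

          import paddle

          model = paddle.nn.Linear(4, 4)
          x = paddle.randn([2, 4])
          # auto_cast gains a dtype parameter, so O1/O2 AMP can run in
          # bfloat16 instead of the default float16.
          with paddle.amp.auto_cast(level='O1', dtype='bfloat16'):
              out = model(x)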
  11. 11 Jan 2022, 1 commit
  12. 29 Dec 2021, 1 commit
  13. 28 Dec 2021, 1 commit
  14. 27 Dec 2021, 1 commit
  15. 15 Dec 2021, 1 commit
  16. 02 Dec 2021, 1 commit
  17. 29 Nov 2021, 1 commit
  18. 24 Nov 2021, 1 commit
    • [Dy2stat] support pure fp16 for dy2stat (#36944) · 52edad6a
      Authored by 0x45f
      * run dy2stat pure fp16 in Linear model
      
      * do not use self._pure_fp16_inputs
      
      * add test and fix Adam error in dy2stat pure fp16 training
      
      * use paddle.optimizer.Adam
      
      * run test in gpu
      
      * change test time for CI
      
      * enlarge atol for test_resnet_pure_fp16
      
      * refine code and enlarge atol
      
      * make custom_white_list and custom_black_list take effect for AMP and pure fp16
      
      * check tracer is not None
      
      * use default atol
      
      * change filter_size
      
      * change atol and add some NOTE
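      A minimal sketch of dy2stat combined with pure fp16 (O2) training as enabled here; the tiny model and shapes are illustrative and a GPU is assumed:

          import paddle

          model = paddle.jit.to_static(paddle.nn.Linear(16, 16))  # dy2stat
          opt = paddle.optimizer.Adam(parameters=model.parameters())
          model = paddle.amp.decorate(models=model, level='O2')   # pure fp16
          scaler = paddle.amp.GradScaler(init_loss_scaling=32768)

          x = paddle.randn([4, 16])
          with paddle.amp.auto_cast(level='O2'):
              loss = model(x).mean()
          scaled = scaler.scale(loss)
          scaled.backward()
          scaler.minimize(opt, scaled)
          opt.clear_grad()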
  19. 09 Nov 2021, 1 commit
  20. 22 Oct 2021, 1 commit
    • [hapi] support dygraph amp O2 (#36441) · 08248db0
      Authored by Leo Chen
      * [hapi] support dygraph amp O2
      
      * fix problem of static pure fp16 in hapi
      
      * fix bug
      
      * fix format
      
      * fix ut
      
      * follow comments
      
      * update ut
      
      * update amp save/load
      
      * fix ut
      
      * refine code format
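      A sketch of the high-level (hapi) interface after this change: Model.prepare takes amp_configs, so dygraph training can run at AMP level O2. The model and loss are illustrative and a GPU is assumed:

          import paddle

          net = paddle.nn.Linear(10, 2)
          model = paddle.Model(net)
          opt = paddle.optimizer.Adam(parameters=model.parameters())
          # amp_configs accepts 'O1'/'O2' or a dict containing a 'level' key.
          model.prepare(optimizer=opt,
                        loss=paddle.nn.CrossEntropyLoss(),
                        amp_configs={'level': 'O2'})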
  21. 13 Oct 2021, 2 commits
  22. 22 Sep 2021, 2 commits
  23. 17 Sep 2021, 1 commit
    • [AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d
      Authored by zhangbo9674
      * add pure fp16 major function in auto_cast & tracer
      
      * support master weight in dygraph for pure fp16
      
      * check mixed dtypes of fp16 & fp32 for the check_finite_and_unscale op
      
      * change pure fp16 function name
      
      * fix some bugs in auto_cast
      
      * refine auto_cast interface logic
      
      * add param _casted_by_pure_fp16 for class Layer
      
      * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator
      
      * refine pure_fp16_decorator as decorator
      
      * add unittest
      
      * add comment
      
      * add comment
      
      * support recompute
      
      * add comment for auto_cast and decorator
      
      * support to_static_state_dict for paddle.jit.save
      
      * remove the limit on the number of models and optimizers
      
      * add lookup_table in black_list
      
      * fix momentum and layer state_dict
      
      * fix bug in layer state_dict
      
      * fix bug in layer state_dict_helper
      
      * refine unittest
      
      * refine test_momentum_op
      
      * refine interface and some code
      
      * refine amp_decorator interface
      
      * refine pure fp16 interface
      
      * refine master weight interface
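      The pieces above compose into the canonical pure-fp16 (O2) dygraph loop; a hedged sketch with an illustrative model and shapes, GPU assumed:

          import paddle

          model = paddle.nn.Linear(16, 16)
          # multi_precision keeps an fp32 master weight per fp16 parameter.
          opt = paddle.optimizer.Momentum(parameters=model.parameters(),
                                          multi_precision=True)
          # level='O2' casts the layer's parameters to fp16.
          model, opt = paddle.amp.decorate(models=model, optimizers=opt, level='O2')
          scaler = paddle.amp.GradScaler(init_loss_scaling=1024)

          data = paddle.randn([4, 16])
          with paddle.amp.auto_cast(level='O2'):
              loss = model(data).mean()
          scaled = scaler.scale(loss)
          scaled.backward()
          scaler.minimize(opt, scaled)
          opt.clear_grad()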
  24. 10 Sep 2021, 1 commit
  25. 16 Aug 2021, 1 commit
  26. 11 Aug 2021, 1 commit
    • [AMP] add state_dict and load_state_dict and unittest for class GradScaler (#34300) · 99f8f5c8
      Authored by zhangbo9674
      * add state_dict and load_state_dict and unittest for class GradScaler
      
      * refine unittest for coverage of load_state_dict
      
      * refine comments of code-block
      
      * refine some comments
      
      * refine state_dict code and unittest
      
      * add #require gpu, xpu for GradScaler get/set example code
      
      * add #require gpu, xpu for GradScaler get/set example code
      
      * refine example code
      
      * refine unittest for state_dict
      
      * refine unittest for state_dict
      
      * fix bug of DataLoader in TestGradScalerStateDict
      
      * add flag FLAGS_cudnn_deterministic
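      A sketch of the checkpointing interface added here (GPU assumed; the file name is illustrative):

          import paddle

          scaler = paddle.amp.GradScaler(init_loss_scaling=1024)
          # Capture the loss-scaling state and persist it alongside the
          # model/optimizer state, then restore it into a fresh scaler.
          paddle.save(scaler.state_dict(), 'scaler.pdparams')

          new_scaler = paddle.amp.GradScaler()
          new_scaler.load_state_dict(paddle.load('scaler.pdparams'))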
  27. 05 Aug 2021, 1 commit
  28. 15 Jul 2021, 1 commit
  29. 05 Jul 2021, 1 commit
  30. 01 Jul 2021, 1 commit
  31. 29 Jun 2021, 1 commit
  32. 21 Jun 2021, 1 commit
  33. 18 Nov 2020, 1 commit
  34. 14 Sep 2020, 1 commit
    • Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240) · d708b210
      Authored by Zhen Wang
      
      * update amp_check_finite_and_scale_op for static_amp.
      
      * use amp_check_finite_and_scale in static graph amp.
      
      * set grads to zero when they contain infinite values (for the amp_check_finite_and_scale op)
      
      * add update_loss_scaling op in cpp.
      
      * add update_loss_scaling_op unit test.
      
      * update the doc of the check_finite_and_unscale op
      
      * update the process of skipping the gradient update when gradients contain infinite values
      
      * update the way to zero grads.
      
      * update test_update_loss_scaling_op.py
      
      * add log info when find infinite grads.
      
      * add the unit test for UpdateLossScaling Layer.
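      The rule the new update_loss_scaling op implements, sketched in plain Python. Simplified: the real op also counts consecutive inf/nan steps via decr_every_n_nan_or_inf; names mirror the op's attributes:

          def update_loss_scaling(found_inf, scale, good_steps,
                                  incr_every_n_steps=1000,
                                  incr_ratio=2.0, decr_ratio=0.5):
              if found_inf:
                  # Infinite/NaN grads: shrink the scale, skip this update.
                  return max(scale * decr_ratio, 1.0), 0
              good_steps += 1
              if good_steps >= incr_every_n_steps:
                  # A long run of finite grads: safe to grow the scale.
                  return scale * incr_ratio, 0
              return scale, good_steps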
  35. 13 Aug 2020, 1 commit
    • Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903) · 2d95280e
      Authored by Leo Chen
      * add auto_cast, test=develop
      
      * add loss scaler, test=develop
      
      * add comments, test=develop
      
      * refine code, test=develop
      
      * refine code, test=develop
      
      * do not set flags automatically, test=develop
      
      * fix custom op bug, test=develop
      
      * add more test, test=develop
      
      * refine enable logic, test=develop
      
      * enable amp test with GPU, test=develop
      
      * add unittest
      
      * add test for found_inf
      
      * follow comments
      
      * follow comments
      
      * remove global variable, use singleton
      
      * add some notes
      
      * update comments
      
      * update comments
      
      * update comments
      
      * add use_dynamic_loss_scaling argument
      
      * refine found_inf
      
      * refine found_inf
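      The training loop this feature enables, written against today's public API (the entry points have since settled under paddle.amp); model and shapes are illustrative, GPU assumed:

          import paddle

          model = paddle.nn.Linear(10, 10)
          opt = paddle.optimizer.SGD(parameters=model.parameters())
          scaler = paddle.amp.GradScaler(init_loss_scaling=1024,
                                         use_dynamic_loss_scaling=True)

          x = paddle.randn([4, 10])
          with paddle.amp.auto_cast():          # O1: ops run in fp16 where safe
              loss = model(x).mean()
          scaled = scaler.scale(loss)           # scale loss against fp16 underflow
          scaled.backward()
          scaler.minimize(opt, scaled)          # unscale, check for inf, then step
          opt.clear_grad()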