1. 23 4月, 2021 1 次提交
    • L
      [NPU] refactor check_finite_and_scale npu kernel (#32407) · 39a59dcf
      Leo Chen 提交于
      * refactor_check_finite_and_scale_npu_kernel
      
      * fix compile
      
      * add alloc_float_status op
      
      * add alloc_float_status op
      
      * add FloatStatus for check_finite_and_unscale
      
      * refine code
      
      * remove unneccessary logic
      
      * refine for fleet
      39a59dcf
  2. 22 4月, 2021 2 次提交
  3. 21 4月, 2021 2 次提交
  4. 16 4月, 2021 1 次提交
  5. 15 4月, 2021 1 次提交
  6. 14 4月, 2021 1 次提交
  7. 08 4月, 2021 2 次提交
  8. 02 4月, 2021 1 次提交
  9. 30 3月, 2021 1 次提交
  10. 26 3月, 2021 2 次提交
  11. 24 3月, 2021 1 次提交
  12. 23 3月, 2021 1 次提交
  13. 22 3月, 2021 1 次提交
  14. 21 3月, 2021 1 次提交
  15. 19 3月, 2021 1 次提交
  16. 17 3月, 2021 1 次提交
  17. 12 3月, 2021 1 次提交
  18. 24 2月, 2021 1 次提交
    • T
      fix entry (#31079) · ebbdf525
      tangwei12 提交于
      * fix entry
      
      * fix distributed lookup table fuse case
      
      * fix entry bug at first time
      
      * move entry from paddle.fluid -> paddle.distributed
      
      * fix ut with paddle.enable_static()
      Co-authored-by: Nmalin10 <malin10@baidu.com>
      ebbdf525
  19. 05 2月, 2021 1 次提交
  20. 25 1月, 2021 1 次提交
  21. 20 1月, 2021 2 次提交
  22. 18 1月, 2021 2 次提交
  23. 13 1月, 2021 4 次提交
  24. 12 1月, 2021 1 次提交
    • T
      add sparse embedding & load vars for 2.0 & gloo bug fix (#30306) · 5e839e4d
      tangwei12 提交于
      * add sparse embedding & load vars for 2.0
      
      Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b
      
      * fix hdfs gloo
      
      Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6
      
      * fix gloo hdfs
      
      Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e
      
      * move loadvar/sparse embedding from incubute to static
      
      Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0
      5e839e4d
  25. 08 1月, 2021 2 次提交
    • Z
      Support pure fp16 training for AMP API. (#29544) · 7f7dfccf
      Zhen Wang 提交于
      * add cast ops before and after unsupported fp16 ops.
      
      * Keep partial net in FP32 pattern.
      
      * Support check_finite_and_unscale and update_loss_scaling for FP16 calculation mode.
      
      * Add fp16 support for adam op.
      
      * add multi precision attr for adam.
      
      * Fix the bug of test_multi_precision_fp16_train UT.
      
      * Code format for CI.
      
      * Fix the redefine error about MPTypeTrait on windows.
      
      * fix bugs of the _create_accumulators func in Momentum.
      
      * fix bug when inserting post cast op.
      
      * Add the update_loss_scaling op in allow_set of UnusedVarCheck.
      
      * Update for ci coverage.
      
      * Add some doc for OptimizerWithMixedPrecision.
      
      * Fix the code style.
      
      * Imporve the doc of `amp_init`.
      
      * Change for fp16 testing if users have the infer program defined in separate way.
      7f7dfccf
    • G
      Quantization supports 2.0 APIs (#30036) · 1bdf9242
      guofei 提交于
      * Quantization supports 2.0 APIs
      
      * Fix the error of save_quantized_model
      1bdf9242
  26. 07 1月, 2021 1 次提交
  27. 05 1月, 2021 2 次提交
  28. 30 12月, 2020 1 次提交
  29. 29 12月, 2020 1 次提交