1. 25 Jun, 2023 1 commit
  2. 16 Jun, 2023 1 commit
  3. 14 Jun, 2023 1 commit
    • Fix A100 CUDA12 ut (#54487) · a96c6dc7
      Committed by sneaxiy
      * fix A100 CUDA12 ut
      
      * fix ci uts
      
      * fix test_sync_batch_norm_op
      
      * fix sync bn op ut again by separating 2 files
      
      * fix codestyle ci
      
      * combine other PRs
      
      * fix codestyle
      
      * fix codestyle ci
  4. 12 Jun, 2023 2 commits
  5. 09 Jun, 2023 1 commit
  6. 01 Jun, 2023 1 commit
  7. 26 May, 2023 1 commit
    • Move QAT API from PaddleSlim to Paddle (#53591) · f2f9b0c2
      Committed by Leo Chen
      * Copy QAT files from PaddleSlim
      
      * Integrate QAT API into Paddle
      
      * Replace eval function
      
      * Reduce test_quant_aware run time
      
      * Apply new formatter on modified files
      
      * Remove code check for Paddle version check
      
      * Copy quant_post_quant_aware UT from PaddleSlim
      
      * Integrate test_quant_post_quant_aware UT into PaddlePaddle
      
      * Apply new formatter on modified files
      
      * Remove redundant code and add unittests
      
      * Add new unittests
      
      * Update the time limit of new unittests
  8. 22 May, 2023 2 commits
    • [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() (#53856) · 3794d171
      Committed by Meteor Liu
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * fixed cyclic reference that caused partial import
      
      * fixed bad change
      
      * fix bad import
      
      * fix bad import
      
      * fix bad import
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fixed usage of in_dynamic_mode() or in_dygraph_mode()
      
      * revert python3 to python in .pre-commit-config.yaml
      
      * fix merge conflicts
    • Fix ctest error in test_amp_api (#53885) · 56947361
      Committed by niuliling123
  9. 18 May, 2023 1 commit
    • [AMP]Master grad in static graph (#53362) · 972581d8
      Committed by shaojie_wang
      * add master gradients on static graph
      
      * add unit test for bf16 master grad static graph
      
      * use float16 as v100 test dtype
      
      * only skip GPUs that do not support bf16
      
      * use linear layer to test master grad
      
      * 1. push master grad creation before all optimizer ops; 2. remove useless unittest; 3. use a function to create master grad states
  10. 16 May, 2023 2 commits
  11. 15 May, 2023 1 commit
  12. 12 May, 2023 1 commit
  13. 11 May, 2023 2 commits
  14. 10 May, 2023 1 commit
  15. 09 May, 2023 1 commit
  16. 08 May, 2023 1 commit
  17. 06 May, 2023 1 commit
  18. 04 May, 2023 1 commit
  19. 25 Apr, 2023 1 commit
    • [fluid clean] remove Print. (#51778) · ed45ecc6
      Committed by qizhaoaoe
      * fluid clean: remove print/switch from fluid to static
      
      * remove Switch in static.__init__
      
      * fix conflicts.
      
      * replace Switch by case.
      
      * fix piecewise_lr decay.
      
      * fix typo
      
      * fix conflicts.
      
      * fix lr dtype
      
      * keep Switch in paddle.static.nn.control_flow and fix piecewise_lr.
      
      * fix conflicts.
      
      * keep Switch in the fluid.
      
      * fix Switch doc
      
      * fix example in Switch doc
      
      * fix Switch doc.
      
      * fix static/__init__.
  20. 24 Apr, 2023 2 commits
  21. 23 Apr, 2023 1 commit
    • relocate paddle/fluid/contrib/layers (#52820) · 802be98b
      Committed by LoneRanger
      * relocate metric_op.py
      
      * relocate nn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix variable->tensor and fix __all__
      
      * fix ctr_metric_bundle and sparse_embedding
      
      * fix bug of function init
      
      * fix bug of importing sparse_embedding and ctr_metric_bundle
      
      * fix bug
      
      * Update __init__.py
  22. 22 Apr, 2023 1 commit
  23. 21 Apr, 2023 1 commit
  24. 19 Apr, 2023 1 commit
  25. 18 Apr, 2023 3 commits
  26. 14 Apr, 2023 3 commits
    • update_npu_check_finite_and_unscale (#52914) · ddcc1002
      Committed by duanyanhui
    • 1. modify set_value op, use Scalars to represent attr `values`, instead of a... · dd2a749a
      Committed by Feiyu Chan
      1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributes of various types; (#52408)
      
      2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats (the differences are mainly some operators with incompatible updates in the definition);
      3. program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc` to identify the version;
      4. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, which decides whether to convert ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute;
      5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in legacy format (contain any of the operators that have been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead; this is not recommended, but it can be used for testing.
    • [AMP] Unify the static amp codes of fp16 and bf16. (#52694) · dfcba7f4
      Committed by Yiqun Liu
      * Unify the static amp codes of fp16 and bf16.
      
      * Polish apis and add unittest.
      
      * Add operator stats collecting tools for program.
      
      * Add the check of number of bfloat16 operators in unittest.
      
      * Add warning for operator not supported for amp.
      
      * Add testing of BF16 O1 and O2.
  27. 12 Apr, 2023 1 commit
  28. 10 Apr, 2023 1 commit
    • 【AMP OP&Test】instance_norm fp16 and bf16 support. (#52241) · 7c98abd9
      Committed by qizhaoaoe
      * add fp16 and bf16 support for instance_norm
      
      * fix /= operator, which does not support bf16
      
      * fix instance_norm_grad kernel and unittests.
      
      * fix fp32 unittests.
      
      * fix instance_norm_kernel and unittests.
      
      * fix instance_norm_grad_kernel and unittest threshold.
      
      * add fp16/bf16 for instance_norm_grad_grad op.
      
      * add bf16 dtype check.
      
      * fix conflicts.
      
      * fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.
      
      * fix type in instance_norm_kernel.
      
      * fix bf16 outputs in unittests and refine codes.
      
      * fix dx computation.
      
      * delete unused params and header includes.
      
      * add fp16/bf16 for static graph.
      
      * fix device condition for instance_norm op.
      
      * fix instance_norm_grad_grad and bf16 op tests.
      
      * fix op_test so that bf16 grads can be compared with fp32.
      
      * remove updates.
      
      * add self-defined grad.
  29. 09 Apr, 2023 1 commit
  30. 06 Apr, 2023 2 commits