1. 21 Apr, 2023 1 commit
  2. 19 Apr, 2023 1 commit
  3. 18 Apr, 2023 3 commits
  4. 14 Apr, 2023 3 commits
    • D
      update_npu_check_finite_and_unscale (#52914) · ddcc1002
      duanyanhui authored
    • F
      1. modify set_value op, use Scalars to represent attr `values`, instead of a... · dd2a749a
      Feiyu Chan authored
      1. modify the set_value op to use Scalars to represent the attr `values`, instead of a bunch of attributes of various types; (#52408)
      
      2. add a program converter, with the set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between the old and new formats (the differences are mainly some operators with incompatible updates in their definitions);
      3. the program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc`, to identify the version;
      4. provide an option `legacy_format=false` in the serialization of `paddle::framework::ProgramDesc`, which decides whether to convert the ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute;
      5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in the legacy format (it contains any of the operators that have been incompatibly updated and that have an attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. This is not recommended, but it can be useful for testing.
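The conversion described in this commit message can be illustrated with a minimal sketch. It rests on loud assumptions: plain dicts stand in for operator descriptions (the real `paddle::framework::ProgramDesc` is a protobuf message), the typed attribute names `bool_values`, `int64_values`, and `fp32_values` are chosen for illustration only, and none of the functions below are Paddle APIs.

```python
# Hypothetical, simplified model of an operator description: a plain dict.
# The attribute names below are assumptions made for this sketch.
LEGACY_ATTRS = ("bool_values", "int64_values", "fp32_values")


def is_legacy(op):
    """Treat an op as legacy if it still carries any typed value list."""
    return any(name in op["attrs"] for name in LEGACY_ATTRS)


def to_new_format(op):
    """Merge the typed legacy lists into a single `values` attribute."""
    attrs = {k: v for k, v in op["attrs"].items() if k not in LEGACY_ATTRS}
    values = []
    for name in LEGACY_ATTRS:
        values.extend(op["attrs"].get(name, []))
    attrs["values"] = values
    return {"type": op["type"], "attrs": attrs}


def to_legacy_format(op):
    """Move `values` back into one typed list, chosen from the Python type."""
    attrs = {k: v for k, v in op["attrs"].items() if k != "values"}
    values = op["attrs"].get("values", [])
    if values:
        first = values[0]
        # bool must be tested before int because bool is a subclass of int.
        if isinstance(first, bool):
            name = "bool_values"
        elif isinstance(first, int):
            name = "int64_values"
        else:
            name = "fp32_values"
        attrs[name] = list(values)
    return {"type": op["type"], "attrs": attrs}


def load(op):
    """Mimic automatic conversion on load: convert only legacy ops."""
    return to_new_format(op) if is_legacy(op) else op


old_op = {"type": "set_value", "attrs": {"axes": [0], "fp32_values": [1.5, 2.0]}}
new_op = load(old_op)
```

Here `load` plays the role of the automatic detection on deserialization, and `to_legacy_format` plays the role of saving with the legacy option enabled. The sketch picks the legacy list from the first value's Python type, which is a simplification; a real converter would rely on the recorded data type.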
    • Y
      [AMP] Unify the static amp codes of fp16 and bf16. (#52694) · dfcba7f4
      Yiqun Liu authored
      * Unify the static amp codes of fp16 and bf16.
      
      * Polish apis and add unittest.
      
      * Add operator stats collecting tools for program.
      
      * Add a check of the number of bfloat16 operators in unittest.
      
      * Add a warning for operators not supported by amp.
      
      * Add testing of BF16 O1 and O2.
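The operator statistics mentioned in this commit can be pictured with a small sketch. It is not the tool the commit adds: the program is modelled as a hypothetical list of `(op_type, dtype)` pairs, and the function names `count_ops_by_dtype` and `total_for_dtype` are invented for illustration.

```python
from collections import defaultdict

# Hypothetical program representation: a list of (op_type, dtype) pairs.
TRACKED_DTYPES = ("fp16", "bf16", "fp32")


def count_ops_by_dtype(ops):
    """Count calls per operator type, split by the dtype each call ran in."""
    stats = defaultdict(lambda: {"fp16": 0, "bf16": 0, "fp32": 0, "other": 0})
    for op_type, dtype in ops:
        key = dtype if dtype in TRACKED_DTYPES else "other"
        stats[op_type][key] += 1
    return dict(stats)


def total_for_dtype(stats, dtype):
    """Total number of operator calls that ran in the given dtype."""
    return sum(row[dtype] for row in stats.values())


ops = [("matmul", "bf16"), ("matmul", "bf16"), ("softmax", "fp32"), ("cast", "int64")]
stats = count_ops_by_dtype(ops)
```

A unit test in the spirit of the commit could then assert on `total_for_dtype(stats, "bf16")` to check that the expected number of operators ran in bfloat16.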
  5. 12 Apr, 2023 1 commit
  6. 10 Apr, 2023 1 commit
    • Q
      【AMP OP&Test】instance_norm fp16 and bf16 support. (#52241) · 7c98abd9
      qizhaoaoe authored
      * add fp16 and bf16 support for instance_norm
      
      * fix /= operator, which does not support bf16
      
      * fix instance_norm_grad kernel and unittests.
      
      * fix fp32 unittests.
      
      * fix instance_norm_kernel and unittests.
      
      * fix instance_norm_grad_kernel and unittest threshold.
      
      * add fp16/bf16 for instance_norm_grad_grad op.
      
      * add bf16 dtype check.
      
      * fix conflicts.
      
      * fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.
      
      * fix type in instance_norm_kernel.
      
      * fix bf16 outputs in unittests and refine codes.
      
      * fix dx computation.
      
      * delete unused params and header includes.
      
      * add fp16/bf16 for static graph.
      
      * fix device condition for instance_norm op.
      
      * fix instance_norm_grad_grad and bf16 op tests.
      
      * fix op_test so that bf16 grads can be compared with fp32.
      
      * remove updates.
      
      * add self-defined grad.
  7. 09 Apr, 2023 1 commit
  8. 06 Apr, 2023 2 commits
  9. 05 Apr, 2023 1 commit
  10. 04 Apr, 2023 1 commit
  11. 03 Apr, 2023 1 commit
  12. 01 Apr, 2023 1 commit
  13. 31 Mar, 2023 1 commit
  14. 30 Mar, 2023 3 commits
  15. 29 Mar, 2023 1 commit
  16. 28 Mar, 2023 3 commits
  17. 25 Mar, 2023 2 commits
  18. 23 Mar, 2023 2 commits
  19. 22 Mar, 2023 2 commits
  20. 21 Mar, 2023 4 commits
  21. 20 Mar, 2023 2 commits
    • G
      Fluid clean move out fill constant (#49511) · c985b1ac
      GGBond8488 authored
      * migrate fill_constant to paddle.tensor
      
      * move fill_constant to paddle.tensor and replace the reference
      
      * add missing fill_constant replacement
      
      * fix typo
      
      * remove unused import fill_constant
      
      * fix zeros import error
      
      * fix circular import
      
      * fix layers.zeros
      
      * fix unittest
      
      * fix unittests
      
      * fix unittest
      
      * use paddle.full to replace fill_constant in sample code
      
      * fix sample code
      
      * recover xpu test
      
      * recover xpu test
      
      * fix circular import
      
      * fix utils import error
      
      * fix utils error
      
      * fix circular import
      
      * redo
      
      * fix circular import
      
      * fix prim fill constant import
      
      * fix type error
      
      * fix increase error
      
      * fix test error
      
      * fix fill_constant
  22. 16 Mar, 2023 1 commit
  23. 15 Mar, 2023 1 commit
  24. 14 Mar, 2023 1 commit