1. 25 Nov, 2021 1 commit
  2. 16 Nov, 2021 1 commit
    • Z
      [cherry-pick-2.2.1]fix fused_transformer_encoder_layer bug (#37229) · 36dd295e
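      Committed by zhangkaihuo
      Fixes several issues found while fine-tuning with fused_transformer_encoder_layer:
      
          Add support for attn_mask=None in fused_attention_op: PR
          Fix pre_layer_norm handling: PR
          Fix parameter handling and computation errors: PR
          Fix incorrect add_bias computation: PR
          Add pure fp16 support: PR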
  3. 27 Oct, 2021 1 commit
  4. 26 Oct, 2021 4 commits
  5. 18 Sep, 2021 1 commit
  6. 17 Sep, 2021 2 commits
    • Z
      [AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d
      Committed by zhangbo9674
      * add pure fp16 major function in auto_cast & tracer
      
      * support master weight in dygraph for pure fp16
      
      * check mix dtype of fp16&fp32 for check_finite_and_unscale op
      
      * change pure fp16 function name
      
      * fix some bugs in auto_cast
      
      * refine auto_cast interface logic
      
      * add param _casted_by_pure_fp16 for class Layer
      
      * support state_dict hook for saving the model in a user-specified dtype in pure_fp16_decorator
      
      * refine pure_fp16_decorator as decorator
      
      * add unittest
      
      * add comment
      
      * add comment
      
      * support recompute
      
      * add comment for auto_cast and decorator
      
      * support to_static_state_dict for paddle.jit.save
      
      * remove the limit on the number of models and optimizers
      
      * add lookup_table in black_list
      
      * fix momentum and layer state_dict
      
      * fix bug in layer state_dict
      
      * fix bug in layer state_dict_helper
      
      * refine unittest
      
      * refine test_momentum_op
      
      * refine interface and some code
      
      * refine amp_decorator interface
      
      * refine pure fp16 interface
      
      * refine master weight interface
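The "master weight" item above refers to keeping a higher-precision copy of each parameter while the model itself runs in fp16. The following is a minimal, framework-free sketch of why that copy matters; it is not Paddle's implementation. The helper names (`to_fp16`, `sgd_fp16_only`, `sgd_with_master_weight`) are hypothetical, and a 64-bit Python float stands in for the fp32 master weight.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def sgd_fp16_only(w: float, grad: float, lr: float, steps: int) -> float:
    """Plain SGD where the weight and the update are both stored in fp16."""
    w = to_fp16(w)
    for _ in range(steps):
        w = to_fp16(w - to_fp16(lr * grad))
    return w


def sgd_with_master_weight(w: float, grad: float, lr: float, steps: int):
    """SGD that updates a higher-precision master copy of the weight.

    Returns (master_weight, fp16_weight). The fp16 weight is what the
    forward pass would use; it is re-derived from the master copy.
    """
    master = float(w)  # assumption: Python float stands in for fp32
    for _ in range(steps):
        master -= lr * grad
    return master, to_fp16(master)


# Each update (1e-4) is smaller than half the fp16 spacing just below 1.0,
# so the fp16-only weight never moves, while the master weight accumulates.
fp16_only = sgd_fp16_only(1.0, 1.0, 1e-4, 1000)
master, fp16_from_master = sgd_with_master_weight(1.0, 1.0, 1e-4, 1000)
```

In this toy run the fp16-only weight is stuck at its initial value, while the weight derived from the master copy has moved by the full accumulated update.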
    • Z
      Make flag adding easier (#35823) · 2c781455
      Committed by Zeng Jinle
      * make flag setter easier
      
      * update
      
      * rename macro name
      
      * fix bug of public/writable
      
      * update to pass CI
      
      * polish
      
      * fix CPU link error
  7. 14 Sep, 2021 1 commit
    • H
      Add solutions to PyLayer which is unsupported in DataParallel (#35401) · d483b8c0
      Committed by Haohongxiang
      * Add solutions to PyLayer which is unsupported in DataParallel
      
      * modify note format for parallel.py
      
      * modify docs of dataparallel
      
      * add docs of dp with pylayer
      
      * modify docs format
      
      * modify example format
      
      * change example of dp with pylayer
      
      * add unittest for dp with pylayer
      
      * modify ut
      
      * merge latest codes
      
      * update
      
      * modify for CI-Coverage
      
      * modify text-indent
  8. 10 Sep, 2021 1 commit
  9. 08 Sep, 2021 2 commits
  10. 01 Sep, 2021 1 commit
  11. 24 Aug, 2021 1 commit
    • H
      Add no_sync in data parallel for dynamic graph (#34740) · b09f4d7f
      Committed by Haohongxiang
      * Add no_sync in data parallel for dynamic graph
      
      * modify UT of no_sync
      
      * delete test_parallel_dygraph_dataparallel_no_sync.py
      
      * add test_parallel_dygraph_no_sync.py
      
      * modify run_trainer_with_spawn in UTs
      
      * Add UT of complex control flow in no_sync
      
      * add specific descriptions and notes for no_sync
      
      * check code style
      
      * modify UT's TIMEOUT in CMakeLists.txt
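The idea behind a `no_sync` context in data-parallel training is to let gradients accumulate locally over several backward passes and pay for gradient synchronization only once. The sketch below illustrates that idea with a toy class; `ToyDataParallel` and all of its members are hypothetical and do not reflect Paddle's `DataParallel` internals.

```python
from contextlib import contextmanager


class ToyDataParallel:
    """Hypothetical stand-in for a data-parallel wrapper (not Paddle's class)."""

    def __init__(self, num_workers: int):
        self.num_workers = num_workers
        self.local_grads = [0.0] * num_workers  # one scalar gradient per worker
        self.sync_enabled = True
        self.allreduce_calls = 0

    @contextmanager
    def no_sync(self):
        """Temporarily skip gradient synchronization inside the block."""
        previous = self.sync_enabled
        self.sync_enabled = False
        try:
            yield
        finally:
            self.sync_enabled = previous

    def backward(self, per_worker_grads):
        """Accumulate each worker's gradient, then synchronize if enabled."""
        for rank, grad in enumerate(per_worker_grads):
            self.local_grads[rank] += grad
        if self.sync_enabled:
            self._allreduce_mean()

    def _allreduce_mean(self):
        """Replace every worker's gradient with the mean across workers."""
        mean = sum(self.local_grads) / self.num_workers
        self.local_grads = [mean] * self.num_workers
        self.allreduce_calls += 1


model = ToyDataParallel(num_workers=2)
with model.no_sync():
    model.backward([1.0, 3.0])  # accumulated locally, no communication
    model.backward([2.0, 4.0])  # accumulated locally, no communication
model.backward([3.0, 5.0])      # last micro-batch triggers one all-reduce
```

Three backward passes cost a single synchronization here, and both workers end up with the mean of their accumulated gradients.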
  12. 12 Aug, 2021 1 commit
  13. 06 Aug, 2021 1 commit
  14. 05 Aug, 2021 1 commit
  15. 04 Aug, 2021 1 commit
  16. 03 Aug, 2021 3 commits
  17. 09 Jul, 2021 1 commit
  18. 02 Jul, 2021 1 commit
  19. 30 Jun, 2021 1 commit
  20. 29 Jun, 2021 1 commit
  21. 24 Jun, 2021 1 commit
    • H
      [NPU] support dygraph execution on npu place(#33579) · 6aea6be2
      Committed by houj04
      * in NPU environment, use CPUPlace for missing operators.
      
      * in NPU environment, use CPUPlace for missing operators.
      
      * fix TensorCopy bug and add unit test.
      
      * fix code style.
      
      * add more unit tests.
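The fallback described above (run an operator on CPUPlace when no NPU kernel exists) can be pictured as a simple lookup with a preferred and a fallback device. This is only an illustrative sketch: `choose_place`, the `kernels` registry, and the string device names are hypothetical and are not how Paddle registers or selects kernels.

```python
def choose_place(op_type, kernels, preferred="npu", fallback="cpu"):
    """Pick the device an op should run on, falling back when needed.

    `kernels` maps an op name to the set of devices that have a kernel
    registered for it (a hypothetical registry used only in this sketch).
    """
    available = kernels.get(op_type, set())
    if preferred in available:
        return preferred
    if fallback in available:
        return fallback
    raise LookupError(f"no kernel registered for op {op_type!r}")


# Hypothetical registry: "matmul" has an NPU kernel, "unique" does not.
kernels = {"matmul": {"npu", "cpu"}, "unique": {"cpu"}}
matmul_place = choose_place("matmul", kernels)
unique_place = choose_place("unique", kernels)
```

An op that falls back to the CPU needs its inputs copied to the CPU and its outputs copied back, which is presumably why the same commit also touches TensorCopy.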
  22. 23 Jun, 2021 1 commit
    • W
      optimize attr default value (#33357) · 5d2eb678
      Committed by wanghuancoder
      * optimize attr default value, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug in AttrReader, test=develop
      
      * fix bug, test=develop
      
      * fix double_grad, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix checker null, test=develop
      
      * for test, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
  23. 21 Jun, 2021 1 commit
  24. 10 Jun, 2021 1 commit
  25. 08 Jun, 2021 1 commit
  26. 26 May, 2021 1 commit
  27. 12 May, 2021 1 commit
  28. 11 May, 2021 1 commit
  29. 10 May, 2021 1 commit
  30. 01 May, 2021 1 commit
  31. 30 Apr, 2021 2 commits
    • W
      pylayer_op:release context after compute. (#32707) · 3cc11a3d
      Committed by WeiXin
    • P
      Add 12 inplace APIs including auto generated (#32573) · 308073de
      Committed by pangyoki
      * add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs
      
      * add softmax_with_cross_entropy_ Inplace API
      
      * add clip_ scale_ add_ subtract_ Inplace APIs
      
      * add wlist
      
      * fix parameter of scale api
      
      * add add_n_ Inplace API and remove log_ Inplace API
      
      * fix elementwise_add_ and elementwise_sub_ broadcast problem
      
      * elementwise inplace APIs raise an error message before running the op
      
      * use broadcast_shape in elementwise inplace op
      
      * add 8 inplace APIs that are auto generated
      
      * add unittest for all inplace apis
      
      * add decorator for inplace apis in static mode
      
      * fix windows blas fail of exp inplace api, change array_equal to allclose
      
      * add flatten inplace api
      
      * add flatten unittest
      
      * fix flatten unittest
      
      * add decorator
      
      * fix grad.numpy in test_pylayer_op
      
      * drop support for softmax_with_cross_entropy_
      
      * add test_inplace_softmax_with_cross_entropy to static_mode_white_list
      
      * delete __all__ in inplace_utils
      
      * delete activation inplace function and add Tensor.inplace_func
      
      * change paddle.inplace_ to Tensor.inplace_
      
      * fix little problem
      
      * add paddle in inplace_utils
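The broadcast-related items above come down to one constraint: an in-place elementwise op writes its result into `x`, so the broadcast result shape must equal the shape of `x`. Below is a pure-Python sketch of such a pre-check under that assumption. The functions `broadcast_shape` and `check_inplace_elementwise` are written here for illustration and are not Paddle's code.

```python
def broadcast_shape(x_shape, y_shape):
    """Return the NumPy-style broadcast shape of two shapes, or raise."""
    result = []
    rank = max(len(x_shape), len(y_shape))
    # Walk the dimensions from the trailing axis, padding the shorter shape with 1.
    for i in range(1, rank + 1):
        x_dim = x_shape[-i] if i <= len(x_shape) else 1
        y_dim = y_shape[-i] if i <= len(y_shape) else 1
        if x_dim != y_dim and x_dim != 1 and y_dim != 1:
            raise ValueError(
                f"shapes {tuple(x_shape)} and {tuple(y_shape)} cannot be broadcast"
            )
        result.append(y_dim if x_dim == 1 else x_dim)
    return tuple(reversed(result))


def check_inplace_elementwise(x_shape, y_shape):
    """Fail early if the result of x op y would not fit back into x."""
    out_shape = broadcast_shape(x_shape, y_shape)
    if out_shape != tuple(x_shape):
        raise ValueError(
            f"in-place op needs output shape {out_shape} to equal "
            f"the shape of x {tuple(x_shape)}"
        )
    return out_shape


ok_shape = check_inplace_elementwise((2, 3), (3,))  # y broadcasts into x
```

Swapping the arguments, as in `check_inplace_elementwise((3,), (2, 3))`, raises `ValueError`, because the result would be larger than the tensor being overwritten.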
  32. 29 Apr, 2021 1 commit