1. 16 12月, 2022 1 次提交
  2. 15 12月, 2022 1 次提交
  3. 12 12月, 2022 2 次提交
  4. 11 12月, 2022 1 次提交
  5. 09 12月, 2022 2 次提交
  6. 08 12月, 2022 3 次提交
  7. 07 12月, 2022 2 次提交
  8. 06 12月, 2022 1 次提交
  9. 05 12月, 2022 4 次提交
  10. 02 12月, 2022 2 次提交
    • B
      Split common funcs from reduction and structure modification (#46970) · ef575d6a
      Bo Zhang 提交于
      * profile reduce kernel for fp16 and reduceHigherdim
      
      * use reinterpret_cast
      
      * fix for CI on ROCm
      
      * add Macro for ROCm
      
      * ROCm CI config
      
      * ROCm CI config
      
      * unit test repair
      
      * pull
      
      * add common_funcs.h
      
      * reduceType
      
      * Update reduce_function.h
      
      * not higher
      
      * rename
      ef575d6a
    • J
      [Eager] Optimize Grad by prune useless branch (#47827) · d1e93be1
      Jiabin Yang 提交于
      * [Eager] Fix paddle.grad interface
      
      * [Eager] Support minimum SubGraph for GeneralGrad
      
      * Add needed_nodes to prune grad graph more thoroughly
      
      * [Eager] Add grad_node_trans_mapping_ to record which grad_node has been transformed to AccumulationNode
      
      * [Eager] Fix paddle.grad interface
      
      * Polish code
      
      * remove potential_stop_node
      
      * Add endding_nodes to enhance genSugraph logic
      
      * clear endding_nodes_
      
      * polish code
      
      * rename endding_nodes to endding_nades_
      
      * Refactor grad interface
      
      * Add register_hook case to fix coverage-ci
      
      * Fix code format
      
      * Refactor general_grad
      
      * Add more code comments
      
      * call clear directly to release GradSlotMeta
      
      * fix a mistake
      
      * fix matmul/ multiply kernel logic and optional input in yaml, fill zeros logic and so on.
      
      * fix batch_norm_double_grad yaml optional config
      
      * fix tanh_triple_grad yaml and kernels
      
      * fix MultiplyTripleGradKernel optional logic
      
      * fix merge mistake
      
      * fix compile error
      
      * remove legacy attr for bn
      
      * polish code
      
      * fix some kernel
      
      * merge develop
      
      * fix error
      
      * remote log
      
      * fix kernel with full like
      
      * hide value log behind
      
      * hide value log behind
      
      * fix matmul_triple grad
      Co-authored-by: NWeilong Wu <veyron_wu@163.com>
      d1e93be1
  11. 01 12月, 2022 1 次提交
  12. 30 11月, 2022 3 次提交
  13. 29 11月, 2022 1 次提交
  14. 28 11月, 2022 4 次提交
  15. 25 11月, 2022 2 次提交
  16. 24 11月, 2022 4 次提交
    • W
      [Fluid clean] (#48105) · 43b92b63
      wangxiaoning 提交于
      * add index sample fp16 support
      
      * remove fluid APIs in distributed_strategy.py and role_maker.py
      
      * Revert "remove fluid APIs in distributed_strategy.py and role_maker.py"
      
      This reverts commit 223bbee990d3bf69e252fc3c0f19e3873550a264.
      
      * remove fluid APIs in distributed_strategy.py and role_maker.py
      
      * remove index sample op changes
      
      * remove fluid APIs under fleet.base
      
      * remove fluid APIs under fleet.layers.mpu
      
      * remove fluid APIs under fleet.meta_optimizers
      
      * fix fluid error
      
      * fix util_factory.py
      
      * reset fluid.io.load_inference_model API
      43b92b63
    • H
      [PHI decoupling] simplify "convert_utils.h" in fluid (#48168) · de4310e6
      huangjiyi 提交于
      * rm dependence to "convert_utils.h" in some files
      
      * fix bugs
      
      * replace DataType2String with DataTypeToString
      
      * replace framework::DataTypeSize with phi::SizeOf
      
      * mv convert_function from fluid to phi and rm old map
      
      * recommit with pre-commit
      
      * repalce ProtoVarType with ProtoDataType and update comment.
      
      * fix error about include "dnnl.hpp"
      
      * revert add dep mkldnn to convert_utils in phi
      
      * add mkldnn deps in convert_utils.h in phi
      
      * move deps to convert_utils.h in phi
      de4310e6
    • P
      df23c7c3
    • S
      fix adam thread num (#48297) · dd27996c
      sneaxiy 提交于
      dd27996c
  17. 23 11月, 2022 1 次提交
  18. 22 11月, 2022 1 次提交
  19. 21 11月, 2022 2 次提交
  20. 18 11月, 2022 2 次提交
    • MarDino's avatar
      Optimize FusedBiasAddGelu Kernel (#47679) · b0e28540
      MarDino 提交于
      * Add quick gelu and fused bias add kernel
      
      * fix annotation
      
      * remove useless code
      
      * add fast gelu option and set it in multi transformer op
      
      * add flag to restrict if use fast gelu approximate
      
      * fix flags conflict
      
      * fix use tanh function instead
      
      * add cudart version limit
      
      * use phi fast tanh func
      
      * fix comment
      b0e28540
    • H
      [PHI decoupling] move "gpu_device_function.h" from fluid to phi (#48097) · 27ee6e71
      huangjiyi 提交于
      * move "paddle/phi/backends/gpu/gpu_device_function.h" to phi
      
      * update copyright years
      
      * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi
      
      * fix rocm-complie bugs
      27ee6e71