  1. 11 Aug, 2023 1 commit
  2. 10 Aug, 2023 9 commits
  3. 09 Aug, 2023 16 commits
    • X
      【New IR】backward code of new ir (#55957) · 0399b39f
      xiaoguoguo626807 committed
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * add vjp interface
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * add eager and static backend for wrap lower level api
      
      * support call_vjp pybind
      
      * polish code and add test for vjp
      
      * remove useless code
      
      * polish code
      
      * remove useless code
      
      * support mean vjp
      
      * backward origin code
      
      * add test for mean vjp and support has_vjp function
      
      * fix call_vjp
      
      * polish code
      
      * add attrs and dtype interface
      
      * add primitive ops set for backend
      
      * fix compile bugs
      
      * fix some bugs
      
      * fix windows bugs
      
      * add vjp test for tanh_
      
      * fix inference CI
      
      * fix inference ci
      
      * modify fluid cmake
      
      * origin test of tanh and mean passed
      
      * fix conflict
      
      * modify stop_gradient
      
      * modify block.ops
      
      * modify test
      
      * fix conflict
      
      * reply review comments
      
      * reply review comments
      
      * polish code
      
      ---------
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: Charles-hit <wanghao107@baidu.com>
      Co-authored-by: zhangbo9674 <zhangbo54@baidu.com>
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
      0399b39f
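
The commit above mentions VJP (vector-Jacobian product) tests for `tanh` and `mean`. As a rough illustration of what such a rule computes, here is a minimal pure-Python sketch. The function names `tanh_vjp` and `mean_vjp` are hypothetical and are not taken from the Paddle source; the sketch only assumes the standard derivatives of the two operations.

```python
import math


def tanh_vjp(x, out_grad):
    """VJP of elementwise tanh: each input gradient is out_grad * (1 - tanh(x)^2)."""
    return [g * (1.0 - math.tanh(v) ** 2) for v, g in zip(x, out_grad)]


def mean_vjp(x, out_grad):
    """VJP of a mean over all elements: every input receives out_grad / n."""
    n = len(x)
    return [out_grad / n for _ in x]


if __name__ == "__main__":
    x = [0.3, -1.2, 2.0]
    # Gradient of sum(tanh(x)) with respect to x (upstream gradient of ones).
    print(tanh_vjp(x, [1.0, 1.0, 1.0]))
    # Gradient of mean(x) with respect to x (scalar upstream gradient of 1).
    print(mean_vjp(x, 1.0))
```

A unit test for a rule like this would typically compare the returned gradient against a finite-difference estimate of the forward function.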
    • X
      [New IR] add Add_n op description and python api (#56080) · 7cbb433a
      xiaoguoguo626807 committed
      * refine program translator
      
      * fix warning: not override
      
      * fix bug
      
      * merge new modifications
      
      * modify by reviews
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix
      
      * fix
      
      * fix conflicts
      
      * pseudocode of backward
      
      * modify test
      
      * modify register op
      
      * clear other code
      
      * modify ci build bug
      
      * reply review comments
      
      * reply review comments
      
      ---------
      Co-authored-by: kangguangli <kangguangli@hotmail.com>
      7cbb433a
    • X
      [Paddle Inference] Set softmax op use_cudnn default true. (#56036) · 4f2cf7fb
      xiaoxiaohehe001 committed
      * fix_softmax_eigen
      
      * fix_ctest_seresnet
      
      * fix_ci_error
      4f2cf7fb
    • X
      [New Feature] add _inplace_assign interface for sot. (#56077) · b4a4f1bb
      xiongkun committed
      * [New Feature] add _inplace_assign interface for sot
      
      * add unittest for inplace_assign
      b4a4f1bb
    • X
      [oneDNN]rename macro to PADDLE_WITH_DNNL (#52208) · 6ff4c130
      Xinyu Chen committed
      * onednn: rename macro to PADDLE_WITH_DNNL
      
      * onednn: rename macro to CINN_WITH_DNNL
      6ff4c130
    • R
      [clang-tidy] fix modernize-make-unique (#55764) · 9f04f2ac
      Ruibin Cheung committed
      9f04f2ac
    • C
      Add FP16 & BF16 for nanmedian (#56056) · 4ae9945b
      cyberslack_lee committed
      4ae9945b
    • U
      Fix select sdp for FA-2 (#56045) · 08e46d6f
      umiswing committed
      08e46d6f
    • N
      change index's dtype from int to int64 (#55949) · 8d181e37
      niuliling123 committed
      8d181e37
    • H
      b982af4a
    • H
      [XPU] add pos_weight for sigmoid_cross_entropy_with_logits. (#55001) · 4315bc4c
      houj04 committed
      * [XPU] add pos_weight for sigmoid_cross_entropy_with_logits.
      
      * update xdnn version.
      4315bc4c
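
For readers unfamiliar with `pos_weight`, the sketch below shows the usual weighted form of sigmoid cross entropy with logits, in which the positive-class term is scaled by `pos_weight`. It is a pure-Python illustration under that assumption; the helper name `sigmoid_bce_with_logits` is hypothetical, and the XPU kernel added in the commit above is not shown in this log.

```python
import math


def softplus(v):
    """Numerically stable log(1 + exp(v))."""
    return max(v, 0.0) + math.log1p(math.exp(-abs(v)))


def sigmoid_bce_with_logits(logit, label, pos_weight=1.0):
    """Assumed definition:
    loss = -pos_weight * label * log(sigmoid(x)) - (1 - label) * log(1 - sigmoid(x)),
    using log(sigmoid(x)) = -softplus(-x) and log(1 - sigmoid(x)) = -softplus(x).
    """
    return pos_weight * label * softplus(-logit) + (1.0 - label) * softplus(logit)


if __name__ == "__main__":
    # A positive example is penalized pos_weight times more than with the default.
    print(sigmoid_bce_with_logits(0.0, 1.0))
    print(sigmoid_bce_with_logits(0.0, 1.0, pos_weight=2.0))
```

With `pos_weight` greater than 1, misclassified positive labels contribute more to the loss, which is a common way to compensate for class imbalance.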
    • L
      Reduce the workspace for TRT index_select UT (#56069) · 597f6d43
      Leo Chen committed
      597f6d43
    • F
      Set Input Type for TensorRT convert tests (#55823) · 242d26d4
      Frank Lin committed
      * set input type to fp16
      242d26d4
    • K
      [NewIR] minor fix about new ir test (#56075) · a127d7c8
      kangguangli committed
      * fix bugs about new ir test
      
      * enable dy2st newir test in all cases
      
      * fix
      a127d7c8
    • L
      remove the... · 723c6f77
      LoneRanger committed
      remove the AdamOptimizer, SGDOptimizer, MomentumOptimizer, ModelAverage, LookaheadOptimizer, FtrlOptimizer, DecayedAdagradOptimizer, and DpsgdOptimizer in fluid; relocate the ExponentialMovingAverage, PipelineOptimizer, and GradientMergeOptimizer; and change the optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970)
      
      * change the optimizer base for SGDOptimizer
      
      * change the optimizer base for SGDOptimizer
      
      * replace the SGDOptimizer with SGD
      
      * fix bug of sgd
      
      * change the optimizer base for MomentumOptimizer
      
      * fix the remaining tests
      
      * remove the Momentum in fluid/optimizer.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * Update test_resnet_cinn.py
      
      * Update test_resnet_prim_cinn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * remove the ModelAverage in fluid
      
      * remove the LookaheadOptimizer in fluid
      
      * fix bug
      
      * remove AdamOptimizer in fluid
      
      * Update test_image_classification_fp16.py
      
      * fix bug
      
      * relocate the ExponentialMovingAverage in fluid
      
      * restore the static api
      
      * remove the FtrlOptimizer in fluid
      
      * remove the DecayedAdagradOptimizer in fluid
      
      * remove the DpsgdOptimizer in fluid
      
      * fix bug
      
      * fix codestyle
      
      * fix bug
      
      * fix bug
      
      * relocate the PipelineOptimizer
      
      * relocate the GradientMergeOptimizer
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix doc
      
      * Update __init__.py
      
      * Update test_fleet_qat_meta_optimizer.py
      
      * change optimizer base for LarsMomentumOptimizer
      
      * fix bug
      
      * fix conflict
      
      * fix code-style
      
      * fix sample codes
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix bug
      
      * fix bug
      
      * Update qat_optimizer.py
      
      * Update __init__.py
      
      * fix bug
      
      * change optimizer base for RecomputeOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update test_imperative_optimizer_v2.py
      723c6f77
    • A
      [NewIR]Add cinn RuntimeDialect and JitKernelOp (#56074) · 12823f2b
      Aurelius84 committed
      * [NewIR]Add cinn RuntimeDialect and JitKernelOp
      
      * remove PointerAttribute register
      
      * fix comment
      12823f2b
  4. 08 Aug, 2023 13 commits
  5. 07 Aug, 2023 1 commit
    • Y
      Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8
      yin wei committed
      * add mask
      
      * add backward
      
      * add enforce info
      
      * update scale
      
      * integrate code
      
      * update enforce
      
      * add enforce eq
      
      * add error type
      
      * update enforce
      
      * add test_flash_attention
      
      * Polish codes and fix compiling errors.
      
      * Set num_splits to 0 for flash-attn with tensor mask.
      
      * Fix the compiling error for non flash-attn case.
      
      ---------
      Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
      42e0c6b8
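
To illustrate what an attention mask changes, here is a plain reference implementation of single-head scaled dot-product attention with an optional additive mask. It is only a sketch: it assumes the mask is added to the scaled scores before the softmax, it does not reproduce the tiled FlashAttention algorithm, and the function name `attention` is hypothetical rather than part of the kernel touched by the commit above.

```python
import math


def attention(q, k, v, mask=None):
    """Reference attention for one head.

    q: [Lq][d], k: [Lk][d], v: [Lk][dv]; mask: [Lq][Lk] additive values or None.
    A mask entry of -inf removes that key from the softmax for that query.
    Each query row must keep at least one unmasked key.
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for i, qi in enumerate(q):
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        if mask is not None:
            scores = [s + m for s, m in zip(scores, mask[i])]
        top = max(scores)  # subtract the row maximum for numerical stability
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([
            sum(w * vj[c] for w, vj in zip(weights, v)) / total
            for c in range(len(v[0]))
        ])
    return out


if __name__ == "__main__":
    q = [[1.0, 0.0]]
    k = [[1.0, 0.0], [0.0, 1.0]]
    v = [[1.0, 0.0], [0.0, 1.0]]
    # Masking the second key forces all attention weight onto the first key.
    print(attention(q, k, v, mask=[[0.0, float("-inf")]]))
```

A reference like this is the kind of baseline a test such as `test_flash_attention` could compare a fused kernel against, within a floating-point tolerance.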