1. 10 8月, 2023 4 次提交
  2. 09 8月, 2023 16 次提交
    • X
      【New IR】backward code of new ir (#55957) · 0399b39f
      xiaoguoguo626807 提交于
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * add vjp interface
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * add eager and static backend for warp lower level api
      
      * support call_vjp pybind
      
      * polish code and add test for vjp
      
      * remove useless code
      
      * polish code
      
      * remove useless code
      
      * support mean vjp
      
      * backward origin code
      
      * add test for mean vjp and support has_vjp function
      
      * fix call_vjp
      
      * polish code
      
      * add attrs and dtype interface
      
      * add primitive ops set for backend
      
      * fix compile bugs
      
      * fix some bugs
      
      * fix windows bugs
      
      * add vjp test for tanh_
      
      * fix inference CI
      
      * fix inference ci
      
      * modify fluid cmake
      
      * origin test of tanh and mean passed
      
      * fix conflict
      
      * modify stop_gradient
      
      * modify block.ops
      
      * modify test
      
      * fix conflict
      
      * reply review comments
      
      * reply review comments
      
      * pulish code
      
      ---------
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: NCharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      0399b39f
    • X
      [New IR] add Add_n op description and python api (#56080) · 7cbb433a
      xiaoguoguo626807 提交于
      * refine program translator
      
      * fix warning: not override
      
      * fix bug
      
      * merge new modifications
      
      * modify by reviews
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix
      
      * fix
      
      * fix conflicts
      
      * pseudocode of backward
      
      * modify test
      
      * modify register op
      
      * clear other code
      
      * modify ci build bug
      
      * reply review comments
      
      * reply review comments
      
      ---------
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      7cbb433a
    • X
      [Paddle Inference] Set softmax op use_cudnn default true. (#56036) · 4f2cf7fb
      xiaoxiaohehe001 提交于
      * fix_softmax_eigen
      
      * fix_ctest_seresnet
      
      * fix_ci_error
      4f2cf7fb
    • X
      [New Feature] add _inplace_assign interface for sot. (#56077) · b4a4f1bb
      xiongkun 提交于
      * [New Feature] add _inplace_assign interface for sot
      
      * add unittest for inplace_assign
      b4a4f1bb
    • X
      [oneDNN]rename macro to PADDLE_WITH_DNNL (#52208) · 6ff4c130
      Xinyu Chen 提交于
      * onednn: rename macro to PADDLE_WITH_DNNL
      
      * onednn: rename macro to CINN_WITH_DNNL
      6ff4c130
    • R
      [clang-tidy] fix modernize-make-unique (#55764) · 9f04f2ac
      Ruibin Cheung 提交于
      9f04f2ac
    • C
      Add FP16 & BF16 for nanmedian (#56056) · 4ae9945b
      cyberslack_lee 提交于
      4ae9945b
    • U
      Fix select sdp for FA-2 (#56045) · 08e46d6f
      umiswing 提交于
      08e46d6f
    • N
      change index's dtype for int to int64 (#55949) · 8d181e37
      niuliling123 提交于
      8d181e37
    • H
      b982af4a
    • H
      [XPU] add pos_weight for sigmoid_cross_entropy_with_logits. (#55001) · 4315bc4c
      houj04 提交于
      * [XPU] add pos_weight for sigmoid_cross_entropy_with_logits.
      
      * update xdnn version.
      4315bc4c
    • L
      Reduce the workspace for TRT index_select UT (#56069) · 597f6d43
      Leo Chen 提交于
      597f6d43
    • F
      Set Input Type for TensorRT convert tests (#55823) · 242d26d4
      Frank Lin 提交于
      * set input type to fp16
      242d26d4
    • K
      [NewIR] minor fix about new ir test (#56075) · a127d7c8
      kangguangli 提交于
      * fix bugs about new ir test
      
      * enable dy2st newir test in all cases
      
      * fix
      a127d7c8
    • L
      remove the... · 723c6f77
      LoneRanger 提交于
      remove the AdamOptimizer、SGDOptimizer、MomentumOptimizer、ModelAverage、LookaheadOptimizer、FtrlOptimizer、DecayedAdagradOptimizer、DpsgdOptimizer in fluid and relocate the ExponentialMovingAverage、PipelineOptimizer、GradientMergeOptimizer and change optimizer base for LarsMomentumOptimizer and RecomputeOptimizer (#55970)
      
      * change the optimizer base for SGDOptimizer
      
      * change the optimizer base for SGDOptimizer
      
      * replace the SGDOptimizer with SGD
      
      * fix bug of sgd
      
      * change the optimizer base for MomentumOptimizer
      
      * fix the remaining tests
      
      * remove the Momentum in fluid/optimizer.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * Update test_resnet_cinn.py
      
      * Update test_resnet_prim_cinn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * remove the ModelAverage in fluid
      
      * remove the LookaheadOptimizer in fluid
      
      * fix bug
      
      * remove AdamOptimizer in fluid
      
      * Update test_image_classification_fp16.py
      
      * fix bug
      
      * relocate the ExponentialMovingAverage in fluid
      
      * restore the static api
      
      * remove the FtrlOptimizer in fluid
      
      * remove the DecayedAdagradOptimizer in fluid
      
      * remove the DpsgdOptimizer in fluid
      
      * fix bug
      
      * fix codestyle
      
      * fix bug
      
      * fix bug
      
      * relocate the PipelineOptimizer
      
      * relocate the GradientMergeOptimizer
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix doc
      
      * Update __init__.py
      
      * Update test_fleet_qat_meta_optimizer.py
      
      * change optimizer base for LarsMomentumOptimizer
      
      * fix bug
      
      * fix conflict
      
      * fix code-style
      
      * fix sample codes
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix bug
      
      * fix bug
      
      * Update qat_optimizer.py
      
      * Update __init__.py
      
      * fix bug
      
      * change optimizer base for RecomputeOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update test_imperative_optimizer_v2.py
      723c6f77
    • A
      [NewIR]Add cinn RuntimeDialect and JitKernelOp (#56074) · 12823f2b
      Aurelius84 提交于
      * [NewIR]Add cinn RuntimeDialect and JitKernelOp
      
      * remove PointerAttribute register
      
      * fix comment
      12823f2b
  3. 08 8月, 2023 13 次提交
  4. 07 8月, 2023 7 次提交
    • Y
      Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8
      yin wei 提交于
      * add mask
      
      * add backword
      
      * add enforce info
      
      * update scale
      
      * integrate code
      
      * update enforce
      
      * add enforce eq
      
      * add error type
      
      * update enforce
      
      * add test_flash_attention
      
      * Polish codes and fix compiling errors.
      
      * Set num_splits to 0 for flash-attn with tensor mask.
      
      * Fix the compiling error for non flash-attn case.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      42e0c6b8
    • Y
      [New IR]Add attrs Interface for Python (#55974) · 02e6347d
      YuanRisheng 提交于
      * add attrs and dtype interface
      
      * fix compile bugs
      
      * fix some bugs
      
      * fix windows bugs
      02e6347d
    • Y
      [Inference] save_optimized_model_pass support tensorrt (#55893) · 6b10c0e5
      Yuanle Liu 提交于
      * fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward
      
      * save_optimized_model_pass support tensorrt
      
      * update
      
      * update
      
      * fix compile
      
      * update
      
      * fix ut timeout
      6b10c0e5
    • G
      5ada98b8
    • R
      30a02d27
    • C
      Fix typos (#56008) · 4d094b0c
      co63oc 提交于
      4d094b0c
    • X
      [dy2static] PaddleSOT pr (#54202) · c1913a5f
      xiongkun 提交于
      * add paddle-symbolic-trace to paddle
      
      * add symoblic trace
      
      * delete swp
      
      * support Layer in symbolic trace
      
      * fix test-symbolic-trace, make symbolic trace return a StaticFunction
      
      * template the error message
      
      * fix some unittest
      
      * Modify the execution mode of test
      
      * Modify the module name
      
      * add dy2static unittest decorator
      
      * change some unittest files by @ast_only_test
      
      * fix unittest.
      
      * test-symbolic-trace
      
      * update test_write_python_container.py
      
      * update
      
      * fix test_param_parse.py
      
      * add submodule and ln -sf in cmakefile
      
      * update
      
      * update
      
      * fix some ast only errors
      
      * update
      
      * Polish ut
      
      * fix unittests
      
      * update
      
      * update
      
      * fix unittests
      
      * update
      
      * test warning ast only
      
      * update
      
      * Ast only some uts
      
      * Fix unitests
      
      * test_error ast only
      
      * update
      
      * update
      
      * Support build_strategy for sot
      
      * update
      
      * import sot as a third party module
      
      * update
      
      * update
      
      * Polish code
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * remove old fluid api and use paddle.nn.relu instead
      
      * fix
      
      * comment the print of ast code
      
      * add try-finally block
      
      * fix dy2static stop-gradient bugs
      
      * fix code
      
      * remove unused submodule and minor codestyle fix
      
      * fix
      
      * fix cast error
      
      * fix interpolate meets int64 in static model
      
      * add evalframe support for py311
      
      * fix
      
      * fix err
      
      * switch ENABLE_FALL_BACK=False
      
      * fix
      
      * Fix CI for some unittest
      
      * add ENABLE_SOT
      
      * remove setup.py dependences
      
      ---------
      Co-authored-by: NNotHaozi <zhangmenghao@baidu.com>
      Co-authored-by: Nfeifei-111 <2364819892@qq.com>
      Co-authored-by: N0x45f <wangzhen45@baidu.com>
      Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
      c1913a5f