1. 09 2月, 2023 7 次提交
    • H
      [PHI decoupling] move strided_memcpy.h to phi (#50346) · 17318c1a
      Huang Jiyi 提交于
      * decouple strided_memcpy
      
      * move strided_memcpy
      
      * move strided_memcpy to phi
      
      * fix namespace
      
      * update
      
      * fix gpu compile bugs
      17318c1a
    • Y
      Add MultiTenosrAdam OP (#49220) · 10654c77
      yuehuayingxueluo 提交于
      * add multi_tenosr_adam
      
      * update multi_tensor_base.py, test_multi_tensor_adam.py, adamw.py
      
      * fix adam.py optimizer.py
      
      * fix adamw.py
      
      * fix test_multi_tensor_adam.py
      
      * fix CI bug
      
      * fix CI coverage
      
      * fix ci bug
      
      * fix betapow
      
      * fix some bugs
      
      * fix test_adamw_op.py
      
      * fix CI coverage
      
      * fix multi_tensor_adam_kernel.cc
      
      * fix CI bug
      
      * fix multi_tensor_adam_op.cc and test_multi_tensor_adam.py
      
      * fix code style
      
      * update C++ parts
      
      * remove python parts modification temporarily
      
      * add C++ ut
      
      * update betapow copy code logic
      
      * fix ci ut
      
      * fix windows ci
      
      * fix coverage ci
      
      * improve coverage rate
      
      ---------
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      10654c77
    • L
      fix gc bug and start interceptor (#50344) · 5d5cb256
      LiYuRio 提交于
      5d5cb256
    • Z
      [Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0
      zhoutianzi666 提交于
      * add fmha_flashattention oss plugin
      
      * add fmhca
      
      * add oss fmhca
      
      * code reconstruct and add ut
      
      * code style refine
      
      * fix ut and enforce check
      
      * refine trt version check
      
      refine compile
      
      fix compile
      
      * fix cross ut
      
      * code refine
      
      * use runtime trt version check
      
      * bug fix and code refine
      
      * compile fix
      
      * merge develop
      
      * add GN QDQ kernel
      
      * support GN int8 fake kernel
      
      * add with_int8
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8  UT
      
      * add verison > 8000  in GN int8  UT
      
      * add some check in .cu
      
      * add stdlib.h in UT
      
      * little change  in .cu
      
      * remove rand_r use rand
      
      * remove use rand
      
      * setAxis(1)
      
      * when int8 is on allow fall back to fp16
      
      ---------
      Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
      d93c63a0
    • K
      [BugFix][ConditionalBlock] fix judgement about scope validation (#50086) · 61f9f136
      kangguangli 提交于
      * fix judgement about scope validation
      
      * fix ci bug: same address is not enough for data consistency
      
      * remove useless check
      61f9f136
    • P
      Fix pscore test (#50349) · fe811625
      pangengzheng 提交于
      fe811625
    • W
      [TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
      Wang Bojun 提交于
      * trans_layernorm
      b2bb7ec9
  2. 08 2月, 2023 13 次提交
  3. 07 2月, 2023 9 次提交
  4. 06 2月, 2023 11 次提交