1. 28 Mar, 2023 2 commits
  2. 23 Mar, 2023 2 commits
  3. 21 Mar, 2023 1 commit
  4. 20 Mar, 2023 2 commits
    • Y
      [PHI Decoupling]Add PHI init for extension (#51511) · 62712aec
      Committed by YuanRisheng
      * remove init
      
      * delete fluid in context pool
      
      * fix custom op bugs
      
      * fix profiler bugs
      
      * fix ci bugs
      
      * fix Windows compile bugs
      
      * fix Windows bugs
      
      * fix Windows bugs
    • S
      [Hackathon NO.71] Add pad3d operator for Paddle-TRT (#50986) · c36e3fd2
      Committed by Sonder
      * update codes about pad3d
      
      * add codes about Tensor type Padding
      
      * update
      
      * update unit test file
      
      * format code style
      
      * update 'and' to '&&'
      
      * rewrite codes about pad3d
      
      * add codes about converting paddle pad format to tensorrt pad format
      
      * fix some errors
      
      * specify TRT version range
      
      * fix dims initialization
      
      * fix code style
      
      * update test pad values
      
      * specify TRT version for pad3d
      
      * update unit test file scope
      
      * update unit test file
      
      * update pad3d paddings convert codes
      
      * update pad3d
      
      * add static mode support
      
      * update test file
      
      * fix bugs about dynamic mode test codes
      
      * fix bug and add limit in op_teller
      
      * use a new padding conversion method [for ITensor* padding, use Slice to split the pre_pad and the post pad]
      
      * fix PADDLE_THROW grammar error
      
      * update test codes
      
      * add size check for Tensor padding
  5. 16 Mar, 2023 2 commits
  6. 14 Mar, 2023 1 commit
  7. 13 Mar, 2023 1 commit
    • W
      squeeze2_op (#51146) · f9a4f007
      Committed by wenbin
      * squeeze2_op
      
      * add ut
      
      * fix ut
      
      * fix static
      
      * modify ut
  8. 09 Mar, 2023 2 commits
  9. 07 Mar, 2023 1 commit
  10. 06 Mar, 2023 1 commit
  11. 02 Mar, 2023 3 commits
  12. 28 Feb, 2023 1 commit
  13. 22 Feb, 2023 1 commit
  14. 14 Feb, 2023 1 commit
    • X
      add setvalue trt converter (#50341) · 2548657e
      Committed by xjmxyt
      * add cast setvalue op
      
      * add set_value to op teller
      
      * renew test and add description
      
      * add setAxis and add complex test
      
      * change test
  15. 11 Feb, 2023 1 commit
    • W
      [TRT] elementwise_add+transpose fusion (#50081) · fd0d4fa4
      Committed by Wang Bojun
      * eleadd_trans first version
      
      log fix
      
      * refine code for linear format, add pass check
      
      * linear format refine and ut fix
      
      * fix ut
      
      * windows ut
      
      * windows ut 2
      
      * move tensorMeta and alloc to configure
  16. 09 Feb, 2023 2 commits
    • Z
      [Paddle-TRT] GroupNorm int8 nchw32 fake kernel (#50146) · d93c63a0
      Committed by zhoutianzi666
      * add fmha_flashattention oss plugin
      
      * add fmhca
      
      * add oss fmhca
      
      * code reconstruct and add ut
      
      * code style refine
      
      * fix ut and enforce check
      
      * refine trt version check
      
      refine compile
      
      fix compile
      
      * fix cross ut
      
      * code refine
      
      * use runtime trt version check
      
      * bug fix and code refine
      
      * compile fix
      
      * merge develop
      
      * add GN QDQ kernel
      
      * support GN int8 fake kernel
      
      * add with_int8
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 fake kernel
      
      * add GN int8 UT
      
      * add version > 8000 in GN int8 UT
      
      * add some check in .cu
      
      * add stdlib.h in UT
      
      * little change  in .cu
      
      * remove rand_r use rand
      
      * remove use rand
      
      * setAxis(1)
      
      * when int8 is on allow fall back to fp16
      
      ---------
      Co-authored-by: Nwwbitejotunn <wang_bojun@outlook.com>
    • W
      [TRT] Transpose layernorm fusion with different input format (#50082) · b2bb7ec9
      Committed by Wang Bojun
      * trans_layernorm
  17. 08 Feb, 2023 1 commit
  18. 06 Feb, 2023 1 commit
  19. 01 Feb, 2023 1 commit
    • W
      Preln fix (#49802) · e03718f5
      Committed by Wang Bojun
      * preln_residual 2 fused_bias_residual
      
      * skip layernorm fix and ut
      
      * code refine
      
      * code style refine
      
      * fix ut
      
      * fix output
      
      * add trt layer fall back info
      
      * refine op teller and ut
      
      * DropoutMaskOut output fix
  20. 31 Jan, 2023 2 commits
    • W
      gn_silu (#49928) · 111075a3
      Committed by wenbin
      * gn_silu
      
      * add ut
      
      * set TIMEOUT
      
      * correct comments
      
      * comments
      
      * disable windows ut
      
      * rename parameter
    • W
      Unary (#49914) · 0d9185b9
      Committed by wenbin
      * disable integer
      
      * disable integer
      
      * add cast layer
  21. 18 Jan, 2023 1 commit
  22. 13 Jan, 2023 2 commits
    • W
      add oss flash fmha and fmhca support (#49438) · a48b8e2c
      Committed by Wang Bojun
      * add fmha_flashattention oss plugin
    • Z
      [inference][trt]set output data type of trt network (#49712) · 690d7a69
      Committed by Zhang Jun
      * update trt engine to set in/out data type
      
      * update
      
      * Update engine.cc
      
      * Update engine.cc
      
      * update
      
      * set engine output type before freeze the network
      
      * update
      
      * update trt autoscan ut
      
      * update
      
      * update ut
      
      * fix equal bug, update ut
      
      * fix cast and equal ut
      
      * update cast ut using TRT < 8.4
      
      * set datatype from scope
      
      * check output var is nullptr
      
      * Update op_converter.h
      
      * update tensorrt_engine_op_test ut
      
      * update
  23. 12 Jan, 2023 1 commit
  24. 11 Jan, 2023 1 commit
  25. 10 Jan, 2023 1 commit
  26. 09 Jan, 2023 1 commit
    • W
      Preln groupnorm (#49463) · 591be3bd
      Committed by wenbin
      * skip_groupnorm
      
      * init
      
      * preln
      
      * add ut
      
      * more assert
      
      * set timeout
      
      * fix windows ci issue
  27. 05 Jan, 2023 1 commit
  28. 03 Jan, 2023 1 commit
  29. 30 Dec, 2022 2 commits