1. 28 10月, 2021 4 次提交
    • F
      change api to support trt8 in pool3d_op_convert (#36783) (#36812) · 5fb28500
      feng_shuai 提交于
      * change api for support trt8
      5fb28500
    • H
      [Cherry-pick] Enable CTC grad compute on GPU (#36780) · 8ede9e6f
      Hui Zhang 提交于
      * Revert "Align CTC grad scale same with ESPNet (#34729)"
      
      This reverts commit 10f9644c.
      
      * ctc grad compute on gpu
      8ede9e6f
    • L
      Fix fused_attention_op and fused_feedforward_op bug when pre_layer_norm is false. (#36793) (#36816) · ae592233
      Li Min 提交于
      * Fix bug when pre_layer_norm is false.
      ae592233
    • X
      [Cherry-pick]FFT function enhancements and bugfixes (#36537) · 11b9f5f9
      Xiaoxu Chen 提交于
      * update fft api path (#36219)
      
      * update fft api path
      * add sample code for ihfft2
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      
      * fix fft axis (#36321)
      
      fix: `-1` is used when fft's axis is `0`
      
      * use unified external error message for cufft api (#36114)
      
      * fft: modify sample code result (#36325)
      
      * dynamic load mkl as a fft backend when it is avaialble and requested (#36414)
      
      * add rocm support for fft api (#36415)
      
      * move signal apis
      
      * move fft and signal API path (#2)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos in signal.py (#3)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos
      
      * disable Cache when CUFFT_VERSION >= 10200 (#4)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos
      
      * Add LRUCache for fft plans
      
      * add LRUCache for cuff and hipfft (#5)
      
      * move signal apis
      
      * move fft.py and signal.py to paddle/, fix typos
      
      * fix relative imports from fft.py and signal.py
      
      * fix typos
      
      * WIP: add cache
      
      * delete move constructor and operator= for CuFFTHandle and FFTConfig
      
      * remove log from CuFFTHandle and FFTConfig
      
      * add lrucache for fft rocm backend
      
      * disable LRUCache when CUFFT_VERSION >= 10200
      
      * disbale copy and move for hipFFTHandle; format code
      Co-authored-by: NXiaoxu Chen <chenxx_id@163.com>
      
      * remove debug message of cufftHandler
      
      * roll_op: support Tensor as input for shifts (#36727)
      
      * fix fftshift/ifftshift on static mode
      
      * update roll_op version
      
      * add more test cases for fftshift/ifftshift
      Co-authored-by: Nzhiboniu <31800336+zhiboniu@users.noreply.github.com>
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      Co-authored-by: LJQ️ <33169170+lijiaqi0612@users.noreply.github.com>
      11b9f5f9
  2. 27 10月, 2021 6 次提交
  3. 26 10月, 2021 16 次提交
  4. 25 10月, 2021 14 次提交