1. 27 12月, 2022 1 次提交
    • H
      [Cherry-pick] Fix custom operator backward=None (#48656) (#48715) · 39eb77a6
      HongyuJia 提交于
      * [Release2.4] Revert python link prs (#48573)
      
      * Revert "Fix mac link python (#48017)"
      
      This reverts commit 3fa7a736.
      
      * Revert "[Cherry-pick] Fix python link error (#47811)"
      
      This reverts commit ff642c68.
      
      * Update config.go
      
      * fix custom operator backward=None (#48656)
      
      * [Custom Extension] Fix custom double_grad backward=None (#49224)
      
      * fix custom double_grad backward=None
      
      * fix custom_relu.cu bug && polish testcase of double_grad
      
      * remove old dynamic graph test
      
      * add import fluid
      
      * add import fluid
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      39eb77a6
  2. 22 12月, 2022 1 次提交
  3. 20 12月, 2022 1 次提交
  4. 19 12月, 2022 1 次提交
    • Y
      [cherry-pick][Inference] support mixed precision inference (#49077) · ddcd1b61
      Yuanle Liu 提交于
      * [Release2.4] Revert python link prs (#48573)
      
      * Revert "Fix mac link python (#48017)"
      
      This reverts commit 3fa7a736.
      
      * Revert "[Cherry-pick] Fix python link error (#47811)"
      
      This reverts commit ff642c68.
      
      * Update config.go
      
      * [Paddle Inference] Add float_to_half_pass to support  inference with mixed precision (#47993)
      
      * [Inference] optimize some code and fix some bug (#48780)
      
      * clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass
      
      * fix unitest timeout
      
      * [Paddle Inference] clean unused code  (#48392)
      
      * fix
      
      * update
      
      * update
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      ddcd1b61
  5. 29 11月, 2022 1 次提交
    • Y
      [cherry-pick] updating mul and matmul with set_mem_desc and fix... · 9e2ba9b9
      yeliang2258 提交于
      [cherry-pick] updating mul and matmul with set_mem_desc and fix squeeze_transpose for MKLDNN (#47951)
      
      * Fix slice bugs in MKLDNN when input dims are zeros (#46671)
      
      * fix slice bugs
      
      * fix
      
      * update code
      
      * fix
      
      * update code
      
      * updating mul and matmul with set_mem_desc (#45624)
      
      * - mul & matmul changes
      
      - fix
      
      - bs16 correction of strides
      
      * - cosmetic fixes
      
      * - lint
      
      * - fix
      
      * - fix
      
      * - format -> mem_desc
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * fix squueze_transpose (#47911)
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      9e2ba9b9
  6. 28 11月, 2022 1 次提交
    • Z
      Cherrypick NV fixes to release/2.4 (#48263) · 7a0b8625
      zlsh80826 提交于
      * Reduce squeeze2_matmul_fuse_pass, flattent tests time (#47098)
      
      * Add missing fp32 config and reduce the testing combination
      
      * Reduce trt matmul pass test max examples
      
      * Loose TRT fp16 tests tolerance (#47100)
      
      * Loose TRT half test tolerance to 1e-3 (#47101)
      
      * Loose TRT half test tolerance to 1e-3 (#47106)
      
      * Update distributed_strategy.proto (#46531)
      
      * Close popen pipe after used (#47053)
      
      * Add launch_bounds (#47285)
      
      * Fix TRT UT failures (#47488)
      
      * Format cherry-picked commits
      
      * CudnnNormConvolution is no longer supported on NVIDIA Hopper GPUs (#48203)
      
      * Skip tests that use fused_ops on H100
      
      * Add error message to FusedOps on H100
      Co-authored-by: NShijie <505749828@qq.com>
      Co-authored-by: NLeo Chen <39020268+leo0519@users.noreply.github.com>
      Co-authored-by: NTian Zheng <tizheng@nvidia.com>
      7a0b8625
  7. 25 11月, 2022 1 次提交
  8. 16 11月, 2022 1 次提交
  9. 11 11月, 2022 1 次提交
  10. 10 11月, 2022 2 次提交
  11. 09 11月, 2022 1 次提交
  12. 08 11月, 2022 2 次提交
  13. 07 11月, 2022 1 次提交
  14. 03 11月, 2022 3 次提交
  15. 01 11月, 2022 2 次提交
    • Z
      [cherry-pick][code-gen] Support code-gen for opmaker of sparse op (#46993) (#47417) · 601626ac
      zyfncg 提交于
      * support generating code of opmaker for backward op invoke forward op (#46912)
      
      * [code-gen] Support code-gen for opmaker of sparse op (#46993)
      
      * support generating code of opmaker for backward op invoke forward op
      
      * gsupport code-gen of opmaker for sparse op
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix complie bug of VarType
      
      * fix complie bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      
      * [Phi] Refactor logic of judging whether having a phi kernrel (#46920)
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * update cmake
      601626ac
    • Y
      fix p2p comm memory release logic (#47497) (#47517) · 0201ccc4
      Yuang Liu 提交于
      0201ccc4
  16. 29 10月, 2022 1 次提交
  17. 28 10月, 2022 4 次提交
  18. 27 10月, 2022 2 次提交
  19. 26 10月, 2022 3 次提交
  20. 24 10月, 2022 1 次提交
  21. 21 10月, 2022 3 次提交
    • Z
      [Paddle-TRT] Fix conv2d (#47034) · d42a1dc3
      zhoutianzi666 提交于
      * forbid Conv2D into Paddle-TensoRT when weight is not persistable.
      d42a1dc3
    • J
      Add infer prune function (#47047) · 8739497c
      JingZhuangzhuang 提交于
      * Add infer prune function
      
      * add fusion op
      8739497c
    • Y
      Add paddle audio dataset & backend (#45939) (#47230) · 29c9f027
      YangZhou 提交于
      * add audio feature dataset
      
      * fix coding style
      
      * fix coding style2
      
      * rm librosa
      
      * rm voxceleb
      
      * rm librosa in test
      
      * add scipy fftpack
      
      * add functional
      
      * fix setup
      
      * fix setup2
      
      * rm colorlog
      
      * refactor dataset __init__.py
      
      * fix converage
      
      * fix librosa import error
      
      * fix windows test
      
      * fix windows ci
      
      * rm datasets
      
      * fix setup
      
      * remove testdata
      
      * add librosa in requirement
      
      * add librosa in requirement2
      
      * change librosa to 0.8.1
      
      * update ci docker
      
      * fix ci error
      
      * fix ci error2
      
      * fix ci coverage
      
      * fix converage
      
      * fix coverage
      
      * rm audio_base in test, notest,test=coverage
      
      * fix copyright
      
      * rm backend
      
      * add datast in __init__
      
      * rm compliance&&add function test
      
      * fix setup
      
      * fix windows
      
      * fix windows2
      
      * fix test timeout
      
      * add backend & datasets
      
      * fix bugs
      
      * fix ci time issue
      
      * add dataset test
      
      * rm test_audio_feature
      
      * avoid windows isssue, tmp
      
      * note windows isssue
      
      * skip windows issue
      
      * refactor dataset test
      
      * add dataset.py
      
      * fix dtype in layers.mfcc
      
      * fix ci-static-check
      
      * fix dtype in layers.mfcc && fix ci-static-check
      
      * add relative accuracy
      
      * modity API.spec
      
      * skip cuda11.2 test
      
      * skip cuda11.2 test2
      
      * skip cuda11.2
      
      * change dataset name
      
      * fix format
      
      * update api.spec
      
      * update api.spec2
      
      * fix coverage
      
      * add dataset test
      
      * rm download load dict
      
      * rm download load dict in init
      
      * update api.spec3
      
      * fix dataset coverage
      
      * fix coverage
      
      * fix coverage2
      
      * restore api.spec
      
      * restore api.spec2
      
      * fix api-spec 3
      
      * fix api-spec 4
      
      * fix api.spec
      
      * fix api.spec6
      
      * refactor init_backend
      
      * fix typo
      
      * change paddleaudio backend set
      
      * fix get_current_audio_backend()
      
      * fix format
      
      * fix format2
      
      * remove format in parameters
      
      * fix format2
      
      * add warning massage in wave_backend && remove redundant audio util
      
      * rm audio util in print_signatures
      
      * fix format3
      
      * add tess dataset license
      
      * format warning
      
      * add more info in warning msg
      
      * add paddleaudio version check
      
      * replace dataset esc50 with tess
      
      * add tess dataset && rm numpy transform in dataset.py
      
      * fix set audio backend bug
      
      * fix equal error
      
      * fix format && coverage error
      
      * add api example
      
      * fix format
      
      * fix error
      
      * fix typo
      
      * add noqa in __init__
      
      * fix backend doc example error
      
      * rm seed in dataset
      
      * update bakcend example
      
      * fix typo
      
      * fix typo
      
      * fix example err
      
      * fix typo
      
      * fix ci dataset test
      
      * fix example fil
      
      * try to fix ci
      
      * clean dataset doc
      
      * change get_current_audio_backend to get_current_backend
      
      * creplace paddle.audio.backends.info with paddle.audio.info, same with load, save
      
      * fix ci error
      
      * repalce api in test_audio_backend
      
      * fix save&&set_backend exmaple
      29c9f027
  22. 20 10月, 2022 6 次提交