1. 19 12月, 2022 1 次提交
    • Y
      [cherry-pick][Inference] support mixed precision inference (#49077) · ddcd1b61
      Yuanle Liu 提交于
      * [Release2.4] Revert python link prs (#48573)
      
      * Revert "Fix mac link python (#48017)"
      
      This reverts commit 3fa7a736.
      
      * Revert "[Cherry-pick] Fix python link error (#47811)"
      
      This reverts commit ff642c68.
      
      * Update config.go
      
      * [Paddle Inference] Add float_to_half_pass to support  inference with mixed precision (#47993)
      
      * [Inference] optimize some code and fix some bug (#48780)
      
      * clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass
      
      * fix unitest timeout
      
      * [Paddle Inference] clean unused code  (#48392)
      
      * fix
      
      * update
      
      * update
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      ddcd1b61
  2. 29 11月, 2022 1 次提交
    • Y
      [cherry-pick] updating mul and matmul with set_mem_desc and fix... · 9e2ba9b9
      yeliang2258 提交于
      [cherry-pick] updating mul and matmul with set_mem_desc and fix squeeze_transpose for MKLDNN (#47951)
      
      * Fix slice bugs in MKLDNN when input dims are zeros (#46671)
      
      * fix slice bugs
      
      * fix
      
      * update code
      
      * fix
      
      * update code
      
      * updating mul and matmul with set_mem_desc (#45624)
      
      * - mul & matmul changes
      
      - fix
      
      - bs16 correction of strides
      
      * - cosmetic fixes
      
      * - lint
      
      * - fix
      
      * - fix
      
      * - format -> mem_desc
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * fix squueze_transpose (#47911)
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      9e2ba9b9
  3. 28 11月, 2022 1 次提交
    • Z
      Cherrypick NV fixes to release/2.4 (#48263) · 7a0b8625
      zlsh80826 提交于
      * Reduce squeeze2_matmul_fuse_pass, flattent tests time (#47098)
      
      * Add missing fp32 config and reduce the testing combination
      
      * Reduce trt matmul pass test max examples
      
      * Loose TRT fp16 tests tolerance (#47100)
      
      * Loose TRT half test tolerance to 1e-3 (#47101)
      
      * Loose TRT half test tolerance to 1e-3 (#47106)
      
      * Update distributed_strategy.proto (#46531)
      
      * Close popen pipe after used (#47053)
      
      * Add launch_bounds (#47285)
      
      * Fix TRT UT failures (#47488)
      
      * Format cherry-picked commits
      
      * CudnnNormConvolution is no longer supported on NVIDIA Hopper GPUs (#48203)
      
      * Skip tests that use fused_ops on H100
      
      * Add error message to FusedOps on H100
      Co-authored-by: NShijie <505749828@qq.com>
      Co-authored-by: NLeo Chen <39020268+leo0519@users.noreply.github.com>
      Co-authored-by: NTian Zheng <tizheng@nvidia.com>
      7a0b8625
  4. 25 11月, 2022 2 次提交
  5. 24 11月, 2022 1 次提交
  6. 16 11月, 2022 1 次提交
  7. 11 11月, 2022 2 次提交
  8. 10 11月, 2022 4 次提交
  9. 09 11月, 2022 2 次提交
  10. 08 11月, 2022 3 次提交
  11. 07 11月, 2022 6 次提交
  12. 04 11月, 2022 2 次提交
    • X
      [CherryPick] Cherry pick #45916 #46031 #47299 (#47610) · 72e1eb6b
      xiongkun 提交于
      * [ Dy2Static ] Fix bugs when select inputs meeting different shape or undefined-var (#45916)
      
      * fix select_input with different shape errors:
      1. select_input_with_buildin_type directly return non-undefinedvar branch when meeting undefined var
      2. the output shape of select_input is inferred from inputs.
      
      * reverse the logic in select_input
      
      * [warning] added warning message in cond block when one branch returns variable and another returns None (#46031)
      
      * [cherry-pick] Allow manaully set py_reader name in standalone executor (#45898) (#45931)
      
      * Allow manaully set py_reader name in standalone executor
      
      * [BugFix] while cond receives dict as input (#47299)
      
      * fix bugs while cond receives dict as input
      
      * add unittest
      
      * change flatten -> _is_sequence_except_dict
      
      * code format
      Co-authored-by: Nfeifei-111 <wuzhanfei@baidu.com>
      72e1eb6b
    • L
      [cherry-pick2.4]for CodeStyle (#47608) · cfee9c13
      Ligoml 提交于
      * only run pre-commit
      
      * only run pre-commit
      cfee9c13
  13. 03 11月, 2022 5 次提交
  14. 02 11月, 2022 1 次提交
  15. 01 11月, 2022 5 次提交
  16. 31 10月, 2022 3 次提交
    • Z
      2.4/fix engine build (#47462) · 4b3589fb
      zhaoyingli 提交于
      * update codestyle
      
      * [AutoParallel] fix fp16 for subblock (#47189)
      
      * [AutoParallel] fix fp16 for subblock
      
      * fix engine
      
      * fix comment
      
      * [AutoParallel] fix engine _build and cost method (#47263)
      
      * fix engine build method
      
      * fix import
      
      * update engine cost
      
      * update raise error
      
      * update cmakelist
      
      * revert optimizer
      
      * revert optimizer
      
      * fix unittest
      
      * fix unittest
      Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
      Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
      4b3589fb
    • Y
      [Cherry-pick][audio] rm kaiser window in audio get_window function && rm audio... · f93e9a58
      YangZhou 提交于
      [Cherry-pick][audio] rm kaiser window in audio get_window function && rm audio utils(#47469) (#47479)
      
      * [audio] rm kaiser window in audio get_window function && rm audio utils (#47469)
      
      * rm kaiser window in audio window function
      
      * rm paddle audio utils which is redundant
      
      * rm kaiser in test_audio_functions.py
      Conflicts:
      	python/paddle/audio/utils/error.py
      	python/paddle/tests/test_audio_functions.py
      
      * format
      f93e9a58
    • G
      [cherry-pick] update dygraph PTQ export_model api (#47415) · 12b9b03e
      Guanghua Yu 提交于
      * update dygraph PTQ export_model api
      
      * remove postprocess
      12b9b03e