1. 29 Dec, 2022 1 commit
  2. 20 Dec, 2022 1 commit
  3. 29 Nov, 2022 1 commit
    • Y
      [cherry-pick] updating mul and matmul with set_mem_desc and fix squeeze_transpose for MKLDNN (#47951) · 9e2ba9b9
      yeliang2258 committed
      * Fix slice bugs in MKLDNN when input dims are zeros (#46671)
      
      * fix slice bugs
      
      * fix
      
      * update code
      
      * fix
      
      * update code
      
      * updating mul and matmul with set_mem_desc (#45624)
      
      * - mul & matmul changes
      
      - fix
      
      - bs16 correction of strides
      
      * - cosmetic fixes
      
      * - lint
      
      * - fix
      
      * - fix
      
      * - format -> mem_desc
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * fix squeeze_transpose (#47911)
      Co-authored-by: Jacek Czaja <jacek.czaja@intel.com>
  4. 28 Nov, 2022 1 commit
    • Z
      Cherrypick NV fixes to release/2.4 (#48263) · 7a0b8625
      zlsh80826 committed
      * Reduce squeeze2_matmul_fuse_pass, flatten tests time (#47098)
      
      * Add missing fp32 config and reduce the testing combination
      
      * Reduce trt matmul pass test max examples
      
      * Loosen TRT fp16 tests tolerance (#47100)
      
      * Loosen TRT half test tolerance to 1e-3 (#47101)
      
      * Loosen TRT half test tolerance to 1e-3 (#47106)
      
      * Update distributed_strategy.proto (#46531)
      
      * Close popen pipe after use (#47053)
      
      * Add launch_bounds (#47285)
      
      * Fix TRT UT failures (#47488)
      
      * Format cherry-picked commits
      
      * CudnnNormConvolution is no longer supported on NVIDIA Hopper GPUs (#48203)
      
      * Skip tests that use fused_ops on H100
      
      * Add error message to FusedOps on H100
      Co-authored-by: Shijie <505749828@qq.com>
      Co-authored-by: Leo Chen <39020268+leo0519@users.noreply.github.com>
      Co-authored-by: Tian Zheng <tizheng@nvidia.com>
  5. 11 Nov, 2022 1 commit
  6. 09 Nov, 2022 1 commit
  7. 08 Nov, 2022 1 commit
  8. 07 Nov, 2022 1 commit
  9. 03 Nov, 2022 1 commit
  10. 01 Nov, 2022 1 commit
    • Z
      [cherry-pick][code-gen] Support code-gen for opmaker of sparse op (#46993) (#47417) · 601626ac
      zyfncg committed
      * support generating code of opmaker for backward op invoke forward op (#46912)
      
      * [code-gen] Support code-gen for opmaker of sparse op (#46993)
      
      * support generating code of opmaker for backward op invoke forward op
      
      * support code-gen of opmaker for sparse op
      
      * refine logic of choosing phi kernel
      
      * fix compile bug
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix compile bug of VarType
      
      * fix compile bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      
      * [Phi] Refactor logic of judging whether having a phi kernel (#46920)
      
      * refine logic of choosing phi kernel
      
      * fix compile bug
      
      * update cmake
  11. 28 Oct, 2022 2 commits
  12. 27 Oct, 2022 1 commit
  13. 26 Oct, 2022 3 commits
  14. 21 Oct, 2022 1 commit
  15. 20 Oct, 2022 4 commits
  16. 19 Oct, 2022 2 commits
  17. 18 Oct, 2022 2 commits
  18. 17 Oct, 2022 1 commit
  19. 14 Oct, 2022 1 commit
  20. 13 Oct, 2022 2 commits
  21. 11 Oct, 2022 3 commits
  22. 10 Oct, 2022 5 commits
  23. 29 Sep, 2022 1 commit
    • Z
      [cherry-pick] Open the clip_extra flag in save_inference_model (#46577) · d67da3dc
      zyfncg committed
      * set flag of clip_extra in save_inference_model to true (#46151)
      
      * open the clip_extra flag in paddle.static.save_inference_model, test=allcase (#46456)
      
      * Open the clip_extra flag in TracedLayer.save_inference_model (#46473)
      
      * open the clip_extra flag in paddle.static.save_inference_model, test=allcase
      
      * set the default value of clip_extra in TracedLayer from False to True, test=allcase
      
      * update english doc of paddle.static.save_inference_model, test=document_fix (#46484)
      
      * Fix clip_extra logic in remove_training_info (#46534)
      
      * fix clip_extra code in remove_training_info
      
      * revert rnn opmaker clear
  24. 28 Sep, 2022 1 commit
    • Z
      [cherry-pick] Clear extra attrs of some ops in OpMaker (#46150, #46321, #46418, #46451, #46457) (#46553) · b2e4211d
      zyfncg committed
      * Clear extra attributes of some Op in OpMaker (Part4) (#46060)
      
      * clear extra attr of some ops in opmaker
      
      * revert clear use_cudnn for pool
      
      * fix test_operator_desc
      
      * fix Attr interface of OperatorBase
      
      * clear extra attrs of condition op in opmaker (#46150)
      
      * Clear extra attrs of lookup_table_v2 in OpMaker (#46321)
      
      * clear extra attrs of look_up_table_v2 in opmaker
      
      * fix bug
      
      * clear extra attrs of quantize op in opmaker (#46418)
      
      * delete repeated item
      
      * clear extra attrs of distribute op in opmaker (#46451)
      
      * clear extra attrs of sequence_softmax in opmaker (#46457)
  25. 27 Sep, 2022 1 commit
    • Z
      [cherry-pick] clear extra attrs of some ops in OpMaker (#45845, #45984, #46060) (#46218) · 0cc2251f
      zyfncg committed
      * Clear extra attrs of elementwise op in OpMaker (#45845)
      
      * clear extra attrs of elementwise op in opmaker
      
      * fix op_debug_string_test
      
      * fix bug of grad_add
      
      * fix sort of runtime attrs
      
      * Clear extra attrs of scale in OpMaker (#45984)
      
      * clear extra attr of scale in opmaker
      
      * fix sum bug
      
      * fix merge conflict
      
      * fix minus
      
      * Clear extra attributes of some Op in OpMaker (Part4) (#46060)
      
      * clear extra attr of some ops in opmaker
      
      * revert clear use_cudnn for pool
      
      * fix test_operator_desc
      
      * fix Attr interface of OperatorBase
      
      * fix code style