1. 26 Jun, 2022 (1 commit)
  2. 23 Jun, 2022 (2 commits)
    • 【code format】fix cpplint style 5 (#43733) · 75080988
      wangzhen38 committed
      * fix cpplint style 5
      
      * fix cpplint style 5
      
      * fix cpplint style 5
      
      * fix cpplint style 5
      
      * fix cpplint style 5
      
      * fix cpplint style 5
      
      * fix cpplint style 5
    • [external reviewing] Params to int8 pass (#42625) · b8b2d6a9
      Sylwester Fraczek committed
      * sylwek
      
      prototype params to int8 pass
      
      * trying to make warmup work
      
      * wip
      
      * wip
      
      * change test to cpp test
      
      * review fixes, refactoring
      
      * more refactoring
      
      * add erasevars
      
      * change test to fixture
      
      * rename pass
      
      and reorder erasevars and graphsaferemovenodes
      
      * fix
      
      * more refactoring and fixed bug
      
      * formatting
      
      * remove scale count
      
      * enforce message too short
      
      * remove erasevars
      
      erasevars could be cause of memory issues
      
      some other fixes
      
      * add count of successful fuses to name of new nodes
      
      * FindVar -> GetVar and use ConvResidual pattern
      
      * use tensor->clear() instead of new variable
      
      * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/fluid/framework/ir/mkldnn/params_quantization_mkldnn_pass_tester.cc
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      
      * Update paddle/fluid/inference/tests/api/analyzer_lexical_analysis_gru_tester.cc
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      
      * add log (review fix)
      
      * review fix (2 functions to one)
      
      * code review: Conv->QuantizeConv
      
      * revert
      
      * fix formatting
      
      * remove unused functions
      
      * add paddle enforce
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
  3. 21 Jun, 2022 (2 commits)
  4. 09 Jun, 2022 (1 commit)
  5. 07 Jun, 2022 (1 commit)
  6. 05 Jun, 2022 (1 commit)
  7. 02 Jun, 2022 (1 commit)
    • Enable fc on bfloat16 (#43154) · cb1a0ec1
      Tomasz Socha committed
      * Enable fc on bfloat16
      
      * Add pass for residual connection
      
      * Disable Residual connection pass for now
      
      * Ban ResidualData from DQ
      
      * style
      
      * WO for python tests
  8. 31 May, 2022 (1 commit)
  9. 30 May, 2022 (1 commit)
  10. 22 May, 2022 (1 commit)
  11. 16 May, 2022 (1 commit)
  12. 11 May, 2022 (1 commit)
    • Move weights and biases scale computing into pass (#42241) · c0652972
      Zuza Gawrysiak committed
      * Add int8 scales gathering pass for convolution
      
      * Fix typo
      
      * Add unittest
      
      * Add corrected unit test
      
      * Change test name
      
      * Remove enabling mkldnn in test
      
      * Speed up test
      
      * Change max examples
      
      * Add functional test
      
      * Change test name
      
      * Add new test case
      
      * Rename pass
  13. 10 May, 2022 (1 commit)
    • Re-add conv_affine_channel fuse pass as oneDNN only pass (#41998) · 3540d33b
      piotrekobi committed
      * Readd conv_affine_channel fuse pass as mkldnn pass
      
      * Fix formatting
      
      * Add new test to parallel_UT_rule.py
      
      * Fix Coverage and Windows CI issues
      
      * Revert "Fix Coverage and Windows CI issues"
      
      This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637.
      
      * Fix CI errors
      
      * Remove unnecessary conv_eltwise_add_affine_channel fuse pass
      
      * Remove test from parallel_UT_rule.py
  14. 28 Apr, 2022 (1 commit)
    • Bfloat16 refactor (#42238) · 8ad38701
      Tomasz Socha committed
      * Refactor Quantization
      
      * Refactor Dequantization
      
      * Classy solution
      
      * Style I
      
      * Style II
      
      * Style III
      
      * Use VLOG(4) for debug info
      
      * Style IV
  15. 27 Apr, 2022 (1 commit)
  16. 14 Apr, 2022 (3 commits)
    • FC+elementwise_add (residual connection) (#41776) · 92d8d0bc
      Sławomir Siwek committed
      * Change tensor name to match activation
      
      * declare fc_eltwise_add pass
      
      * merge conv_eltwise refactor PR
      
      * first compilable draft
      
      * unittest feedback tools
      
      * Fuse pass tester
      
      * Move IsReachable() to shared file
      
      * 100% coverage of fuse_pass_tester.cc
      
      * register pass
      
      * Add bias node
      
      * Improve unit tests / remove bias node from pattern
      
      * improve fc_eltwiseadd_unittest
      
      * cancel eltwise_add fuse if act is already fused
      
      * Add elementwise_input scale
      
      * Residual MVP
      
      * Add new FC attrs
      
      * Add more test cases
      
      * Add missing op attrs
      
      * Adapt code to new Elementwise pattern
      
      * reuse existing fcpattern
      
      * improve code style
      
      * remove unused arguments
      
      * fix typo
      
      * remove whitespace
      
      * remove int8 related code
      
      * Remove attributes from base ops
      
      * style
      
      * style check
      
      * Remove input from base op
      
      * Set attribute during fuse
      
      * ut timeout
      
      * download and test model
      
      * DRY
      
      * apply feedback from review
      
      * Style check
      
      * fix typo
      
      * cosmetic changes
      
      * explicitly set residual as output
      
      * VIT-OCR accuracy check
      
      * trigger CI
      
      * remove whitespaces
      
      * fix missing data file
    • add mkldnn int8 pass [step3] (#41599) · 8e2d4d30
      baoachun committed
      * add mkldnn int8 pass [step3]
      
      * Add test for compute_propagate_scales_mkldnn_pass
      
      * update pass
      
      * update api comment and python api
      Co-authored-by: wozna <joanna.wozna@intel.com>
    • Added shuffle_channel BF16/FP32 FWD oneDNN kernel (#39756) · c7623d72
      jakpiase committed
      * added shuffle_channel bf16/fp32 fwd kernel
      
      * added missing files
      
      * CI fix
      
      * changed from pten to phi
      
      * tmp save
      
      * added reviewers suggestions
      
      * fix for test
  17. 10 Apr, 2022 (2 commits)
  18. 04 Apr, 2022 (1 commit)
  19. 24 Mar, 2022 (1 commit)
  20. 23 Mar, 2022 (1 commit)
  21. 21 Mar, 2022 (1 commit)
  22. 18 Mar, 2022 (1 commit)
  23. 16 Mar, 2022 (1 commit)
  24. 15 Mar, 2022 (2 commits)
    • oneDNN NHWC fixes (#40049) · dde9cec0
      Jacek Czaja committed
      * - Prototype of third solution
      
      - fix
      
      - compilation fixes
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - compilation fix
      
      - comment fix
      
      - lint
      
      update mkldnn conv_elementwise_add_fuse_pass ut
      
      - NHWC changes to prelu
      
      - alpha dims
      
      - UT fix
      
      - fix to UT
      
      - lint
      
      - Some fixes
      
      - added to BWD of prelu NHWC support
      
      - reverted removal of resetting cu_layout in clearing of caching
      
      * - Small changes
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * lint
      
      * - fixes after internal review
      
      * - compilation fix
      
      * - lint
    • [Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180
      YuanRisheng committed
      * move activation op
      
      * adjust code format
      
      * fix compile bugs
      
      * fix ci bugs
      
      * code format adjust
      
      * code format adjust2
      
      * activate ci status
      
      * modify according to comment
      
      * move activation kernel
      
      * revert relu6
      
      * reduce add code
      
      * perfect use_phi_functor
      
      * completing func name
      
      * fix bugs when run ci
      
      * fix bugs when run infr
      
      * modify infrt get kernel signature
  25. 14 Mar, 2022 (1 commit)
    • Add an elementwise + activation fusion pass. (#36541) · 3f219160
      Tomasz Socha committed
      * Add elementwise add and activation fuse pass
      
      * Fix copy elision
      
      * More flexible pattern detector
      
      * More flexible fusion pass
      
      * Update lists for pass
      
      * Add support for Pow operator
      
      * Add support for more activation types
      
      * Style
      
      * Rename fusion pass
      
      * First version of tests
      
      * Dirty version of pass
      
      * Polished version
      
      * Update pbtxt
      
      * Style
      
      * Update names
      
      * Style
      
      * Use PADDLE_ENFORCE_EQ
      
      * Save error message to variable
      
      * WO for error checks
      
      * CR
      
      * Static style check
      
      * Add missing 'activation_scale' attribute
      
      * Add relu6 and sigmoid activations
      
      * Style
      
      * Fix fuse list formatting
      
      * Sync filenames for fuse pass files
      
      * Fix cmake after move
      
      * Fix registration
      
      * Fix pass name in tests
      
      * Add missing activations to checker
      
      * WIPS
      
      * Working mul op
      
      * Working sub
      
      * Working Add
      
      * Remove pten includes
      
      * Remove some forward declarations
      
      * Remove Includes
      
      * Fixes
      
      * Remove default kernels
      
      * Add check if post_ops attributes are available
      
      * Style
      
      * Code adjustment
      
      * Register default kernels
      
      * We have year 2022 not 2021...
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Fast review fixes
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
      
      * Review Fix
      
      * Rename one_dnn -> onednn
      
      * Style after review
      
      * Fast and dirty fix for quantization
      
      * Update tests
      
      * Style
      
      * Fix mkldnn_quantizer config
      
      * Add Joanna's suggestion.
      
      * Check if operator is explicitly disabled on OneDNN
      
      * Try to use unregistered attributes
      
      * Style
      
      * Test new framework
      
      * FXI
      
      * FXII
      
      * Update test
      
      * Style
      Co-authored-by: jakpiase <jakpia21@gmail.com>
      Co-authored-by: Sylwester Fraczek <sylwester.fraczek@intel.com>
  26. 11 Mar, 2022 (1 commit)
  27. 10 Mar, 2022 (1 commit)
  28. 08 Mar, 2022 (1 commit)
  29. 03 Mar, 2022 (1 commit)
    • Move bn to pten (#39347) · ebd0f512
      hong committed
      * add bn cpu version; test=develop
      
      * move batch norm to pten
      
      * move batch norm to pten; test=develop
      
      * fix bug; test=develop
      
      * fix func::transpose depend bug; test=develop
      
      * fix compile bugs; test=develop
      
      * fix use_op batch_norm bug; test=develop
      
      * fix cudnn bn add relu test; test=develop
      
      * fix pten context build and double grad bug; test= develop
      
      * remove useless code; test=develop
      
      * add batch norm gpu fp16 support; test=develop
      
      * fix test bn op bug; test=develop
      
      * remove output dtype set; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * fix apply pass to program bug; test=develop
      
      * revert to develop; test=develop
      
      * fix rocm bug; test=develop
      
      * revert operator to develop; test=develop
      
      * fix pre_commit; test=develop
      
      * fix static check error; test=develop
      
      * resolve conflict; test=develop
      
      * ana batch norm bug;
      
      * revert batch norm op
      
      * resolve conflict
      
      * fix nan inf and speed bug; test=develop
      
      * fix bug; test=develop
      
      * fix error; test=develop
      
      * test expand op; test=develop
      
      * fix bug; test=develop
      
      * resolve conflict
      
      * resolve conflict; test=develop
      
      * polish code; test=develop
      
      * polish code; test=develop
      
      * change mutable data to ctx alloc; test=develop
      
      * make format same with ci; test=develop
      
      * fix format error with ci; test=develop
  30. 28 Feb, 2022 (1 commit)
  31. 25 Feb, 2022 (1 commit)
    • [Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a
      Chen Weihang committed
      * support cudnn kernel moving
      
      * polish cmake rules
      
      * add unittest for coverage
      
      * remove orig kernel
      
      * remove softmax cudnn kernel
      
      * fix softmax test failed
      
      * fix npu func error
      
      * resolve conflict
      
      * rename gpu dnn kernels
      
      * fix name rule error
      
      * fix compile error
      
      * update fp16 namespace
  32. 24 Feb, 2022 (1 commit)
  33. 20 Feb, 2022 (1 commit)
  34. 19 Feb, 2022 (1 commit)
    • [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 committed
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile successfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix test file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict