1. 28 11月, 2022 1 次提交
  2. 23 11月, 2022 1 次提交
  3. 21 11月, 2022 2 次提交
    • S
      add fc-residual quantization (#46917) · fed0ed34
      Sylwester Fraczek 提交于
      * add fc-residual quantization
      
      * revert removal of check for use_mkldnn
      
      * fix bug
      
      * add disable_logs
      
      * review fix
      
      call twice AreScalesPresntForNodes instead of if-else
      
      * rewrite residual input to output
      
      * revert fc mkldnn taking residual data
      
      * format fix
      
      * fix LoDTensor->DenseTensor
      
      * LoDTensor->DenseTensor
      
      * output->input
      
      * revert changes to unsupported script
      
      revert changes to unsupported script
      
      * remove fc residualdata from output blocklist in cpu_bfloat16_pass.cc
      fed0ed34
    • R
      delete unnecessary shape and slice op (#48112) · 41483383
      RichardWooSJTU 提交于
      41483383
  4. 18 11月, 2022 1 次提交
  5. 16 11月, 2022 4 次提交
  6. 15 11月, 2022 2 次提交
    • J
      Added optimization pass for oneDNN layernorm kernel (#47782) · 519e7426
      jakpiase 提交于
      * optimization for ln
      
      * fix
      
      * added output to gpd
      
      * added formatting
      
      * fix
      519e7426
    • S
      mkldnn directory cleanup (#47779) · 8a339d24
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      8a339d24
  7. 14 11月, 2022 2 次提交
  8. 10 11月, 2022 3 次提交
  9. 09 11月, 2022 2 次提交
    • J
      Fix U2++ perf (#47780) · b1fb2360
      joanna.wozna.intel 提交于
      b1fb2360
    • P
      Enable fc passes (#45704) · 7e914386
      Paulina Gacek 提交于
      * Analysis API interface for disabling fc passes
      
      * Unit tests corrected
      
      * Python API added
      
      * test runs only when PADDLE_WITH_MKLDNN
      
      * Fc op changed to relu in matmul_op_test
      
      * Disable fc passes in tests where acc drops
      
      * code formating
      
      * Unit test for analysisConf added
      
      * Unit test gpu added
      
      * fc passes disabled when iterations=0 in gru test
      
      * style
      
      * passes disabled when fp32 in gru test
      
      * fc passes disabled in lstm test
      
      * Import from inference, not fluid in doc
      7e914386
  10. 08 11月, 2022 2 次提交
  11. 07 11月, 2022 2 次提交
  12. 04 11月, 2022 2 次提交
  13. 03 11月, 2022 1 次提交
  14. 02 11月, 2022 1 次提交
  15. 01 11月, 2022 2 次提交
  16. 31 10月, 2022 1 次提交
  17. 28 10月, 2022 1 次提交
  18. 27 10月, 2022 2 次提交
    • A
      [JIT] Add Predictor for JITLayer (#47379) · b160d09e
      Aurelius84 提交于
      * add predictor_engine
      
      * add predictor_engine
      
      * fix zero shape
      
      * fix lodTensor
      
      * fix unittest
      
      * fix code style
      
      * update CmakeList
      b160d09e
    • L
      make all cpp tests dynamic linked to libpaddle.so [except windows] (#47088) · 2096448b
      Leo Chen 提交于
      * make all cpp tests dynamic linked to libpaddle.so
      
      * add comments
      
      * keep old cc_test for some tests
      
      * fix some ut
      
      * make some ut use cc_test_old
      
      * fix typos and fit for win32
      
      * fix lib path
      
      * fix some tests
      
      * skip lite test
      
      * fit for rocm
      
      * fit for cinn
      
      * fit for mac
      
      * fit for win32
      
      * skip inference ut
      
      * skip  windows
      
      * fix coverage
      2096448b
  19. 26 10月, 2022 2 次提交
  20. 24 10月, 2022 2 次提交
  21. 20 10月, 2022 2 次提交
  22. 19 10月, 2022 1 次提交
  23. 18 10月, 2022 1 次提交
    • W
      Merge layernorm trt fuse (#46320) · 5e9f491e
      Wang Bojun 提交于
      * first version, accuracy corrected
      
      * disable debug print
      
      * use blockReduceSum in phi
      
      * add UT
      
      * add opCompat
      
      * code style
      
      * code refine
      
      * bug fix
      
      * code refine
      
      * test fix
      
      * bugfix
      
      * codesytle fix
      
      * code style
      
      * code-style
      
      * code-style
      
      * code-style
      5e9f491e