1. 07 Dec, 2021 1 commit
  2. 11 Nov, 2021 1 commit
    • Added softplus + activation oneDNN fuse pass (#36657) · a346c4dc
      Committed by jakpiase
      * added softplus + activation fuse pass
      
      * minor change
      
      * implemented reviewer suggestion
      
      * minor fix
      
      * minor fix
      
      * added scale_out parameter
      
      * minor fix
      
      * fix for iScan CI
      
      * conditionally disabled logs
      
      * refactored pass builder
  3. 26 Oct, 2021 1 commit
    • [Paddle-Inference]Add MatmulV2ToMatmul convert Pass, fix (matmul_v2, matmul,... · 93c591e2
      Committed by Wangzheee
      [Paddle-Inference]Add MatmulV2ToMatmul convert Pass, fix (matmul_v2, matmul, mul) convert pass, fix (matmul, mul) op_teller (#36652)
      
      * new_Matmul2ToMatmulToMul
      
      * new_Matmul2ToMatmulToMul
      
      * fix paddle_pass_builder
      
      * fix paddle_pass_builder
      
      * fix paddle_pass_builder
      
      * tem
      
      * tem
      
      * Add MatmulV2ToMatmul convert Pass; MatmulV2ToMul convert Pass
      
      * Add MatmulV2ToMatmul convert Pass; MatmulV2ToMul convert Pass
      
      * add matmul_broadcast_unitest
      
      * fix op_teller
  4. 21 Oct, 2021 1 commit
    • Added matmul_v2+transpose+reshape fuse pass (#36481) · 856cb9c5
      Committed by jakpiase
      * added base changes for matmul_v2+trans+resh fuse pass
      
      * added full matmul_v2+transpose+reshape pass
      
      * removed a file added by mistake
      
      * added reviewers suggestions
      
      * Changed ops type in checking compatibility version
      
      * Deleted one statement
  5. 14 Oct, 2021 1 commit
  6. 13 Oct, 2021 1 commit
    • [PaddleInference] Pass: add int8 flag for op (#36042) · d7858c99
      Committed by Wangzheee
      * add_int_pass
      
      * add_int8_flag_pass
      
      * add_int8_flag_pass
      
      * fix CMakeLists.txt
      
      * fix test_trt_fc_fuse_quant_dequant_pass.py
      
      * fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py
      
      * fix test_trt_fc_fuse_quant_dequant_pass.py
  7. 11 Oct, 2021 1 commit
  8. 22 Sep, 2021 1 commit
  9. 06 Sep, 2021 1 commit
  10. 27 Aug, 2021 1 commit
  11. 16 Aug, 2021 1 commit
  12. 30 Jul, 2021 1 commit
    • Added reshape, reshape2, squeeze and squeeze2 BF16/FP32 FWD/BWD kernels (#34219) · 22c4c189
      Committed by jakpiase
      * test version of matmul_v2
      
      * added matmul_v2 grad kernel
      
      * minor changes
      
      * minor changes
      
      * minor change for CI approval
      
      * CI fix
      
      * CI fix
      
      * added squeeze and squeeze2 kernels
      
      * CI fix
      
      * CI fix
      
      * CI fix
      
      * disabled tests when compiled with cuda
      
      * added setting format_tag by strides
      
      * added sigmoid BF16 FWD/BWD and gelu BF16 BWD
      
      * changes after review
      
      * Revert "added sigmoid BF16 FWD/BWD and gelu BF16 BWD"
      
      This reverts commit 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * Revert "Merge branch 'matmul_v2_grad' into squeeze2_op"
      
      This reverts commit 06fcf67843a4a7884eccdf67a02a03575e1d4cb8, reversing
      changes made to 6e3f76720b545abfcff9f6052b46b73a1e745cae.
      
      * minor change
      
      * added reshape1/2 kernels
      
      * moved some functions into private block
      
      * CI fix
      
      * CI fix
      
      * CI fix
  13. 20 Jul, 2021 1 commit
  14. 07 Jul, 2021 1 commit
  15. 30 Jun, 2021 1 commit
    • Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84
      Committed by jakpiase
      * added matmul_v2 bf16/fp32 FWD kernel
      
      added matmul_v2 bf16/fp32 FWD kernel
      
      * added formatting
      
      * removed some tests due to timeout in CI
      
      * refactored tests
      
      * merged tests classes into one file
      
      * minor change
      
      * removed test guard for CUDA
      
      * remove skipIf
      
      * changes after review
      
      * formatted one file
      
      * minor change
      
      * added skipping UT in CUDA place
  16. 24 Jun, 2021 1 commit
  17. 23 Jun, 2021 1 commit
    • Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
      Committed by jakpiase
      * base changes for split op
      
      * 90% of split functionality added
      
      * full fp32 functionality
      
      * added bf16 test
      
      * added submemory caching
      
      * added bf test to static mode whitelist
      
      * minor change
      
      * enabled split op for inference
      
      * minor fix
      
      * minor fix
  18. 12 Jun, 2021 1 commit
  19. 28 Apr, 2021 1 commit
    • Nne integration (#32604) · abcb3f54
      Committed by denglin-github
      * Add dlnne engine runtime
      
      * Fix log
      
      * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format
      
      * Fix CMakeList format error
      
      * Add copyright message
      
      * Fix dlnne CMakeList.txt
      
      * Add some paddlepaddle_pass to support more networks
      
      * Fix some format bug
      
      * Add delete dropout_op pass
      
      * Fix some format bug
      
      * Fix format bug
  20. 30 Mar, 2021 1 commit
  21. 01 Mar, 2021 1 commit
  22. 23 Feb, 2021 1 commit
  23. 18 Feb, 2021 1 commit
    • Add Conv Transpose BF16 (#30877) · caf9d398
      Committed by joanna.wozna.intel
      * Add conv transpose BF16
      
      * Share function GetWeightsTz
      
      * Adjust to review and fix op compatibility
      
      * Add bias to unique handler name
      
      * Remove errors related to paddle enforce
      
      * Add conv2d_transpose to bf16 list and kernel refactor
  24. 04 Feb, 2021 1 commit
  25. 03 Feb, 2021 1 commit
  26. 27 Jan, 2021 1 commit
  27. 13 Jan, 2021 1 commit
    • Added support for inference using quantization aware trained dygraph (#30288) · 7bbf3ac5
      Committed by alncat
      * added support for inference using quantization aware trained dygraph
      
      * added support for inference using quantization aware trained dygraph
      correct boost get usage
      
      * Delete incorrect warning message (#30196)
      
      * fix warning and no grad
      
      * clean redundant API alias in 2.0 - part 2 (#30013)
      
      * delete paddle.nn.functional.assign
      
      * fix dynamic to static error
      
      * just add the op error message for the matmul xpu (#30246)
      
       add the op error message for the matmul xpu
      
      * Add Static Variable Clone (#30208)
      
      Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat
      
      * use wget to replace curl to download the lcov file (#30229)
      
      * use wget to replace curl to download the lcov file
      
      * add cache for lcov
      
      * fix test_pool3d_op timeout issue (#30248)
      
      * Fix unittests bugs. (#30250)
      
      * modify error message based on comments (#30189)
      
      * modify error message based on comments
      
      * edit code according to review.
      
      * Correct spelling according to review.
      
      * Fix bug for 'save multiple method' (#30218)
      
      * Fix bug for 'save multiple method'
      
      * To pass coverage.
      
      * edit code to pass coverage.
      
      * edit code to pass coverage.
      
      * add unittest for coverage.
      
      * change for coverage.
      
      * edit for coverage.
      
      * added support for inference using quantization aware trained dygraph
      
      * Alias from  paddle.fluid.layers.auc to paddle.static.auc (#30206)
      
      * add alias from  fluid.layers.auc to static.auc
      
      * Update __init__.py
      
      * added support for inference using quantization aware trained dygraph
      correct boost get usage
      
      * corrected boost get usage
      
      * corrected naming issues and enforcing zero check
      
      * correct paddle enforce message
      
      * added more error checks
      
      * corrected error report message and optimized code
      
      * corrected findvar usage
      
      * corrected paddle_enforce in scope
      
      * correct error messages
      
      * correct error reporting format
      Co-authored-by: LielinJiang <50691816+LielinJiang@users.noreply.github.com>
      Co-authored-by: XiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
      Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
      Co-authored-by: Huihuang Zheng <zhhsplendid@gmail.com>
      Co-authored-by: YUNSHEN XIE <1084314248@qq.com>
      Co-authored-by: Bai Yifan <me@ethanbai.com>
      Co-authored-by: gongweibao <weibao.gong@gmail.com>
      Co-authored-by: WeiXin <weixin10@baidu.com>
      Co-authored-by: Jiaqi Liu <liujiaqi06@baidu.com>
  28. 29 Dec, 2020 1 commit
  29. 24 Dec, 2020 1 commit
  30. 30 Nov, 2020 1 commit
  31. 26 Nov, 2020 2 commits
  32. 25 Nov, 2020 1 commit
  33. 24 Nov, 2020 1 commit
  34. 20 Nov, 2020 1 commit
  35. 17 Nov, 2020 1 commit
  36. 06 Nov, 2020 1 commit
  37. 05 Nov, 2020 1 commit
  38. 29 Oct, 2020 1 commit
  39. 27 Oct, 2020 1 commit