1. 08 Sep, 2023 1 commit
  2. 06 Sep, 2023 1 commit
  3. 05 Sep, 2023 1 commit
  4. 01 Sep, 2023 1 commit
  5. 30 Aug, 2023 1 commit
    • Add paddle custom flags support (#56256) · 2ef4ec71
      Committed by huangjiyi
      * update
      
      * replace gflags header
      
      * replace DEFINE_<type> with PD_DEFINE_<type>
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update cmake
      
      * add :: before some paddle namespace
      
      * fix link error
      
      * fix CI-Py3
      
      * allow commandline parse
      
      * fix SetFlagsFromEnv
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix CI-Coverage-build
      
      * fix CI-Windows-build
      
      * fix CI-Inference
      
      * fix bug
      
      * fix bug
      
      * fix CI-CINN
      
      * fix inference api test
      
      * fix infer_ut test
      
      * revert infer_ut gflags usage
      
      * update
      
      * fix inference
      
      * remove flags export macro
      
      * revert inference demo_ci gflags usage
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug when turn on WITH_GFLAGS
      
      * turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * fix bug when turn on WITH_GFLAGS
      
      * update
      
      * update and add unittest
      
      * add unittest
      
      * fix conflict
      
      * rerun ci
      
      * update
      
      * resolve conflict
  6. 25 Aug, 2023 1 commit
  7. 24 Aug, 2023 1 commit
  8. 23 Aug, 2023 1 commit
  9. 22 Aug, 2023 2 commits
  10. 14 Aug, 2023 1 commit
  11. 10 Aug, 2023 1 commit
  12. 09 Aug, 2023 1 commit
  13. 08 Aug, 2023 2 commits
  14. 07 Aug, 2023 1 commit
    • [WIP] Integration flash attention 2 (#55758) · 0473369f
      Committed by umiswing
      * Work for fa-2 padded fwd. Code to be cleaned.
      
      * Work for fa2 unpadded fwd.
      
      * Work for padded-bwd, dk get small diff on np.random.seed(0)
      
      * Anyway I pass paddle's utest, except return softmax without dropout.
      
      * Clean code.
      
      * Modify interface.
      
      * Clean code and add some check.
      
      * Easy compile for dev.
      
      * Fix ci.
      
      * Fix ci-build.
      
      * Add std c++17 option again.
      
      * Limit max job when compiling fa2.
      
      * Remove const_cast
      
      * Add fwd params, to be cleaned.
      
      * Clean code.
      
      * Add bwd params.
      
      * Clean code.
      
      * Add enforce.
      
      * Use v2.0.4
      
      * Pass RNG state to fa2 capi
      
      * Fix review.
      
      * Add assert
      
      * Skip compile for sm less than 80.
  15. 20 Jul, 2023 1 commit
  16. 18 Jul, 2023 2 commits
  17. 17 Jul, 2023 1 commit
  18. 14 Jul, 2023 2 commits
  19. 12 Jul, 2023 1 commit
    • [ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7
      Committed by YangQun
      * squash pick the poc code
      * fix build after rebase
      * fix int8 conv and fc uts
      * Fix and clean-up Get_SRC_Scale_Memory
      * fix floating point fc uts
      * fix test_analyzer_int8_googlenet
      * test_analyzer_int8_mobilenetv1
      * fix int8 mobilenet v2 and v3
      * fix build error after rebase
      * [oneDNN] rename library version
      * fix conv bias datatype
      * try to fix import error
      * fix rebase error
      * [oneDNN] pack library into python wheel
      * add MKLDNN_SHARED_LIB_3 to env_dict
      * fix test_analyzer_bert
      * fix fill_constant op kernel
      * fix ernie and matmul op ut
      * fix softplus ut
      * fix conv+relu6 fusion ut
      * fix hardswish fusion
      * fix quant+transpose fusion ut
      * fix sgd ut
      * fix int8 matmul with flatten
      * fix fc+scale fusion
      * fix conv/matmul+gelu fusion uts
      * fix rebase error
      * Revert "fix conv/matmul+gelu fusion uts"
      This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
      * upgrade to onednn v3.1
      * remove older version onednn
      * use densetensor::data() for achieving mean and var in layernorm impl
      * comments for atol of integer tests
      * fix clang-format
      * Revert "remove older version onednn"
      This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
      * improve binary handle
      * fix expand kernel
      * Revert "use densetensor::data() for achieving mean and var in layernorm impl"
      * always use forward_inference for conv
      * remove activation scales
      * rollback changes to mkldnn.cmake
      * address comments
      * port changes to dequantize kernel
      * fix merge error
      * fix fused_elementwise_kernel
      * upgrade onednn version to v3.1.1
      * fix some approval error
      * fix error msg format
      * remove old onednn libs
      * try to fix symbolic link issue
      * fix cinn test case segfault
      * do not explicit link test with onednn
      * remove unnecessary changes
      * integrate CINN with onednn v3
      * link with mkldnn project
      * fix cinn build file
      
      ---------
      Co-authored-by: Tomasz Socha <tomasz.socha@intel.com>
      Co-authored-by: Chen, Xinyu1 <xinyu1.chen@intel.com>
      Co-authored-by: tianshuo78520a <707759223@qq.com>
  20. 07 Jul, 2023 1 commit
  21. 06 Jul, 2023 1 commit
  22. 04 Jul, 2023 1 commit
    • [XPU] Add XPU plugin support (#55101) · 6d5d9f23
      Committed by hong19860320
      * Add XPU plugin to support the customized ops or improve the performance of the fusion ops based on hand-written xpu micro kernels.
      
      * refine README.md
  23. 02 Jul, 2023 1 commit
  24. 28 Jun, 2023 4 commits
  25. 27 Jun, 2023 1 commit
    • Code merge | Merge CINN into Paddle (#54749) · 67c69dca
      Committed by 6clc
      * feat(cmake): add cmake of cinn
      
      * feat(cmake): add cmake of cinn python test
      
      * feat(cmake): add jit
      
      * feat(cmake): test/CMakeList.txt
      
      * feat(cmake): rebase to develop
      
      * feat(cmake): remove some flags
      
      * fix(cmake): fix cinn's gflags depends
      
      * feat(cmake): add ci scripts of cinn
      
      * feat(cmake): copy code of cinn
      
      * fix(cmake): fix cinn third_party model path
      
      * gflags dynamic dependence
      
      * fix ci build_demo
      
      * tmp update to c++17 of cinn-only test
      
      * fix cinn only with c++17
  26. 26 Jun, 2023 3 commits
  27. 20 Jun, 2023 1 commit
    • [XPU] avoid compile issue in non-xpu env (#54711) · e2690526
      Committed by XiaociZhang
      * [kunlun] avoid compile issue in non-xpu env
      
      also rename macro WITH_XPU_XPTI to WITH_XPTI
      
      * move get_xpti_dependency.sh to tools/xpu
      
      * move get_xpti_dependency.sh to tools/xpu
      
      * call get_xpti_dependency.sh only in need
  28. 19 Jun, 2023 4 commits