1. 12 7月, 2023 1 次提交
    • Y
      [ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7
      YangQun 提交于
      * squash pick the poc code
      * fix build after rebase
      * fix int8 conv and fc uts
      * Fix and clean-up Get_SRC_Scale_Memory
      * fix floating point fc uts
      * fix test_analyzer_int8_googlenet
      * test_analyzer_int8_mobilenetv1
      * fix int8 mobilenet v2 and v3
      * fix build error after rebase
      * [oneDNN] rename library version
      * fix conv bias datatype
      * try to fix import error
      * fix rebase error
      * [oneDNN] pack library into python wheel
      * add MKLDNN_SHARED_LIB_3 to env_dict
      * fix test_analyzer_bert
      * fix fill_constant op kernel
      * fix ernie and matmul op ut
      * fix softplus ut
      * fix conv+relu6 fusion ut
      * fix hardswish fusion
      * fix quant+transpose fusion ut
      * fixsgd ut
      * fix int8 matmul with flatten
      * fix fc+scale fusion
      * fix conv/matmul+gelu fusion uts
      * fix rebase error
      * Revert "fix conv/matmul+gelu fusion uts"
      This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
      * upgrade to onednn v3.1
      * remove older version onednn
      * use densetensor::data() for achieving mean and var in layernorm impl
      * comments for atol of integer tests
      * fix clang-format
      * Revert "remove older version onednn"
      This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
      * improve binary handle
      * fix expand kernel
      * Revert "use densetensor::data() for achieving mean and var in layernorm impl"
      * always use forward_inference for conv
      * remove activation scales
      * rollback changes to mkldnn.cmake
      * address comments
      * port changes to dequantize kernel
      * fix merge error
      * fix fused_elementwise_kernel
      * upgrade onednn version to v3.1.1
      * fix some approval error
      * fix error msg format
      * remove old onednn libs
      * try to fix symbolic link issue
      * fix cinn test case segfault
      * do not explicit link test with onednn
      * remove unnecessary changes
      * integrate CINN with onednn v3
      * link with mkldnn project
      * fix cinn build file
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
      Co-authored-by: Ntianshuo78520a <707759223@qq.com>
      cfa513f7
  2. 11 7月, 2023 1 次提交
    • L
      replace the AdagradOptimizer... · 94365855
      LoneRanger 提交于
      replace the AdagradOptimizer 、adamaxOptimizer、AdadeltaOptimizer、RMSPropOptimizer、LambOptimizer and Momentum (#54152)
      
      * replace the AdadeltaOptimizer with Adadelta
      
      * replace the RMSPropOptimizer with RMSProp
      
      * replace the LambOptimizer with lamb
      
      * replace the momentum in contrib/optimizer.py with Momentum in python/paddle/optimizer/momentum.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug of Lamp
      
      * fix bug of Lamp
      
      * fix bug of import
      
      * replace the AdamaxOptimizer with Admax and change the optimizer base for AdagradOptimizer
      
      * fix bug
      
      * fix bug
      
      * Update optimizer.py
      
      * fix bug
      
      * fix bug
      94365855
  3. 07 7月, 2023 1 次提交
  4. 27 6月, 2023 1 次提交
    • 6
      Code merge | Merge CINN into Paddle (#54749) · 67c69dca
      6clc 提交于
      * feat(cmake): add cmake of cinn
      
      * feat(cmake): add cmake of cinn python test
      
      * feat(cmake): add jit
      
      * feat(cmake): test/CMakeList.txt
      
      * feat(cmake): rebase to develop
      
      * feat(cmake): remove some flags
      
      * fix(cmake): fix cinn's gflags depends
      
      * feat(cmake): add ci scripts of cinn
      
      * feat(cmake): copy code of cinn
      
      * fix(cmake): fix cinn third_party model path
      
      * gflags dynamic dependce
      
      * fix ci build_demo
      
      * tmp update to c++17 of cinn-only test
      
      * fix cinn only with c++17
      67c69dca
  5. 26 6月, 2023 1 次提交
  6. 20 6月, 2023 1 次提交
    • Z
      [IR] Change IR from Static library to dynamic library (#54729) · 24a3cb52
      zhangbo9674 提交于
      * new_ir to shared
      
      * refine code
      
      * add ir lib path to env
      
      * refine type
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * refine code
      
      * close win
      
      * refine code
      
      * refine code
      
      * refine code
      
      * add win share
      
      * refine code
      
      * refie code
      
      * refine code
      
      * refine code
      
      * refien code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * solve conflict
      
      * solve conflict
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * fix interpretercore program bug
      
      * delete unuse code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix cinn bug
      
      * fix cinn bug
      
      * debug
      
      * fix cinn bug
      
      * delete unused code
      
      * fix cinn bug
      
      * fix cinn bug
      
      * fix  ug
      
      * test win openblas
      
      * test win openblas
      
      * fix win openblas bug
      
      * polish code
      
      * fix win open blas bug
      
      * close win dll
      
      * fix flag bug
      
      * test for windows
      
      * fix compile bug
      24a3cb52
  7. 12 6月, 2023 1 次提交
    • Y
      Add PHI option in cmake (#54462) · eca64f0f
      YuanRisheng 提交于
      * add phi option in cmake
      
      * modify default option
      
      * fix windows bugs
      
      * perfect log
      
      * change default mode
      
      * change phi so on for py3
      eca64f0f
  8. 30 5月, 2023 1 次提交
  9. 26 5月, 2023 1 次提交
    • Y
      [PHI Decoupling]Create PHI shared lib (#53735) · da50a009
      YuanRisheng 提交于
      * create phi so
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * add file
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * perfect so
      
      * fix py3 bugs
      
      * delete all static target in phi
      
      * fix windows bugs
      
      * fix py3 bugs
      
      * fix ci bugs
      
      * fix windows bugs
      
      * fix bugs: gflags can't be linked by dynamic and static lib
      
      * fix bugs that can not load 3rd party
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix conflict
      
      * fix xpu bugs
      
      * fix mac compile bugs
      
      * fix psgpu bugs
      
      * fix inference failed
      
      * deal with conflict
      
      * fix LIBRARY_PATH bug
      
      * fix windows bugs
      
      * fix onednn error
      
      * fix windows compile bugs
      
      * fix windows compile bugs
      
      * fix test_cuda_graph_static_mode_error aborted
      
      * fix windows bugs
      
      * fix mac-python3 error
      
      * fix hip compile bugs
      
      * change mode to static
      
      * change to static mode
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix bugs
      
      * add static flag
      
      * add PADDLE_API
      
      * change position of PADDLE_API
      
      * fix windows bugs
      
      * change mode to dynamic lib
      
      * fix windows static bugs
      
      * deal with conflict
      
      * fix windows unit bug
      
      * fix coverage
      
      * deal with conflict
      
      * fix windows-inference
      
      * fix py3 bugs
      
      * fix bugs when compile type_info
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix windows openblas
      
      * fix xpu bugs
      
      * fix enforce_test in windows
      
      * update code according comment
      
      * fix windows cmake bug
      
      * fix windows bugs
      
      * fix windows bugs
      
      * delete cinn unittest
      
      * fix cinn bugs
      
      ---------
      Co-authored-by: HappyHeavyRain's avatarlzydev <1528794076@qq.com>
      da50a009
  10. 11 5月, 2023 2 次提交
  11. 26 4月, 2023 1 次提交
  12. 23 4月, 2023 1 次提交
    • L
      relocate paddle/fluid/contrib/layers (#52820) · 802be98b
      LoneRanger 提交于
      * relocate metri_op.py
      
      * reloacte nn.py
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix variable->tensor and fix __all__
      
      * fix ctr_metric_bundle and sparse_embedding
      
      * fix bug of function init
      
      * fix bug of importing sparse_embedding and ctr_metric_bundle
      
      * fix bug
      
      * Update __init__.py
      802be98b
  13. 17 4月, 2023 1 次提交
  14. 12 4月, 2023 1 次提交
  15. 23 3月, 2023 1 次提交
  16. 21 3月, 2023 1 次提交
  17. 20 3月, 2023 1 次提交
  18. 09 3月, 2023 1 次提交
  19. 07 3月, 2023 1 次提交
  20. 06 3月, 2023 1 次提交
  21. 01 3月, 2023 2 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
    • W
      [Fluidclean]move fluid.transpiler to distributed.transpiler (#51025) · 51aa2129
      wangxiaoning 提交于
      * remove transpiler
      
      * Revert "remove transpiler"
      
      This reverts commit 46044ccd52011d45d7026786d331f264a6a8f645.
      
      * Revert "Revert "remove transpiler""
      
      This reverts commit 80ad0945401b5b5efebac4baee0ec50a793d4405.
      
      * codestyle
      
      * fix setup
      
      * fix
      
      * fix
      51aa2129
  22. 27 2月, 2023 2 次提交
  23. 22 2月, 2023 1 次提交
  24. 21 2月, 2023 1 次提交
  25. 20 2月, 2023 3 次提交
    • W
      [RM FLUID] trainer_pass&heter_trainer_pass (#50610) · 43622a20
      wangzhen38 提交于
      * [RM FLUID] trainer_pass&heter_trainer_pass
      * [RM FLUID] rm distributed_strategy
      43622a20
    • H
      [Cpp Extension] Support Cpp Extension (#49893) · 2a0563fa
      HongyuJia 提交于
      * update include directory
      
      * fully support C++ extension, pass unittest
      
      * fix include directory
      
      * support both extension and operator in one file
      
      * polish testcase
      
      * add jit unittest
      
      * update third_party.cmake, pass CI test
      
      * fix cmake
      
      * fix setup
      
      * fix inference, fix unittest precision
      
      * fix unittest precision
      
      * fix inference_lib cmake
      
      * try fix setup, try fix inference_lib
      
      * try fix inference_lib pybind
      
      * fix mix_op_extension, fix inference_lib
      
      * fix mix_op_extension, fix inference_lib
      
      * change cmake
      
      * change cmake
      
      * add compile flags
      
      * add Python.h headerfile
      
      * add test_custom_plugin_creater cmake
      
      * comment compile flag
      
      * pass all CI
      
      * pass all CI
      
      * comment compile flag
      
      * try solve test_custom_plugin_creater link error
      
      * try solve test_custom_plugin_creater link error
      
      * polish codes
      
      * remove windows compile flag
      
      * remove python_include_path
      
      * update pybind11, 2.4.3->2.6.0
      
      * update pybind11, 2.6.0->2.10.0
      
      * update pybind11, 2.10.0->2.6.0b1
      
      * update pybind11, 2.6.0b1->2.6.0, start fix unittest
      
      * fix pybind11 2.6.0 VarBase print error
      
      * fix pybind11 2.6.0 VarBase print error
      
      * handle PADDLE_ON_INFERENCE
      
      * modify according to reviewer
      
      * fix cmake
      
      * cmake decouple pybind_util when not ON_INFER
      
      * cmake decouple pybind_util when not ON_INFER
      
      * remove copy of inference_lib.cmake
      
      * change pybind.cc headerfile fluid->phi
      2a0563fa
    • W
      [Fluid clean]clean fluid.transpiler.details (#50564) · 167e6488
      wangxiaoning 提交于
      * move fluid.transpiler.details
      
      * fix setup
      
      * fix
      
      * fix setup
      
      * add setup
      167e6488
  26. 17 2月, 2023 1 次提交
  27. 16 2月, 2023 1 次提交
    • W
      Add Post-Training Quantization and export function in dygraph mode (#50107) · b7030257
      whs 提交于
      Add PTQ and exporting function
      1. Add Post-Training Quantization
      1.1 Abstract some functions from QAT to Quantization class
      1.2 Add Post-Training Quantization by extending Quantization class
      1.3 Add observers for PTQ
      1.4 Add unittest for PTQ
      2. Add exporting function for QAT and PTQ
      b7030257
  28. 15 2月, 2023 2 次提交
  29. 29 1月, 2023 1 次提交
  30. 12 1月, 2023 1 次提交
  31. 11 1月, 2023 1 次提交
    • W
      Add API for quantization-aware training in dygraph mode (#49398) · b53888e7
      whs 提交于
      * Add tools for quantization-aware training
      1. Expose an API named paddle.quantization.QAT
      2. Define a wrapper class to insert quanters into model for QAT
      3. Add some functions in QuantConfig for QAT
      4. Add unittest for QAT
      
      * Add QuantedConv2D and QuantedLinear for QAT
      
      * Add paddle.nn.quant.qat to setup.py
      b53888e7
  32. 05 1月, 2023 2 次提交
  33. 04 1月, 2023 1 次提交