1. 02 6月, 2023 6 次提交
  2. 01 6月, 2023 7 次提交
  3. 31 5月, 2023 1 次提交
  4. 30 5月, 2023 5 次提交
    • R
      update_c++17 (#53892) · 950b563b
      risemeup1 提交于
      * update_c++17
      
      * update_c++17
      
      * fix windows bug
      
      * solve cirle depend
      
      * solve cirle depend
      
      * solve cirle depend
      
      * solve cirle depend
      
      * solve cirle depend
      
      * fix windows bug
      
      * fix compiler error
      
      * fix compiler error
      
      * update eigen3
      
      * update eigen3
      
      * update eigen3
      
      * fix mac-py3 compiler error
      
      * update C++17
      
      * fix mac compiler error
      
      * fix compile error
      
      * fix coverage_compiler error
      
      * fix coverage_ci_problem
      
      * fix coverage_error
      
      * fix_kunlun200 compile error
      
      * fix kunlun200 compiler error
      
      * fix compile error
      
      * fix compiler error
      
      * fix py3 failed test
      
      * fix kunlun200 compiler error
      
      * test
      
      * fix test error
      
      * fix test error
      
      * fix test error
      
      * test
      
      * test
      
      * fix mac py3 error
      
      * fix mac py3 error
      
      * fix mac py3 error
      
      * fix test error
      
      * fix test error
      
      * fix compile error
      
      * fix compile error
      
      * fix compile error
      
      * test
      
      * test
      
      * fix compiler error
      
      * test
      
      * test
      
      * debug on ci
      
      * fix compiler error
      
      * fix compiler error
      
      * test
      
      * fix cinn compiler error
      
      * test
      
      * fix rocm cmpile error
      
      * fix cinn and kunlun compile error
      
      * update c++14
      
      * Update flags.cmake
      950b563b
    • T
      del sequence_enumerate_op (#54177) · f9065e15
      tianshuo78520a 提交于
      * del sequence_enumerate_op
      
      * del analyzer_pyramid_dnn_tester
      
      * fix
      f9065e15
    • R
      06736921
    • H
      Support static graph code generation for op strided_slice (#54098) · e531bb02
      huangjiyi 提交于
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      e531bb02
    • R
      support auto generate for activation op relu6 (#53979) · dabfbba9
      RedContritio 提交于
      * support auto generate for activation_op relu6
      
      * add generated_static_op for activation_op in CMakeLists.txt
      dabfbba9
  5. 26 5月, 2023 1 次提交
    • Y
      [PHI Decoupling]Create PHI shared lib (#53735) · da50a009
      YuanRisheng 提交于
      * create phi so
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * add file
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * perfect so
      
      * fix py3 bugs
      
      * delete all static target in phi
      
      * fix windows bugs
      
      * fix py3 bugs
      
      * fix ci bugs
      
      * fix windows bugs
      
      * fix bugs: gflags can't be linked by dynamic and static lib
      
      * fix bugs that can not load 3rd party
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix conflict
      
      * fix xpu bugs
      
      * fix mac compile bugs
      
      * fix psgpu bugs
      
      * fix inference failed
      
      * deal with conflict
      
      * fix LIBRARY_PATH bug
      
      * fix windows bugs
      
      * fix onednn error
      
      * fix windows compile bugs
      
      * fix windows compile bugs
      
      * fix test_cuda_graph_static_mode_error aborted
      
      * fix windows bugs
      
      * fix mac-python3 error
      
      * fix hip compile bugs
      
      * change mode to static
      
      * change to static mode
      
      * fix ci bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix bugs
      
      * add static flag
      
      * add PADDLE_API
      
      * change position of PADDLE_API
      
      * fix windows bugs
      
      * change mode to dynamic lib
      
      * fix windows static bugs
      
      * deal with conflict
      
      * fix windows unit bug
      
      * fix coverage
      
      * deal with conflict
      
      * fix windows-inference
      
      * fix py3 bugs
      
      * fix bugs when compile type_info
      
      * fix compile bugs
      
      * fix py3 bugs
      
      * fix windows bugs
      
      * fix windows openblas
      
      * fix xpu bugs
      
      * fix enforce_test in windows
      
      * update code according comment
      
      * fix windows cmake bug
      
      * fix windows bugs
      
      * fix windows bugs
      
      * delete cinn unittest
      
      * fix cinn bugs
      
      ---------
      Co-authored-by: HappyHeavyRain's avatarlzydev <1528794076@qq.com>
      da50a009
  6. 25 5月, 2023 2 次提交
  7. 24 5月, 2023 1 次提交
  8. 23 5月, 2023 11 次提交
  9. 22 5月, 2023 3 次提交
  10. 19 5月, 2023 3 次提交
    • warrentdrew's avatar
      add minimum grad composite rules (#52561) · 97690816
      warrentdrew 提交于
      * add minimum grad composite rules
      
      * add public python api
      
      * fix format
      
      * fix format
      
      * update testcase
      
      * fix testcase
      
      * fix format
      
      * fix cmakelist.txt
      
      * fix format
      
      * fix param problem
      
      * fix op and composite rule
      
      * fix bf16 cpu support problem
      
      * fix bf16 cpu issue
      
      * fix axis error log
      
      * add axis for maximum
      
      * revert commit
      
      * remove .orig
      
      * fix generic problem
      
      * revert max op
      
      * fix axis error
      
      * fix maximum axis
      
      * fix test_check_output
      
      * fix cinn
      
      * fix minimum maximum axis check
      97690816
    • L
      Add flash attention to speedup fused_gate_attention. (#52731) · d29c1f8e
      limingshu 提交于
      * Reorganize the forward codes of flash-attention.
      
      * Fix forward.
      
      * Remove some noused codes.
      
      * Simplify codes and fix backward.
      
      * Change all LOG(INFO) to VLOG and fix the backward.
      
      * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes
      
      * decrease the effect of debug print on performance
      
      * Unify the initialize of flashattn arguments.
      
      * Rewirte the reshape of temp_mask and temp_bias.
      
      * API support use_flash_attn.
      
      * Fix compiling error on CI.
      
      * Try to crop the flash-attention lib.
      
      * Correct the condition of whether can use flash-attn.
      
      * Remove the softmax_out argument.
      
      * Remove is_causal.
      
      * Polish codes.
      
      * Fix qkv_transpose_out's shape and scaling of Q * K.
      
      * Update commit of flash-attention.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      d29c1f8e
    • G
      test,test=develop (#53811) · 10758725
      Galaxy1458 提交于
      10758725