1. 22 Jun, 2022 3 commits
  2. 21 Jun, 2022 4 commits
  3. 20 Jun, 2022 1 commit
  4. 17 Jun, 2022 2 commits
  5. 15 Jun, 2022 1 commit
  6. 14 Jun, 2022 1 commit
    • X
      [ CherryPick ] Cherry pick for einsum optimization. (#43468) · 22e75d92
      xiongkun committed
      * [EinsumOp] Polish forward logic and backward logic for optimize (#42603)
      
      * change logic for optimize
      
      * modify
      
      * merge
      
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 (#43010)
      
      * [EinsumOp] Make EinsumOp support bfloat16. (#43085)
      
      * change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0
      
      * make EinsumOp support bf16
      
      * add unittest for BF16
      
      * add condition for test_BF16
      
      * fix bugs
      
      * fix
      
      * change the backward api to fit einsum op
  7. 09 Jun, 2022 1 commit
  8. 08 Jun, 2022 3 commits
  9. 07 Jun, 2022 1 commit
  10. 06 Jun, 2022 1 commit
    • N
      cherry-pick 42645 (#43205) · 835a1888
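      niuliling123 committed
      Remove the per-rank instantiation and the Elementwise call in the Broadcast function to reduce compile time.
      Adapted from PR #42645 on the develop branch. Because the develop and release branches have diverged significantly, a direct cherry-pick was not possible, so the PR was resubmitted against release2.3.
      Instantiating Broadcast per rank causes heavy expansion of the underlying templates, which makes reduce_sum_grad_kernel.cu.o excessively large; this change reduces both the .o size and the compile time.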
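The trade-off described in this commit message can be illustrated with a minimal sketch. It is not Paddle's actual Broadcast code: the function names `OffsetStaticRank` and `OffsetDynamicRank` are hypothetical, and the example only assumes the general pattern of computing a broadcast offset from an index and per-dimension strides (stride 0 on a broadcast dimension). When the rank is a template parameter, each rank a caller uses yields a separate instantiation in the object file; when the rank is a runtime value, one compiled body serves every rank.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical illustration only; these names do not come from Paddle.

// Variant A: the rank is a template parameter. Every distinct Rank used
// by a caller produces its own instantiation, so object size and compile
// time grow with the number of ranks instantiated.
template <std::size_t Rank>
std::size_t OffsetStaticRank(const std::array<std::size_t, Rank>& index,
                             const std::array<std::size_t, Rank>& strides) {
  std::size_t offset = 0;
  for (std::size_t i = 0; i < Rank; ++i) {
    offset += index[i] * strides[i];  // a stride of 0 marks a broadcast dimension
  }
  return offset;
}

// Variant B: the rank is a runtime value. A single compiled body handles
// all ranks, at the cost of a loop bound that is not known at compile time.
std::size_t OffsetDynamicRank(const std::vector<std::size_t>& index,
                              const std::vector<std::size_t>& strides) {
  assert(index.size() == strides.size());
  std::size_t offset = 0;
  for (std::size_t i = 0; i < index.size(); ++i) {
    offset += index[i] * strides[i];
  }
  return offset;
}
```

Both variants return the same offset for the same inputs; the difference is only in how many copies of the function the compiler has to generate.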
  11. 31 May, 2022 1 commit
    • T
      Del check size (#43113) · 40a7e0ad
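      tianshuo78520a committed
      Remove the checks on the build directory size and the inference library size. These checks compare against develop, so differences are expected; the check is disabled for release jobs.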
  12. 30 May, 2022 2 commits
  13. 27 May, 2022 2 commits
  14. 26 May, 2022 1 commit
  15. 23 May, 2022 1 commit
  16. 17 May, 2022 1 commit
  17. 11 May, 2022 1 commit
  18. 10 May, 2022 4 commits
  19. 09 May, 2022 1 commit
  20. 07 May, 2022 2 commits
  21. 06 May, 2022 1 commit
  22. 05 May, 2022 2 commits
  23. 04 May, 2022 3 commits
    • S
      graph partition (#42472) · a3917625
      seemingwang committed
      * enable graph-engine to return all id (#42319)
      
      * enable graph-engine to return all id
      
      * change vector's dimension
      
      * change vector's dimension
      
      * enlarge returned ids dimensions
      
      * change sample result's structure to fit training (#42426)
      
      * enable graph-engine to return all id
      
      * change vector's dimension
      
      * change vector's dimension
      
      * enlarge returned ids dimensions
      
      * add actual_val
      
      * change vlog
      
      * fix bug
      
      * bug fix
      
      * bug fix
      
      * fix display test
      
      * singleton of gpu_graph_wrapper
      
      * change sample result's structure to fit training
      
      * recover sample code
      
      * fix
      
      * secondary sample
      
      * add graph partition
      
      * fix pybind
      Co-authored-by: NDesmonDay <908660116@qq.com>
    • X
      [cherry-pick 2.3] fix bug of batch_norm_grad kernel with fp16 (#42461) · a5745864
      XiaoguangHu committed
      * fix bug of batch_norm_grad kernel with fp16
      
      * format code
    • H
      fix paddle-ort python bug (#42464) (#42470) · 87e6149c
      heliqi committed
      * fix paddle-ort python bug
      
      * fix paddle-ort python bug