1. 10 Jan, 2023 1 commit
  2. 09 Jan, 2023 5 commits
    • [gpubox fix] benchmark2 (#49649) · 2642caaa
      wangzhen38 committed
    • Preln groupnorm (#49463) · 591be3bd
      wenbin committed
      * skip_groupnorm
      
      * init
      
      * preln
      
      * add ut
      
      * more assert
      
      * set timeout
      
      * fix windows ci issue
    • Rewrite batch norm act fuse pass tester (#49277) · aaa25222
      Hulek committed
      * Rewritten
      
      * change mkldnn to onednn
      
      * fix cmake name
    • Unify the pass of the map class (#49568) · ee49994f
      gem5 committed
    • Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang committed
      * prototype of composite grad in paddle
      
      * prototype of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify ignore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw exception when neither GrapOpMaker nor GradCompOpMaker has been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singleton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove additional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: zyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: wangruting <wangruting@baidu.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: charles-hit <wanghao107@baidu.com>
      Co-authored-by: xiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
  3. 06 Jan, 2023 2 commits
  4. 05 Jan, 2023 5 commits
  5. 04 Jan, 2023 5 commits
  6. 03 Jan, 2023 4 commits
  7. 02 Jan, 2023 1 commit
  8. 30 Dec, 2022 4 commits
  9. 29 Dec, 2022 1 commit
  10. 28 Dec, 2022 1 commit
    • [new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722
      Leo Chen committed
      * add skip run
      
      * alloc minimum memory
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * fix cases when tensor is initialized or empty
      
      * alloc empty output for place info
      
      * add test
      
      * increase timeout
      
      * format code
      
      * skip cpu
      
      * add cudnn_deterministic
      
      * fit for hostAlloc
      
      * follow comments
      
      * change check_size to fake_alloc
  11. 27 Dec, 2022 2 commits
  12. 26 Dec, 2022 3 commits
  13. 23 Dec, 2022 5 commits
  14. 21 Dec, 2022 1 commit