1. 03 Nov, 2022 3 commits
    • H
      [Opt Kernel Selection] Opt CanMKLDNNBeUsed performance (#47563) · 9adad42d
      HongyuJia committed
      * opt CanMKLDNNBeUsed performance
      
      * fix nullptr bug
      
      * fix OpBase default_attrs=nullptr bug
      
      * fix OpBase default_attrs=nullptr bug
      
      * fix OpBase default_attrs=nullptr bug
      9adad42d
    • S
      [PHI] Migrate softmax kernel (#47339) · b8ae3858
      Sławomir Siwek committed
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix macro error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * remove redundant imports
      
      * migrate softmax
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * merge dev
      
      * fix map at error
      
      * adjust attribute
      
      * adapt funcs to PHI
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
      b8ae3858
    • W
      bug fix (#47611) · 5160628c
      wenbin committed
      5160628c
  2. 02 Nov, 2022 4 commits
  3. 01 Nov, 2022 6 commits
  4. 31 Oct, 2022 4 commits
    • F
      feat: add int8 support for vit (#47330) · 2953b708
      feng_shuai committed
      * feat: add int8 support for vit
      
      * test: add test
      2953b708
    • Y
      [Auto Parallel] Improve the c++ dist attr (#47358) · b03b4a3c
      Yulong Ao committed
      * [Auto Parallel] Improve the c++ dist attr
      
      * [Auto Parallel] Modify test_program.py
      
      * [Auto Parallel] Add the missing import
      b03b4a3c
    • K
      [ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e
      kangguangli committed
      * replace executor in conditional_block_op.run with standalone_executor
      
      * add block_id as the argument of standalone executor's method run; add print for program
      
      * fix scope bug about conditional block op
      
      * fix bug: unnecessary return of fetch value
      
      * fix typo
      
      * fix: quantization will set variable persistable, and these variables must exist in global scope
      
      * add interpretercore cache for conditional block op but not activate in default
      
      * fix bug: local scope reuse for conditional block op
      
      * reset scope when conditional block op runs
      
      * fix typo
      
      * fix typo and code style
      
      * add build scope for conditional block op
      
      * add skip for transfer_layout kernel
      
      * refine code
      
      * fix reset_scope
      
      * fix reset_scope
      
      * refine code
      
      * refine code
      
      * refine code
      
      1. remove flag use in conditional_block_op
      2. pass execution_config to BuildOpFuncList instead of individual parameter
      
      * refine code
      
      * remove the use of FLAGS_control_flow_use_new_executor_cache
      
      * change FLAGS_control_flow_use_new_executor to false
      3b219e5e
    • N
      fix typos for `True` and `False` (#47477) · f5912d0c
      Nyakku Shigure committed
      * fix typo `Fasle`/`Flase` -> `False`
      
      * fix typo `Ture` -> `True`
      f5912d0c
  5. 27 Oct, 2022 3 commits
  6. 26 Oct, 2022 5 commits
  7. 25 Oct, 2022 1 commit
  8. 24 Oct, 2022 3 commits
  9. 21 Oct, 2022 1 commit
  10. 20 Oct, 2022 3 commits
  11. 19 Oct, 2022 5 commits
  12. 18 Oct, 2022 2 commits
    • W
      Merge layernorm trt fuse (#46320) · 5e9f491e
      Wang Bojun committed
      * first version, accuracy corrected
      
      * disable debug print
      
      * use blockReduceSum in phi
      
      * add UT
      
      * add opCompat
      
      * code style
      
      * code refine
      
      * bug fix
      
      * code refine
      
      * test fix
      
      * bugfix
      
      * code style fix
      
      * code style
      
      * code-style
      
      * code-style
      
      * code-style
      5e9f491e
    • S
      FC + activation fuse passes (#45183) · b7a23adb
      Sławomir Siwek committed
      * git
      
      * style
      
      * leave default relu in kernel
      
      * style
      
      * cleanup FCMKLDNN pattern
      
      * merge conflicts
      
      * update develop
      
      * update develop
      
      * add const
      
      * rename to oneDNN and adjust attributes
      
      * whitespace
      b7a23adb