  1. 10 Nov, 2022 4 commits
  2. 09 Nov, 2022 2 commits
    • J
      Final changes to introduce mem_desc to be held in Tensor (#46768) · 14f261ad
      Jacek Czaja committed
      * first commit
      
      - more fixes
      
      - compilation fix
      
      - compilation fix
      
      - fix
      
      - another fix
      
      - yet another fix
      
      - Fix
      
      - fix to fused ops
      
      - compilation fix
      
      - compilation fix
      
      - another compilation fix
      
      - another fix
      
      - fix
      
      - fix
      
      - fix
      
      - fix
      
      - yet another fix
      
      - fix
      
      - fix
      
      - cosmetic fix
      
      - lint
      
      - Revert some changes (to be brought back later)
      
      - fix to build
      
      - Added prototype of slice
      
      - fix
      
      - compilation fix
      
      - compilation fix
      
      - fix
      
      - fix
      
      - Fix
      
      - fix
      
      - fix
      	modified:   cmake/flags.cmake
      
      * lint
      
      * rerun of CI
      
      * - Fix
      
      * - lint
      
      * - lint2
    • C
      [PHI decoupling] Move fluid op generator into fluid (#47714) · f369b2b1
      Chen Weihang committed
      * move fluid op generator into fluid
      
      * remove parsed op
      
      * resolve sig undef error
      
      * append python interp find logic
      
      * remove dup code
  3. 08 Nov, 2022 3 commits
  4. 07 Nov, 2022 3 commits
    • H
      squeeze2 + transpose2 fuse onednn (#47592) · fa874a46
      Hui Zhang committed
      * squeeze2 transpose2 fuse onednn
      
      * format
      
      * fix output shape
      
      * fix conflict
      
      * format
      
      * format
      
      * remove useless
      
      * remove log
      
      * simplify pass
      
      * fix comment
      
      * fix
      
      * fix msg
      
      * fix error msg
      
      * format
    • H
      [Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d
      HongyuJia committed
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      
      * Call SetDnnFallback function in the base class
      
      * activation fallback to plain kernel
      
      * fix default GetExpectedKernelType find wrong kernel
      
      * search cudnn kernel instead of fallback
      
      * fix cudnn_handle bug
      
      * remove tanh use_cudnn
      
      * restore tanh use_cudnn
      
      * debug tanh
      
      * fix tanh bug
      
      * delete activation cudnn kernel
      
      * polish code
    • S
      [PHI] Migrate batch_norm (#47652) · 2337e609
      Sławomir Siwek committed
      * init changes
      
      * bnorm
      
      * method signature
      
      * change order
      
      * bnorm
      
      * removed unused args
  5. 05 Nov, 2022 1 commit
  6. 04 Nov, 2022 2 commits
  7. 03 Nov, 2022 6 commits
  8. 02 Nov, 2022 4 commits
  9. 01 Nov, 2022 6 commits
  10. 31 Oct, 2022 4 commits
    • F
      feat: add int8 support for vit (#47330) · 2953b708
      feng_shuai committed
      * feat: add int8 support for vit
      
      * test: add test
    • Y
      [Auto Parallel] Improve the c++ dist attr (#47358) · b03b4a3c
      Yulong Ao committed
      * [Auto Parallel] Improve the c++ dist attr
      
      * [Auto Parallel] Modify test_program.py
      
      * [Auto Parallel] Add the missing import
    • K
      [ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e
      kangguangli committed
      * replace executor in conditional_block_op.run with standalone_executor
      
      * add block_id as the argument of standalone executor's method run; add print for program
      
      * fix scope bug about conditional block op
      
      * fix bug: unnecessary return of fetch value
      
      * fix typo
      
      * fix: quantization will set variable persistable, and these variables must exist in global scope
      
      * add interpretercore cache for conditional block op but not activate in default
      
      * fix bug: local scope reuse for conditional block op
      
      * reset scope when conditional block op runs
      
      * fix typo
      
      * fix typo and code style
      
      * add build scope for conditional block op
      
      * add skip for transfer_layout kernel
      
      * refine code
      
      * fix reset_scope
      
      * fix reset_scope
      
      * refine code
      
      * refine code
      
      * refine code
      
      1. remove flag use in conditional_block_op
      2. pass execution_config to BuildOpFuncList instead of individual parameter
      
      * refine code
      
      * remove the use of FLAGS_control_flow_use_new_executor_cache
      
      * change FLAGS_control_flow_use_new_executor to false
    • N
      fix typos for `True` and `False` (#47477) · f5912d0c
      Nyakku Shigure committed
      * fix typo `Fasle`/`Flase` -> `False`
      
      * fix typo `Ture` -> `True`
  11. 27 Oct, 2022 3 commits
  12. 26 Oct, 2022 2 commits