1. 26 12月, 2022 2 次提交
  2. 23 12月, 2022 1 次提交
  3. 19 12月, 2022 2 次提交
  4. 12 12月, 2022 1 次提交
  5. 08 12月, 2022 1 次提交
  6. 30 11月, 2022 1 次提交
  7. 29 11月, 2022 1 次提交
  8. 28 11月, 2022 1 次提交
  9. 26 11月, 2022 1 次提交
  10. 25 11月, 2022 1 次提交
    • R
      Refactor stream anayzer (#48158) · 889318d8
      Ruibiao Chen 提交于
      * Move stream_anayzer to interpreter
      
      * Refactor StreamAnalyzer
      
      * Refactor RunNextInstructionList
      
      * Remove no_data_transform_index
      
      * Fix typos
      
      * Fix data_transfer OpFuncType error
      
      * Add event for depend_op
      
      * Update transfer OpFuncType for heter place
      889318d8
  11. 15 11月, 2022 1 次提交
    • S
      mkldnn directory cleanup (#47779) · 8a339d24
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      8a339d24
  12. 14 11月, 2022 1 次提交
  13. 11 11月, 2022 1 次提交
    • Z
      Refine shape op lanch method for standalone executor (#47843) · 981d1a10
      zhangbo9674 提交于
      * refine shape op in new_exe
      
      * Revert "refine shape op in new_exe"
      
      This reverts commit 0e0336ddc5eede3da019b348a0bcc0ef0f3be64e.
      
      * refine shape op in new_exe
      
      * refine shape expected_kernel_type
      
      * add SelectedRows check for shape op
      
      * refine code
      981d1a10
  14. 07 11月, 2022 1 次提交
    • H
      [Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      
      * Call SetDnnFallback function in the base class
      
      * activation fallback to plain kernel
      
      * fix default GetExpectedKernelType find wrong kernel
      
      * search cudnn kernel instead of fallback
      
      * fix cudnn_handle bug
      
      * remove tanh use_cudnn
      
      * restore tanh use_cudnn
      
      * debug tanh
      
      * fix tanh bug
      
      * delete activation cudnn kernel
      
      * polish code
      908a381d
  15. 03 11月, 2022 1 次提交
  16. 02 11月, 2022 2 次提交
  17. 01 11月, 2022 2 次提交
    • H
      [Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      f9134045
    • R
      Support custom stream for standalone executor (#47411) · e12b6c04
      Ruibiao Chen 提交于
      * [Auto Parallel] Improve the c++ dist attr
      
      * [Auto Parallel] Modify test_program.py
      
      * Support custom stream for standalone executor
      Co-authored-by: NYulong Ao <aoyulong@baidu.com>
      e12b6c04
  18. 31 10月, 2022 1 次提交
    • K
      [ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e
      kangguangli 提交于
      * replace executor in conditional_block_op.run with standalone_executor
      
      * add block_id as the argument of standalone executor's method run; add print for program
      
      * fix scope bug about conditional block op
      
      * fix bug: unnecessary return of fetch value
      
      * fix typo
      
      * fix: quantization will set variable persistable, and these variables must exist in global scope
      
      * add interpretercore cache for conditional block op but not activate in default
      
      * fix bug: local scope reuse for conditional block op
      
      * reset scope when conditional block op runs
      
      * fix typo
      
      * fix typo and code style
      
      * add build scope for conditional block op
      
      * add skip for transfer_layout kernel
      
      * refind code
      
      * fix reset_scope
      
      * fix reset_scope
      
      * refine code
      
      * refine code
      
      * refine code
      
      1. remove flag use in conditional_block_op
      2. pass execution_config to BuildOpFuncList instead of individual parameter
      
      * refine code
      
      * remove the use of FLAGS_control_flow_use_new_executor_cache
      
      * change FLAGS_control_flow_use_new_executor to false
      3b219e5e
  19. 26 10月, 2022 1 次提交
  20. 19 10月, 2022 1 次提交
  21. 10 10月, 2022 1 次提交
  22. 27 9月, 2022 1 次提交
  23. 23 9月, 2022 1 次提交
  24. 02 9月, 2022 1 次提交
  25. 19 8月, 2022 1 次提交
  26. 02 8月, 2022 1 次提交
  27. 20 7月, 2022 1 次提交