1. 26 12月, 2022 2 次提交
  2. 23 12月, 2022 1 次提交
  3. 19 12月, 2022 2 次提交
  4. 12 12月, 2022 1 次提交
  5. 09 12月, 2022 1 次提交
  6. 08 12月, 2022 1 次提交
  7. 30 11月, 2022 1 次提交
  8. 29 11月, 2022 2 次提交
  9. 28 11月, 2022 2 次提交
  10. 26 11月, 2022 1 次提交
  11. 25 11月, 2022 2 次提交
    • C
      [PROFILER] add flops for Profiler (#47766) · 3d1981ad
      Chitsing KUI 提交于
      * attr ready
      
      * op ip ready
      
      * start dynamic
      
      * end2end ok
      
      * input shape to map, stat by op
      
      * layer wip
      
      * first version ready
      
      * fix proto depds
      
      * fix profiler deps
      
      * fix flops typo, rm tuple shape
      3d1981ad
    • R
      Refactor stream anayzer (#48158) · 889318d8
      Ruibiao Chen 提交于
      * Move stream_anayzer to interpreter
      
      * Refactor StreamAnalyzer
      
      * Refactor RunNextInstructionList
      
      * Remove no_data_transform_index
      
      * Fix typos
      
      * Fix data_transfer OpFuncType error
      
      * Add event for depend_op
      
      * Update transfer OpFuncType for heter place
      889318d8
  12. 17 11月, 2022 1 次提交
  13. 15 11月, 2022 1 次提交
    • S
      mkldnn directory cleanup (#47779) · 8a339d24
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      8a339d24
  14. 14 11月, 2022 1 次提交
  15. 11 11月, 2022 1 次提交
    • Z
      Refine shape op lanch method for standalone executor (#47843) · 981d1a10
      zhangbo9674 提交于
      * refine shape op in new_exe
      
      * Revert "refine shape op in new_exe"
      
      This reverts commit 0e0336ddc5eede3da019b348a0bcc0ef0f3be64e.
      
      * refine shape op in new_exe
      
      * refine shape expected_kernel_type
      
      * add SelectedRows check for shape op
      
      * refine code
      981d1a10
  16. 07 11月, 2022 1 次提交
    • H
      [Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      
      * Call SetDnnFallback function in the base class
      
      * activation fallback to plain kernel
      
      * fix default GetExpectedKernelType find wrong kernel
      
      * search cudnn kernel instead of fallback
      
      * fix cudnn_handle bug
      
      * remove tanh use_cudnn
      
      * restore tanh use_cudnn
      
      * debug tanh
      
      * fix tanh bug
      
      * delete activation cudnn kernel
      
      * polish code
      908a381d
  17. 03 11月, 2022 1 次提交
  18. 02 11月, 2022 2 次提交
  19. 01 11月, 2022 3 次提交
    • H
      [Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      f9134045
    • Y
      [PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
      YuanRisheng 提交于
      * standard_api
      
      * add hardtanh
      399047d7
    • R
      Support custom stream for standalone executor (#47411) · e12b6c04
      Ruibiao Chen 提交于
      * [Auto Parallel] Improve the c++ dist attr
      
      * [Auto Parallel] Modify test_program.py
      
      * Support custom stream for standalone executor
      Co-authored-by: NYulong Ao <aoyulong@baidu.com>
      e12b6c04
  20. 31 10月, 2022 1 次提交
    • K
      [ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e
      kangguangli 提交于
      * replace executor in conditional_block_op.run with standalone_executor
      
      * add block_id as the argument of standalone executor's method run; add print for program
      
      * fix scope bug about conditional block op
      
      * fix bug: unnecessary return of fetch value
      
      * fix typo
      
      * fix: quantization will set variable persistable, and these variables must exist in global scope
      
      * add interpretercore cache for conditional block op but not activate in default
      
      * fix bug: local scope reuse for conditional block op
      
      * reset scope when conditional block op runs
      
      * fix typo
      
      * fix typo and code style
      
      * add build scope for conditional block op
      
      * add skip for transfer_layout kernel
      
      * refind code
      
      * fix reset_scope
      
      * fix reset_scope
      
      * refine code
      
      * refine code
      
      * refine code
      
      1. remove flag use in conditional_block_op
      2. pass execution_config to BuildOpFuncList instead of individual parameter
      
      * refine code
      
      * remove the use of FLAGS_control_flow_use_new_executor_cache
      
      * change FLAGS_control_flow_use_new_executor to false
      3b219e5e
  21. 27 10月, 2022 1 次提交
    • L
      make all cpp tests dynamic linked to libpaddle.so [except windows] (#47088) · 2096448b
      Leo Chen 提交于
      * make all cpp tests dynamic linked to libpaddle.so
      
      * add comments
      
      * keep old cc_test for some tests
      
      * fix some ut
      
      * make some ut use cc_test_old
      
      * fix typos and fit for win32
      
      * fix lib path
      
      * fix some tests
      
      * skip lite test
      
      * fit for rocm
      
      * fit for cinn
      
      * fit for mac
      
      * fit for win32
      
      * skip inference ut
      
      * skip  windows
      
      * fix coverage
      2096448b
  22. 26 10月, 2022 2 次提交
  23. 19 10月, 2022 1 次提交
  24. 17 10月, 2022 1 次提交
  25. 13 10月, 2022 2 次提交
    • L
      [new-exec] remove variable scope, stage2 (#43936) · 1230a3f4
      Leo Chen 提交于
      * remove class ScopeBase
      
      * reopen test
      1230a3f4
    • H
      [Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759
      HongyuJia 提交于
      * remove PADDLE_WITH_MKLDNN, test white_list=abs
      
      * fix unique_ptr
      
      * fix op.Type()
      
      * remove TODO in kernel_dispatch.h
      
      * remove IndicateVarDataType function, update white_list
      
      * remove mkldnn hard code
      
      * add comments
      
      * fix ==
      
      * update mkldnn_op_list
      
      * delete hard code of OPs
      
      * update mkldnn_op_list
      
      * update mkldnn_op_list, remove interp
      
      * add error check for ExecutionContext
      
      * update mkldnn_op_list, remove transpose2_grad
      
      * remove interpolate mkldnn
      
      * remove fill_constant mkldnn
      
      * opt HasAttr in DygraphExecutionContext
      
      * deprecated commit, test mkldnn_white_list
      
      * deprecated commit, test mkldnn_white_list
      
      * deprecated commit, test mkldnn_black_list
      
      * update mkldnn_op_list, add assert error op
      
      * solve cudnn related op
      
      * fix error
      
      * add mkldnn fallback in phi_utils.cc
      
      * remove mkldnn fallback in phi_utils.cc
      
      * opt code implementation
      
      * polish Copyright License
      ef1c8759
  26. 12 10月, 2022 1 次提交
  27. 11 10月, 2022 1 次提交
  28. 10 10月, 2022 2 次提交
    • Y
      [PHI]Add RNN yaml (#46812) · ab60fd8b
      YuanRisheng 提交于
      * add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta
      
      * WIP: move rnn kernrl to phi
      
      * Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
      This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.
      
      * fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according comment
      Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
      ab60fd8b
    • L
      reduce time cost on atomic in interpretercore (#46688) · dd3d45de
      Leo Chen 提交于
      * reduce time cost on atomic in interpretercore
      
      * clear code of PrepareAtomic in interpretercore
      
      * refine threadpool cache
      dd3d45de
  29. 09 10月, 2022 1 次提交