1. 07 11月, 2022 5 次提交
  2. 04 11月, 2022 7 次提交
  3. 03 11月, 2022 3 次提交
    • P
      Fix oneDNN elementwise_sub dnnl_error in unit test (#47237) · 30c7758f
      Piotr Paturej 提交于
      * Fix dnnl errors in elementwise_sub tests
      
      * Fix model accuracy attempt
      
      * Add new fix
      
      * Add proper fix
      
      * Refactor by removing code repetition
      30c7758f
    • R
      Improve performance of coalesce_tensor and depend op in standalone executor (#47606) · 5fb1e824
      Ruibiao Chen 提交于
      * Dispath computation OPs before communication in standalone executor
      
      * Update code
      
      * Fix CI errors
      
      * Improve performance of coalesce_tensor and depend OP in standalone executor
      
      * pre-commit check
      5fb1e824
    • S
      [PHI] Migrate softmax kernel (#47339) · b8ae3858
      Sławomir Siwek 提交于
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix marco error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * remove redundant imports
      
      * migrate softmax
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * merge dev
      
      * fix map at error
      
      * adjust attribute
      
      * adapt funcs to PHI
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      b8ae3858
  4. 02 11月, 2022 4 次提交
  5. 01 11月, 2022 9 次提交
    • L
      fix dynamic link of xpu library (#47434) · 9d801855
      Leo Chen 提交于
      * refine comments,test=kunlun
      
      * link xpu lib, test=kunlun
      
      * add sleep for test, test=kunlun
      
      * merge develop, fix compile, test=kunlun
      
      * remove debug code, test=kunlun
      
      * add dependency to avoid potential concurrency error, test=kunlun
      9d801855
    • H
      [Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      f9134045
    • H
      clean mkldnn headerfile (#47507) · a341bb8c
      HongyuJia 提交于
      a341bb8c
    • L
      Fix bugs in tranpose kernel (#47212) · ec7fe888
      limingshu 提交于
      * first commit
      
      * transpose_kernel_optimization
      
      * first complishment of transpose op
      
      * second commit
      
      * refine code logics of tranpose_kernel
      
      * refine transpose kernel
      
      * first commit
      
      * fix DtoD copy bugs for hip
      
      * refine code according to the PR advice
      
      * change dim to int64_t type.
      
      * fix some type error
      ec7fe888
    • Y
      [PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
      YuanRisheng 提交于
      * standard_api
      
      * add hardtanh
      399047d7
    • F
      fix:add no support for cuda_arch<700 (#47509) · 974f8f32
      feng_shuai 提交于
      974f8f32
    • W
      remove unused-local-typedefs warning on linux (#47513) · 96f36962
      Wang Xin 提交于
      96f36962
    • Z
      Generate static graph code for some activation ops by Yaml (part2) (#47440) · c5d99138
      zyfncg 提交于
      * gene static graph code for ceil, expm1 op
      
      * gene static graph code for some activation op
      
      * fix bug
      
      * revert doc of silu and logsigmoid
      c5d99138
    • C
      Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9
      Chen Weihang 提交于
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix marco error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * fix map at error
      
      * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      
      * remove useless extra attrs
      
      * replace mkldnn_engine by onednn_engine
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      c923e6c9
  6. 31 10月, 2022 6 次提交
    • Y
      [PHI]Standardise some C++ API (#47385) · 60e0c506
      YuanRisheng 提交于
      * standard api
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ce bugs
      60e0c506
    • F
      optimize: vit 384 (#47432) · 520adc0e
      feng_shuai 提交于
      * optimize: vit 384
      
      * fix:bug
      
      * fix:bug
      
      * fix:supoort rocm complie
      
      * refactor:name
      
      * fix:support rocm
      
      * fix:__HIP_NO_HALF_CONVERSIONS__
      
      * optimize: delete scalar
      
      * fix:rocm can't support
      
      * fix:ernie error
      520adc0e
    • K
      [ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e
      kangguangli 提交于
      * replace executor in conditional_block_op.run with standalone_executor
      
      * add block_id as the argument of standalone executor's method run; add print for program
      
      * fix scope bug about conditional block op
      
      * fix bug: unnecessary return of fetch value
      
      * fix typo
      
      * fix: quantization will set variable persistable, and these variables must exist in global scope
      
      * add interpretercore cache for conditional block op but not activate in default
      
      * fix bug: local scope reuse for conditional block op
      
      * reset scope when conditional block op runs
      
      * fix typo
      
      * fix typo and code style
      
      * add build scope for conditional block op
      
      * add skip for transfer_layout kernel
      
      * refind code
      
      * fix reset_scope
      
      * fix reset_scope
      
      * refine code
      
      * refine code
      
      * refine code
      
      1. remove flag use in conditional_block_op
      2. pass execution_config to BuildOpFuncList instead of individual parameter
      
      * refine code
      
      * remove the use of FLAGS_control_flow_use_new_executor_cache
      
      * change FLAGS_control_flow_use_new_executor to false
      3b219e5e
    • C
      bb6356e8
    • N
      fix typos for `True` and `False` (#47477) · f5912d0c
      Nyakku Shigure 提交于
      * fix typo `Fasle`/`Flase` -> `Flase`
      
      * fix typo `Ture` -> `True`
      f5912d0c
    • W
      remove boost compiler flags in flags.cmake (#47468) · 91096ae2
      Wang Xin 提交于
      91096ae2
  7. 28 10月, 2022 3 次提交
  8. 27 10月, 2022 3 次提交