1. 01 3月, 2022 1 次提交
  2. 28 2月, 2022 1 次提交
  3. 25 2月, 2022 2 次提交
    • Y
      Disable dist ut cases (#39906) · 4fe465cb
      YUNSHEN XIE 提交于
      * disable some distribute test case when in CPU test env
      
      * disable some test case when in CPU test env
      
      * fix
      4fe465cb
    • C
      [Phi] Support cudnn kernel moving & move softmax kernels (#39547) · 8895379a
      Chen Weihang 提交于
      * support cudnn kernel moving
      
      * polish cmake rules
      
      * add unittest for coverage
      
      * remove orig kernel
      
      * remove softmax cudnn kernel
      
      * fix softmax test failed
      
      * fix npu func error
      
      * resolve conflict
      
      * rename gpu dnn kernels
      
      * fix name rule error
      
      * fix compile error
      
      * update fp16 namespace
      8895379a
  4. 24 2月, 2022 2 次提交
  5. 23 2月, 2022 1 次提交
  6. 22 2月, 2022 3 次提交
  7. 21 2月, 2022 1 次提交
    • C
      Update record interface using part2 (#39694) · c984cd85
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
      c984cd85
  8. 20 2月, 2022 1 次提交
  9. 19 2月, 2022 1 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
  10. 18 2月, 2022 2 次提交
  11. 17 2月, 2022 2 次提交
    • S
      [bugfix] to concat input squash (#39593) · f29da150
      Sylwester Fraczek 提交于
      * fix and add more tests
      
      * remove unwanted changes
      
      * check only concat and elementwise
      
      * move check to a function
      
      * add todo comment
      
      * Revert "fix ptq fc attr name fuse_activation->activation_type"
      
      This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
      f29da150
    • W
      adaptive pool2d pass fix (#39600) · c1c5c1fc
      wenbin 提交于
      * first commit
      
      * teller fix
      
      * bug fix
      
      * enable for pool2d only
      
      * fix global_pooling issue
      
      * pooling_type
      
      * fix test
      c1c5c1fc
  12. 16 2月, 2022 2 次提交
    • W
      [Paddle-Inference] support preln-ernie: add preln_emb_eltwise_layernorm_op,... · f31c2426
      Wangzheee 提交于
      [Paddle-Inference] support preln-ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op (#39570)
      
      * support preln_ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op
      
      * support preln_ernie: add preln_emb_eltwise_layernorm_op, preln_skip_layernorm_op
      f31c2426
    • Y
      [Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270
      YuanRisheng 提交于
      * remove reshape and elementwise_add registry
      
      * delete code
      
      * fix bugs when run ci ut
      
      * remove log
      
      * fix bugs when run unit test
      
      * fix bugs when run unit test
      
      * fix bugs when run cinn
      
      * fix bugs when run ci-mac-python3
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix bugs when run kunlun
      
      * fix bugs when compile
      
      * update code according comment
      c6478270
  13. 15 2月, 2022 5 次提交
    • W
      [Paddle-Inference] support preln_ernie: add... · 2bc91cc5
      Wangzheee 提交于
      [Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)
      
      * support preln_ernie
      
      * support preln_ernie
      2bc91cc5
    • F
      pool2d_coonvert_ut (#39545) · cf8a5573
      feng_shuai 提交于
      cf8a5573
    • L
      [Paddle-TRT] Replace GeLU plugin with TensorRT built-in layer for TensorRT 7.0. (#38399) · a3689d8c
      Leo Chen 提交于
      * Replace GeLU plugin with TRT built-in layers for approximate GeLU
      
      * Add TensorRT built-in layer for nonapproximate GeLU
      a3689d8c
    • F
      delete mish_convert_ut skip (#39432) · 8cedcd3e
      feng_shuai 提交于
      8cedcd3e
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  14. 14 2月, 2022 1 次提交
    • S
      [Bug fix] prevent squashing pair u8 dequantize -> s8 quantize (#39346) · 66b5348e
      Sylwester Fraczek 提交于
      * prevent squashing pair u8 dequantize -> s8 quantize
      
      * add relu op to check for uint8
      
      * fix ptq fc attr name fuse_activation->activation_type
      
      * fix
      
      * add unit test
      
      * remove unused variable
      
      * test fix unsuccessful
      
      * fix test and logic
      
      * multiline comment
      
      * remove cout
      
      * Revert "fix ptq fc attr name fuse_activation->activation_type"
      
      This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
      
      * fix ptq fc attr name fuse_activation->activation_type
      66b5348e
  15. 11 2月, 2022 3 次提交
  16. 10 2月, 2022 2 次提交
  17. 09 2月, 2022 1 次提交
  18. 06 2月, 2022 1 次提交
  19. 28 1月, 2022 1 次提交
  20. 27 1月, 2022 4 次提交
    • A
      [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi 提交于
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
      a8879215
    • W
      fix shuffle_channel_detect_pass (#39242) · af9ddeb7
      wenbin 提交于
      * shuffle channel pass
      
      * add ut
      
      * timeout fix
      
      * makefile fix
      af9ddeb7
    • 9b79988c
    • W
      [Paddle-Inference]: fix concat slice (#39096) · f080e8d5
      Wangzheee 提交于
      * Paddle-Inference:fix_concat_slice
      
      * Paddle-Inference:fix_concat_slice
      
      * Paddle-Inference:fix_concat_slice
      
      * Paddle-Inference:fix_concat_slice
      
      * [Paddle-Inference]: fix concat slice
      
      * [Paddle-Inference]: fix concat slice
      
      * [Paddle-Inference]: fix concat slice
      f080e8d5
  21. 26 1月, 2022 2 次提交
    • L
      [pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1
      Leo Chen 提交于
      * update cmake file to remove fluid kernel
      
      * add pten declaration.h to where pybind.h used
      
      * fix sync_bn and tensorrt_engine
      
      * refine detection_library
      
      * fix interpreter_core
      
      * support eager legacy
      
      * fit eager legacy for pten
      
      * fall back to cpu if not found kernel
      
      * fix compile problem
      
      * fix compile problem
      
      * refine fallback logic
      
      * fit operator.run()
      
      * fix xpu compile
      
      * fit for new_exec
      
      * add REGISTER_OP_WITHOUT_GRADIENT
      
      * un-cache pt_kernel_context
      
      * fix compile
      
      * fix cudnn
      
      * fix compiling with on_infer
      
      * fix mkldnn
      
      * fix isfinite_v2
      
      * fix xpu problem
      
      * fix op_device
      
      * refine fallback for xpu
      
      * fix xpu compile
      
      * merge develop
      
      * refine code format
      
      * fix compile
      
      * fix compile
      
      * add data_transfer
      
      * fix PreparePtenData
      
      * fix cpu context
      
      * merge develop
      
      * fix compile
      
      * fix error device context
      
      * fix xpu
      
      * fix dev_ctx
      3ab9aef1
    • B
      support npu weight unified H2D copy before inference (#39160) · 106b5514
      baoachun 提交于
      * support npu weight unified H2D copy
      
      * remove redundant variable
      106b5514
  22. 25 1月, 2022 1 次提交