1. 17 2月, 2022 3 次提交
  2. 16 2月, 2022 17 次提交
  3. 15 2月, 2022 15 次提交
    • J
      disabled unnecessary int reorders profiling (#39498) · 3581c075
      jakpiase 提交于
      3581c075
    • W
      [Paddle-Inference] support preln_ernie: add... · 2bc91cc5
      Wangzheee 提交于
      [Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)
      
      * support preln_ernie
      
      * support preln_ernie
      2bc91cc5
    • R
      [PluggableDevice] Add custom runtime support (#38740) · 3e7825f3
      ronnywang 提交于
      * [CustomRuntime] Add DeviceManager
      
      * [CustomRuntime] Add DeviceInterface
      
      * [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager
      
      * [CustomRuntime] Add plug-in device
      
      * [CustomRuntime] Memory module support PluggableDevice
      
      * [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option
      
      * update
      
      * [API] update API doc based on comments, test=develop
      Co-authored-by: Nqili93 <qili93@qq.com>
      3e7825f3
    • F
      [Pten] move paddle/operators/math/functors.h and compound_functors.h (#39514) · 0d46a108
      Feiyu Chan 提交于
      * move paddle/operators/math/functors.h
      * move paddle/operators/math/compound_functors.h
      0d46a108
    • T
      Add cinn_instruction_run_op for launching execution of a cinn instruction (#39435) · 9d0baeab
      TeFeng Chen 提交于
      * add cinn_instruction_run_op for launching execution of a cinn instruction
      
      * fix multi definition compilation error
      
      * update cmake
      
      * fix bug at infershape
      
      * fix compile error due to lacking header file
      9d0baeab
    • F
      pool2d_coonvert_ut (#39545) · cf8a5573
      feng_shuai 提交于
      cf8a5573
    • L
      [Paddle-TRT] Replace GeLU plugin with TensorRT built-in layer for TensorRT 7.0. (#38399) · a3689d8c
      Leo Chen 提交于
      * Replace GeLU plugin with TRT built-in layers for approximate GeLU
      
      * Add TensorRT built-in layer for nonapproximate GeLU
      a3689d8c
    • H
      move histogram to pten (#39496) · 556f6eb0
      hong 提交于
      * move histogram to pten; test=develop
      
      * fix format error; test=develop
      
      * fix histogram kernel format; test=develop
      556f6eb0
    • F
      Move Abs OP to pten (#39492) · fb473067
      From00 提交于
      * Move Abs op to pten
      
      * Fix NPU compilation error
      
      * Fix CI error
      
      * Use LaunchSameDimsElementwiseCudaKernel in pten
      fb473067
    • W
      [Eager] Support SellectedRows MergeAdd case (#39449) · 6549a041
      Weilong Wu 提交于
      
      * Refactor SelectedRows MergeAdd func by using template
      
      * Add GetInnerMutable func instead of modify GetInnerMutableTensor
      
      * Updated PADDLE_ENFORCE statement
      
      * Remove useless PADDLE_ENFORCE statement
      
      * Polish Code
      6549a041
    • S
      add dropout fp32 (#39501) · b81358d1
      sneaxiy 提交于
      b81358d1
    • F
      delete mish_convert_ut skip (#39432) · 8cedcd3e
      feng_shuai 提交于
      8cedcd3e
    • F
      move algorithm.h (#39502) · 7eb9593e
      Feiyu Chan 提交于
      Move paddle/fluid/operators/math/algorithm.h to paddle/pten/kernels/funcs and rename all references to symbols in it.
      7eb9593e
    • L
      [Pten]Move expand_v2 to pten (#39471) · 2d16d69b
      Linjie Chen 提交于
      * move expand to pten
      
      * move expand_v2 to pten
      
      * move expand_v2 to pten
      
      * fix grad register
      
      * fix grad register
      
      * fix tensorcpry
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix tensorcopy
      
      * fix ci
      
      * fix tensorcopy
      2d16d69b
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  4. 14 2月, 2022 5 次提交