- 01 3月, 2023 3 次提交
-
-
由 duanyanhui 提交于
* add support of int64 add for xpu * add transpose support for int64 * add randperm kernel * fix randperm * add distribute_fpn_proposal kernel * fix comment * add reduce_sum_int32
-
由 engineer1109 提交于
-
由 risemeup1 提交于
-
- 28 2月, 2023 20 次提交
-
-
由 joanna.wozna.intel 提交于
* Add gru qat int8 test * Change place of model downloading * Update paddle/fluid/inference/tests/api/CMakeLists.txt Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * Correct flags names and add description --------- Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
由 HongyuJia 提交于
-
由 gouzil 提交于
* [phi] move device_wrapper from fluid to phi * [phi] fix ‘PADDLE_ENFORCE_XDNN_SUCCESS’ was not declared in this scope
-
由 iLeGend 提交于
-
由 Hulek 提交于
* Added file * Tests separated and rewritten, fixed fc_lstm_fuse_pass * Resolve conflicts
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
* [Extension Operants] Extension supports tensor operants * Polish fluid init_tensor_operants
-
由 xiaoguoguo626807 提交于
* modify name * merge develop * original code * build modify * success 2*2 * fused dim=1 failed * success * modify static * success for static except dim=1 * delete log * tmp modify * success * success * add fp1664 * delete fp16 cpu test * stop windows test * review modify * modify tanh test * modify tanh * fix_conflixt * modift static prim * fix_conflict * Update test_static_prim.cc * update * bug fix
-
由 HongyuJia 提交于
* [C++ API GetAllocator] Add C++ GetAllocator interface * move api to accurate directory
-
由 GGBond8488 提交于
* add cumsum prim backward * skip aixs=None test case * fix op generante eror * fix static test error * remove unused code * fix static test error * skip cpu float16 test case * skip eager cpu cumsum float16 test case * add cinn test * reshape flatten out * Disable cinn single test * remove cinn test * reformat todo * add prim in cumsum op test * remove old test * fix typro * fix typro * fix typro * pass axis=None test case * remove forward prim test * remove same name axis
-
由 zhupengyang 提交于
-
由 shentanyue 提交于
-
由 yuehuayingxueluo 提交于
-
由 张春乔 提交于
* add unittest for nn.DropOut2D * add fp16 * add fp16 in docs of temporal_shift_op.cc * Update test_dropout_op.py
-
由 zhoutianzi666 提交于
* forbid tensorrt_engine op's output is a persistable var
-
由 taixiurong 提交于
-
由 Yuanle Liu 提交于
-
由 wenbin 提交于
* fix concat bug * recommit for ci
-
由 niuliling123 提交于
-
由 Jiabin Yang 提交于
* support transpose and reshape * support reshpe, transpose, cast vjp * merge develop * recover unused file * remove prim base * support problem * remove additional status settting * remove additional status settting * fix ut * fix ut * fix ut * fix no grad branch * add more test * disable fp16 in cpu * fix test
-
- 27 2月, 2023 17 次提交
-
-
由 jiangcheng 提交于
-
由 zyfncg 提交于
* add inferface of get registered phi kernels * change KernelType to KernelKey * add test * refactor code
-
由 houj04 提交于
* [XPU] add fp16 support for shape op. * [XPU] add fp16 support for lookup_table_v2 op. * update approval list: add qingshu's id.
-
由 Zhang Jun 提交于
-
由 张春乔 提交于
* remove utils * remove utils * remove utils * remove utils * Update get_data_from_tensor.h * Update rnn_functor.h * Update rnn_grad_kernel.cu.cc * Update rnn_kernel.cu.cc * Update rnn_kernel.cc * Update rnn_grad_kernel.cu.cc * Update rnn_functor.h * Update rnn_kernel.cu.cc * Update rnn_kernel.cc * remove utils * Update rnn_functor.h * remove utils * remove utils * remove utils * remove utils * remove utils * Update rnn_functor.h * Update unsqueeze_op.h * Update utils.h * roll back * Update tensor_utils.h * Update tensor_utils.h * Update tensor_utils.h * Update tensor_utils.h * Update tensor_utils.h * use TensorToVector * use TensorToVector * use TensorToVector * use TensorToVector * use TensorToVector * Update rnn_kernel.cc * Update rnn_grad_kernel.cc * Update rnn_functor.h * Update rnn_grad_kernel.cu.cc * Update rnn_kernel.cu.cc * Update rnn_functor.h * Update rnn_grad_kernel.cu.cc * Update rnn_kernel.cu.cc * Update rnn_functor.h * Update rnn_grad_kernel.cu.cc * Update rnn_kernel.cu.cc * add TensorToVector * roll back * Update tensor_utils.h * Update rnn_functor.h * Update rnn_grad_kernel.cu.cc * Update tensor_utils.h * Update rnn_kernel.cu.cc * Update rnn_grad_kernel.cc * Update rnn_kernel.cc * Update rnn_grad_kernel.cu.cc * Update rnn_kernel.cu.cc * Update rnn_grad_kernel.cc * Update rnn_kernel.cc * TensorCopySync to phi::Copy * fix codestyle * rnn_kernel.cc: add ; * replace all GetDataFromTensor with phi::GetVectorFromTensor * delete include of util.h
-
由 Wang Bojun 提交于
* add sm version check * use GetGPUComputeCapability
-
由 HongyuJia 提交于
* [Tensor Operants & Prim] Tensor pow API uses elementwise_pow * unittest change to fill_constant+elementwise_pow
-
由 HongyuJia 提交于
* [Error Msg] Polish error message when GPU kernel not found * Only test in GPU environment
-
由 Bo Zhang 提交于
* conflict * add UpdateSliceAttrs
-
由 gaoziyuan 提交于
-
由 csy0225 提交于
-
由 jameszhang 提交于
* [kunlun] support reduce_scatter * uncomment unittest * update xccl to 1.0.10
-
由 Yiqun Liu 提交于
-
由 zhouweiwei2014 提交于
-
由 zhangbo9674 提交于
* add TypeUniquer and IrContext * refine include code * add Type, TypeBase * add built-in type * add bulit-in Float32Type * refine ut * refine code * refine code * delete type_base * rename ImplType to StorageType * rename ImplType to StorageType * add macros util for register type * add macros util for register type * refine name * refine name * change storage manager * add multi_thread for ir_ctx * rwlock_2_spinlock, add REGISTER_TYPE_2_IRCONTEXT * DECLARE_TYPE_UTILITY_FUNCTOR * refine ircontext singleton * del destructor for ParametricStorageManager * refine code * Add necessary logs for debugging * refine ir_context instance * refine type get interface * refine code by comment
-
由 wangshengxiang 提交于
* [XPU] bind op scatter_nd_add * [XPU] add more data type for op: clip, transpose2 & assign_value
-
由 shaojie_wang 提交于
* register bfloat16 datatype for squared l2 norm * register bfloat16 datatype for softmax with upper triangular mask * register bfloat16 for tril triu cuda kernel
-