- 01 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 27 10月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* add predictor_engine * add predictor_engine * fix zero shape * fix lodTensor * fix unittest * fix code style * update CmakeList
-
- 26 10月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
* fc/matmuls + scale fuse pass * remove double-extension * add unit tests * comments from review * codestyle * add pass to int8 list * new codestyle * attr name typo
-
由 HongyuJia 提交于
* remove prior_box mkldnn hard code * add header file * simplify PD_VISIT_TYPE * decouple dependency between prior_box and density_prior_box * fix pragma omp parallel error * bypass #pragma omp_parallel_for error * polish code * remove visit_type headerfile * polish codestyle * polish codestyle * try fix CI error * add testcase, datatype=float64 * reset test_prior_box testcase * add datacheck to DenseTensor * update template name * call prior_box with macro expand
-
- 25 10月, 2022 2 次提交
- 24 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* delete GetExpectedKernelType mkldnn of mul_grad * update mkldnn_op_list, remove mul_grad * delete GetExpectedKernelType mkldnn of mul
-
- 21 10月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
* fix nvprof_nvtx_push interface bug
-
- 20 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* remove fc mkldnn hardcode * remove useless enum of kFCMKLDNN * fix macro error * update operators.cmake
-
- 19 10月, 2022 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 Leo Chen 提交于
* clean unused code: piece.cc/h * clean usage
-
- 18 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
-
- 17 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
- 15 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
-
- 13 10月, 2022 2 次提交
-
-
由 Leo Chen 提交于
-
由 HongyuJia 提交于
* remove PADDLE_WITH_MKLDNN, test white_list=abs * fix unique_ptr * fix op.Type() * remove TODO in kernel_dispatch.h * remove IndicateVarDataType function, update white_list * remove mkldnn hard code * add comments * fix == * update mkldnn_op_list * delete hard code of OPs * update mkldnn_op_list * update mkldnn_op_list, remove interp * add error check for ExecutionContext * update mkldnn_op_list, remove transpose2_grad * remove interpolate mkldnn * remove fill_constant mkldnn * opt HasAttr in DygraphExecutionContext * deprecated commit, test mkldnn_white_list * deprecated commit, test mkldnn_white_list * deprecated commit, test mkldnn_black_list * update mkldnn_op_list, add assert error op * solve cudnn related op * fix error * add mkldnn fallback in phi_utils.cc * remove mkldnn fallback in phi_utils.cc * opt code implementation * polish Copyright License
-
- 11 10月, 2022 2 次提交
-
-
由 Wen Sun 提交于
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 10 10月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
* Add methods that find input or output name by var name * kind of bugfix - initialize variables * ci fix * review fixed
-
- 30 9月, 2022 3 次提交
-
-
由 Allen Guo 提交于
* paddle-inference support custom-ops Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> * fix tolower Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
-
由 ykkk2333 提交于
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun * migrate add_n kernep to phi, test=kunlun * fix bugs of tipc, test=kunlun
-
由 sneaxiy 提交于
* support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * add bfloat16 to selu_grad to pass CI * fix selu grad compilation error
-
- 29 9月, 2022 1 次提交
-
-
由 Leo Guo 提交于
Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)
-
- 28 9月, 2022 3 次提交
-
-
由 sneaxiy 提交于
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
由 Sławomir Siwek 提交于
* Relu6 * remove fluid handler * add individual kernel signature * coding style * replace bounded_relu with clip * whitespace * code style
-
- 26 9月, 2022 1 次提交
-
-
由 cifar10 提交于
-
- 25 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 22 9月, 2022 1 次提交
-
-
由 Chenxiao Niu 提交于
-
- 18 9月, 2022 1 次提交
-
-
由 RichardWooSJTU 提交于
-
- 16 9月, 2022 5 次提交
-
-
由 sneaxiy 提交于
* support int64 non-broadcast * support broadcast case for int64 index * fix bug * support more Arity * remove some codes * upgrade patchelf to v0.15.0 to pass CI build * fix bug * fix patchelf installation * add debug flags * remove useless codes * fix viterbi_decode and set_value op uts * remove always enable int64
-
由 chenjian 提交于
* avoid to synchronize all devices * synchronize custom device
-
由 JingZhuangzhuang 提交于
-
由 Leo Chen 提交于
* add interpretercore for jit engine * add ut
-
由 ronnywang 提交于
* [CustomDevice] add custom_device_resource_pool & device_event_custom_device * update * update * update * update
-
- 15 9月, 2022 2 次提交
-
-
由 Jacek Czaja 提交于
* - mul & matmul changes - fix - bs16 correction of strides * - cosmetic fixes * - lint * - fix * - fix * - format -> mem_desc * - fix * - fix * - fix * - fix * - fix
-
由 Nyakku Shigure 提交于
-
- 14 9月, 2022 2 次提交
-
-
由 JingZhuangzhuang 提交于
* merge python lib * Update third_party.cmake * Update CMakeLists.txt
-
由 JingZhuangzhuang 提交于
* Delay TensorRT registry * Add unused define * Fix TensorRT test * fix function to reference * Update trt_plugin.h
-
- 09 9月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add operator<< for BuildStrategy * add fake_coalesce * fit allreduce mode for new_exe * remove dubeg code * follow comments
-