- 25 May, 2023 5 commits
-
-
Committed by zhouweiwei2014
-
Committed by Leo Chen
* add log for memory stats * fix string_split in einsum
-
Committed by 张春乔
-
Committed by ronnywang
-
Committed by zhoutianzi666
* add tranfer_elim * transfer layout elimination
-
- 24 May, 2023 14 commits
-
-
Committed by Leo Chen
-
Committed by Yiqun Liu
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. * Change the repeat of cublaslt to 10. * Use FLAGS_cublaslt_exhaustive_search_times as repeats. * Fix compiling error on CI. * Polish the key and simplify codes.
-
Committed by zhangyuqin1998
-
Committed by zhangyuqin1998
* move raw kernels to legacy * Update elementwise_add_kernel.cu * fix
-
Committed by wz1qqx
-
Committed by Leo Guo
Fixed the bug in the api.cc file where there was an inconsistency between the specified type (std::vector<DenseTensor*>&) in the function pointer kernel_signature and the type of the phi kernel parameter (std::vector<DenseTensor*>) when the phi kernel is set to output as std::vector<DenseTensor*>. test=kunlun (#54053)
-
Committed by kangguangli
* add vector type support for program translator * polish * resolve conflicts * add verify for combine/slice and unittests * polish
-
Committed by 王明冬
-
Committed by xiaoguoguo626807
-
Committed by Zhang Jun
-
Committed by Winters Montagne
Removed unnecessary header files introduced
-
Committed by lijin23
[XPU][PHI Kernels] bind bitwise_add kernel & add int32/int64 support to scatter_nd_add kernel for xpu (#54066) * bind new kernels to xpu * refine code * fix bugs in unittest
-
Committed by Frank Lin
-
Committed by houj04
* [XPU] add retry for unittests * revert debug code.
-
- 23 May, 2023 21 commits
-
-
Committed by Zhang Zheng
* [AMP OP&Test] Support float16 in selu * fix
-
Committed by Fisher
* Enable check_cinn on some tests Tests: bitwise, compare, shape, assign_value, sum, expand_v2, lookup_table, lookup_table_v2 * Enable more CINN tests Tests with CINN: expand_v2, matmul, matmul_v2, mul, norm, one_hot_v2 Add target select in cinn_launch_op * Revert test_mul_op * Improve op unit tests
-
Committed by LiYuRio
-
Committed by RuohengMa
-
Committed by wanghuancoder
-
Committed by zhangbo9674
* Use copy_if_different to avoid recompilation of generated cutlass kernels. * add program parameter dialect_interface * fix op create bug * add conv2d * draft of paddle converter * fix CI * fix windows CI * fix program destructor * printer draft * fix bug * printer draft finish * fix windows CI * reserve inplace semantics * revert program::destroy since no need to do topology sort * revert * modify by reviews * commit printer and resnet50 related ops * fix * fix * fix op definition * refine op dyn_cast * fix bug * refine code * refine code * refine code * refine code * add code gen * refine code * refine code * refine code --------- Co-authored-by: umiswing <umiswing@foxmail.com> Co-authored-by: kangguangli <kangguangli@hotmail.com>
-
Committed by zhenhailiu
* merge code from forsish * polish * paddle/fluid/pybind/auto_parallel_py.cc * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish
-
Committed by gouzil
* [phi] autogen code tril_triu * [phi][api]fix tril_triu_grad args * [fluid] clean cmake; [phi] fix infer_meta
-
Committed by co63oc
-
Committed by Yuanle Liu
* fix trt inference fp16 io * fix inference fp16 io
-
Committed by co63oc
-
Committed by weishengying
-
Committed by co63oc
* Fix typos * Fix
-
Committed by cyberslack_lee
-
Committed by huangjiyi
* update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update HostAlloc * update param name * update cpu kernel * remove kernel header * update * update
-
Committed by Tian Zheng
-
Committed by Leo Chen
* add host memory stats * add ut
-
Committed by huangjiyi
* update * update * update * set out dtype
-
Committed by Wang Xin
* static graph autogen code support for pad3d op * bug fixed * add ut for pad3d mkldnn op * fix coverage * fix bug * fix bug * Delete test_pad3d_mkldnn_op.py
-
Committed by zhangyikun02
-
Committed by ronnywang
* [CustomDevice] fix auto_parallel * update * update * update
-