- 25 May, 2023 1 commit
-
-
Committed by Leo Chen
* add log for memory stats
* fix string_split in einsum
-
- 24 May, 2023 6 commits
-
-
Committed by Yiqun Liu
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas.
* Change the repeat of cublaslt to 10.
* Use FLAGS_cublaslt_exhaustive_search_times as repeats.
* Fix compiling error on CI.
* Polish the key and simplify codes.
-
Committed by zhangyuqin1998
-
Committed by zhangyuqin1998
* move raw kernels to legacy
* Update elementwise_add_kernel.cu
* fix
-
Committed by wz1qqx
-
Committed by Winters Montagne
Removed unnecessary header files introduced
-
Committed by lijin23
[XPU][PHI Kernels] bind bitwise_add kernel & add int32/int64 support to scatter_nd_add kernel for xpu (#54066)
* bind new kernels to xpu
* refine code
* fix bugs in unittest
-
- 23 May, 2023 6 commits
-
-
Committed by Zhang Zheng
* [AMP OP&Test] Support float16 in selu
* fix
-
Committed by RuohengMa
-
Committed by weishengying
-
Committed by cyberslack_lee
-
Committed by Wang Xin
* static graph autogen code support for pad3d op
* bug fixed
* add ut for pad3d mkldnn op
* fix coverage
* fix bug
* fix bug
* Delete test_pad3d_mkldnn_op.py
-
Committed by zhangyikun02
-
- 22 May, 2023 8 commits
-
-
Committed by risemeup1
* update_c++14_to_c++17_on_windows
* disable test_audio_logmel_feature and test_audio_mel_feature
-
Committed by risemeup1
-
Committed by lijin23
* fix empty bugs for xpu
* fix empty bugs for xpu
-
Committed by zhupengyang
-
Committed by zhoutianzi666
* fix transfer_layout when input size is too big
* do not add TransferLayoutKernelGPU
* add int64 and add check
-
Committed by zhangyikun02
-
Committed by Tian Zheng
* Add GPU kernel for multiclass_nms3 op
* Make multiclass_nms3 gpu kernel output consistent with cpu kernel
* Fix API incompatibility
* Fix unittests on builds without CUDA
* Fix ROCM build
* Remove fluid headers; Use default atol for unittest
* Change function and variable naming
* Add comments; Reduce redundant code
* Use paddle test framework
-
Committed by wangshengxiang
* bind xpu op: 3D grid sample
* fix edge cases in xpu op: reshape & slice
-
- 19 May, 2023 10 commits
-
-
Committed by wz1qqx
-
Committed by limingshu
-
Committed by Galaxy1458
-
Committed by Galaxy1458
-
Committed by Galaxy1458
-
Committed by Danyang Zhang
* delete bf16 of cross entropy
* delete bf16 of cross entropy
-
Committed by Galaxy1458
-
Committed by Galaxy1458
-
Committed by Galaxy1458
-
Committed by zhangyuqin1998
-
- 18 May, 2023 6 commits
-
-
Committed by Charles-hit
* add meshgrid,expand_as, prod and grad bf16 kernel
* fix bf16 for optest
* modify code style
* fix amp test
-
Committed by engineer1109
-
Committed by Hulek
* Fused elementwises kernels and ops
* change fuse pass name
* adjust .pbtxt files
* adjust quantization attributes
* add missing arguments and fix others, review fixed
* simplify fused kernel registration
* fix elementwise unit tests
* reuse one fused elementwise op
* adjust proto
* Add supported datatypes
* Change 'Scale' to 'scale' in tests, change some tests to onednn
* Revert breaking changes
* Fix unit tests
* Delete obsolete test cases
* Delete commented out code
* Fix codestyle
* delete temporary condition
* fix conflicts and delete duplicate fusing
* Fix code after merge
* Move tests to new directory
* fix tests volatility
* Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py
* Update CMakeLists.txt add mkldnn op test

---------

Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
Committed by huangjiyi
-
Committed by co63oc
-
Committed by LoneRanger
-
- 17 May, 2023 1 commit
-
-
Committed by LyndonKong
* Add i1 and i1e op
* resolve merge conflicts
-
- 16 May, 2023 2 commits