- 15 12月, 2022 4 次提交
-
-
由 Ruibiao Chen 提交于
-
由 Wangzheee 提交于
-
由 Wilber 提交于
* memory_optimize and mkdlnn problem * update * update * update
-
由 Wen Sun 提交于
-
- 14 12月, 2022 11 次提交
-
-
由 Ming-Xu Huang 提交于
-
由 PuQing 提交于
* fix c++17-extensions warning on macos * fix type fix c++17-extensions warning on macos fix c++17-extensions warning on macos
-
由 Yuanle Liu 提交于
-
由 limingshu 提交于
* First Commit. * add some codes * add elementwise loader * fix code styles * merge with develop * add some changes both in elementwise and transpose * add init operation in broadcast kernel. * change codes according to pr suggestions about transpose file * fix error for op-benchmark ci * fix according to ci
-
由 james 提交于
* nullptr bugfix for XPU pg mode Also a few kernels is added to xpu whitelist * increase error msg length
-
由 zqw_1997 提交于
* modify cmake file for cuda11.8 compile * add op_library(fused_embedding_eltwise_layernorm_op DEPS bert_encoder_functor)
-
由 zhangkaihuo 提交于
-
由 tianshuo78520a 提交于
* test=doucment_fix * test=document_fix
-
由 Hulek 提交于
* Deleted mkldnn_inplace_pass code * Fixed error with cmake * Resolve conflicts
-
由 Zhang Jun 提交于
* add more unary op and square
-
由 Yiqun Liu 提交于
-
- 13 12月, 2022 8 次提交
-
-
由 wanghuancoder 提交于
* fix rmsprop_ yaml bug
-
由 joanna.wozna.intel 提交于
-
由 sneaxiy 提交于
* save fused_attention memory when dropout_rate = 0.0 * add ut * fix ut bug * fix fused_layernorm_residual_dropout_bias_test.cu
-
由 HappyHeavyRain 提交于
* generate static graph code of some ops by yaml * fix the code-style of yaml * fix the framework_ci for triangular_solve * change the 'data_type' of scatter * add the 'out: Out' of scatter_nd_add
-
由 Qi Li 提交于
-
由 engineer1109 提交于
-
由 weishengying 提交于
-
由 Wangzheee 提交于
* fix some transformer unitest
-
- 12 12月, 2022 16 次提交
-
-
由 wanghuancoder 提交于
This reverts commit 592ed40b.
-
由 RichardWooSJTU 提交于
* fit paddle.matmul in fleetx.gpt
-
由 YuanRisheng 提交于
* add new tensor * fix windows compile bugs * fix ci bugs * fix ci bugs * fix ci bugs * perfect according comment * fix ci compile bugs * add raw tensor * fix ci bugs * modify code by comment * delete String
-
由 Netpunk 提交于
* replace dependency of inclusive_scan.h from phi * format code
-
由 傅剑寒 提交于
* fix codestyle * add double complex<float> complex<double> dtype support for syevj_batched * fix use_syevj flag for precision loss when input dtype of syevj_batch is complex128 in some case * optimize eigh in different case * fix missing ; bug * fix use_syevj bug * fix use_cusolver_syevj_batched flag
-
由 feng_shuai 提交于
* fix: there are some bugs with trt 8.0 * fix:windows CI trt is too old
-
由 Paulina Gacek 提交于
* OneDNN version of Copy, tranpose kernels adjusted * style fixes in tranpose_grad * redundant headers deleted
-
由 Yiqun Liu 提交于
* Enable to print device info. * Enhance the nan and inf checking for cpu. * Implement a common print function. * Unify the check of complex numbers. * Rewrite the omp method. * Count and print the number of nan and inf. * Change the print content. * Add unittest.
-
由 feng_shuai 提交于
-
由 zhoutianzi666 提交于
-
由 huangjiyi 提交于
* move norm_utils.cu.h from fluid to phi * remove norm_utils.h in fluid * fix bugs and replace mutable_data with Alloc * replace mutable_data with Alloc
-
由 zyfncg 提交于
-
由 zhoutianzi666 提交于
-
由 HappyHeavyRain 提交于
* generate static graph code of some ops by yaml, test = develop * fix 'take_along_axis' yaml style * reset scatter/scatter_nd_add * delete the comments of put_along_axis
-
由 Ruibiao Chen 提交于
* Add UT * Support cross-step stream synchronization for standalone executor * Fix typos * Fix typos * Update UTs
-
由 Wen Sun 提交于
* chore: unify `SingleTensor` * feat: dynamic check
-
- 11 12月, 2022 1 次提交
-
-
由 limingshu 提交于
* first commit. * refine performance with fast_divmod * refine performance with fast_divmod
-