- 14 12月, 2022 4 次提交
-
-
由 PuQing 提交于
* fix c++17-extensions warning on macos * fix type fix c++17-extensions warning on macos fix c++17-extensions warning on macos
-
由 limingshu 提交于
* First Commit. * add some codes * add elementwise loader * fix code styles * merge with develop * add some changes both in elementwise and transpose * add init operation in broadcast kernel. * change codes according to pr suggestions about transpose file * fix error for op-benchmark ci * fix according to ci
-
由 james 提交于
* nullptr bugfix for XPU pg mode Also a few kernels is added to xpu whitelist * increase error msg length
-
由 zhangkaihuo 提交于
-
- 12 12月, 2022 4 次提交
-
-
由 Netpunk 提交于
* replace dependency of inclusive_scan.h from phi * format code
-
由 傅剑寒 提交于
* fix codestyle * add double complex<float> complex<double> dtype support for syevj_batched * fix use_syevj flag for precision loss when input dtype of syevj_batch is complex128 in some case * optimize eigh in different case * fix missing ; bug * fix use_syevj bug * fix use_cusolver_syevj_batched flag
-
由 Paulina Gacek 提交于
* OneDNN version of Copy, tranpose kernels adjusted * style fixes in tranpose_grad * redundant headers deleted
-
由 huangjiyi 提交于
* move norm_utils.cu.h from fluid to phi * remove norm_utils.h in fluid * fix bugs and replace mutable_data with Alloc * replace mutable_data with Alloc
-
- 11 12月, 2022 1 次提交
-
-
由 limingshu 提交于
* first commit. * refine performance with fast_divmod * refine performance with fast_divmod
-
- 09 12月, 2022 4 次提交
-
-
由 Sławomir Siwek 提交于
* reshape * typo * remove header
-
由 niuliling123 提交于
-
由 Leo Chen 提交于
* move share_buffer kernel to phi * fix ut * add source file * fix window links
-
由 PuQing 提交于
-
- 08 12月, 2022 8 次提交
-
-
由 limingshu 提交于
-
由 haosicheng 提交于
-
由 jakpiase 提交于
Reenabled ext_reorder recording for TransDataLayoutFromOneDNN
-
由 huangjiyi 提交于
* move cuda_graph from fluid to phi * move device_memory_aligment from fluid to phi * Revert "move device_memory_aligment from fluid to phi" This reverts commit b92fcd39a0a50fdac13278f49be0237a85f3a13f. * update xpu cmake
-
由 201716010711 提交于
-
由 Netpunk 提交于
* remove bbox_util.h from phi * add file bbox_util.h * reframe bbox_util.h
-
由 Netpunk 提交于
-
由 Yiqun Liu 提交于
-
- 07 12月, 2022 5 次提交
-
-
由 Sławomir Siwek 提交于
* squeeze kernel * squeze fwd * whitespace
-
由 傅剑寒 提交于
* [Zero-Dim] Support 0D for paddle.diagflat
-
由 WangZhen 提交于
-
由 zhoutianzi666 提交于
-
由 zhangyikun02 提交于
-
- 06 12月, 2022 6 次提交
-
-
由 xiongkun 提交于
* make bilinear interpolate stable. * fix code
-
由 zyfncg 提交于
* delete Bias and ResidualData in OpMaker of conv2d * delete extra input of conv3d * refactor pass of conv_bias_fusion * fix mkldnn dependency * fix mkldnn compile * fix test_conv_bias_mkldnn_fuse_pass * police some code * remove useless log * fix analyzer_vit_ocr_tester * fix conv_activation_mkldnn_fuse_pass * fix test_analyzer_ocr * add fused_conv_sig * fix performence regression * fix performance regression
-
由 Sławomir Siwek 提交于
* remove fluid code * init * typo * fix merge conflicts
-
由 houj04 提交于
-
由 kangguangli 提交于
* remove layers.matmul in nets.py * remove layers.matmul in rnn_impl/test_quantization_pass/auto_parallel_gpt_model/test_auto_parallel_completion_gpt * remove layers.matmul in other files * fix * fix * remove layers.matmul itself * remove ref in CMakeLists.txt and tools directory * remove matmul in fluid.layers.nn.py * remove matmul in fluid.dygraph.rnn.py && resotre test_matmul_op.py * replace matmul in fluid.dygraph.rnn.py && clean api_test in test_matmul_op.py * fix error && restore empty test_auto_search_dist_matmul_op.py * fix check in test_auto_parallel_partitioner.py * fix test_dist_matmul && test_flags_mkldnn_ops_on_off * fix test_fused_attention_op_xpu.py && test_matmul_op_xpu.py * remove test_auto_search_dist_matmul_op.py * remove layers.matmul in auto_parallel_gpt_model.py && fix doc in fluid/io.py * fix for matmul_grad * fix codestyle * fix codestyle * resolve conflicts error * restore unit test file but not compiled it for later remove * fix codestyle * fix wrong unittest skip * fix unittest delete * fix scale cost * fix scale cost * resolve conflicts error * resolve conflicts error Co-authored-by: Njakpiase <jakpia21@gmail.com>
-
由 ykkk2333 提交于
* add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun * add xpu rmsprop centered, test=kunlun
-
- 05 12月, 2022 7 次提交
-
-
由 limingshu 提交于
* first commit * fix bugs according to ci * add some changes * change file name into function.cu.h * remove const_cast
-
由 Roc 提交于
-
由 huangjiyi 提交于
-
由 Ruibiao Chen 提交于
* Replace mutable_data with DeviceContext.Alloc in phi kernels * Fix CI errors * Fix CI errors * Fix CI errors, test=kunlun * Fix CI errors, test=kunlun * Handle rnn_functor * Update approvals
-
由 sneaxiy 提交于
* register more bf16 ops * update to register coresponding backward ops
-
由 heyanru 提交于
[Fluid Clean] remove nn.topk, nn.ctc_greedy_decoder, nn.im2sequence, nn.multiplex, nn.smooth_l1 (#48289)
-
由 Netpunk 提交于
* rm poly_util.h * format code * fix some problems * format code
-
- 03 12月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-