- 29 3月, 2023 1 次提交
-
-
由 chenxujun 提交于
-
- 24 3月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs * decouple memory * deal with conflict * fix xpu compile bugs * fix xpu bugs * deal with xpu bugs * fix cmake bugs * fix windows bugs * fix ci bugs * fix ci bugs * delete redundance code * add code for pybind * fix py3 bugs * fix ci bugs
-
- 21 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
- 16 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* remove fluid thread_data_registry * update * fix bug
-
- 15 3月, 2023 3 次提交
-
-
由 limingshu 提交于
-
由 iSerendipity 提交于
* Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)" This reverts commit b9d91531. * remove cout * add header * fix missing header * fix refer fluid error * fix missing header * 更新 repeat_interleave_grad_kernel_impl.h Change to phi style datatype. * 更新 repeat_interleave_grad_kernel_impl.h Fix missing header * datatype fluid -> phi * paddle::experimental -> phi * fix reference error * fix reference error * fix reference error * fix errors * fix missing FLAGS * fix missing headers * fix missing headers * fix missing headers * fix missing headers * fix missing header * fix missing header * fix errors
-
由 pangyoki 提交于
-
- 14 3月, 2023 1 次提交
-
-
由 pangyoki 提交于
* cuda graph support multi-stream for new executor * fix windows compile error * delete create_cuda_graph_stream
-
- 13 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
- 06 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
- 27 2月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 26 2月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Enable matmul + bias fusion in fused_gat_attention. * Add a variable to control whether using fused matmul + bias.
-
- 30 1月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support stream priority for standalone executor * Fix compile error * Fix compile error * Fix compile error * Fix compile error * Fix compile error
-
- 10 1月, 2023 2 次提交
- 03 1月, 2023 1 次提交
-
-
由 limingshu 提交于
-
- 20 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* move dropout_impl from fluid to phi * move cuda_graph_with_memory_pool from fluid to phi * update namespace * remove cuad_graph in fluid * fix mac-build * fix bugs * correct CodeStyle * fix mac-build * fix mutable_data * fix stl include * fix copy param
-
- 09 12月, 2022 1 次提交
-
-
由 PuQing 提交于
-
- 08 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* move cuda_graph from fluid to phi * move device_memory_aligment from fluid to phi * Revert "move device_memory_aligment from fluid to phi" This reverts commit b92fcd39a0a50fdac13278f49be0237a85f3a13f. * update xpu cmake
-
- 05 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
-
- 30 11月, 2022 1 次提交
-
-
由 Netpunk 提交于
* migrate transpose_op.cu.h and gpu_utils.h * format code style * fix some problems * format code * reset tranpose_op.cc * test commit * recover transpose_op.h * delete transpose_op.h * adjust header files order in transpose_op.cc
-
- 28 11月, 2022 2 次提交
-
-
由 huangjiyi 提交于
* decouple cudnn_desc.h from fluid * move cudnn_desc.h from fluid to phi * fix bugs * decouple cudnn_helper.h from fluid * fix bugs * move cudnn_helper.h from fluid to phi * add fluid cudnn_helper.h * move miopen_desc.h from fluid to phi * move miopen_helper.h from fluid to phi * fix bugs * move gpu_dnn.h from fluid to phi * fix bugs * update copyright year * simplify gpu_dnn.h in fluid * fix bugs * fix xpu build bug * fix compile bug * fix bug
-
由 YuanRisheng 提交于
* Fix onednn kernel bugs * fix gpu bugs
-
- 25 11月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 24 11月, 2022 1 次提交
-
-
由 PuQing 提交于
-
- 23 11月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* make bfloat16 implicit convert to float/double * fix bfloat16_test ut compile
-
- 21 11月, 2022 1 次提交
-
-
由 LiYuRio 提交于
-
- 18 11月, 2022 2 次提交
- 17 11月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add vectorized bfloat16 atomicAdd * fix compile error * fix compile error again * fix V100 compile error * fix V100 compile again
-
- 16 11月, 2022 1 次提交
-
-
由 Wang Xin 提交于
-
- 10 11月, 2022 1 次提交
-
-
由 pangyoki 提交于
change cudnn error to cuda error if compiled cuda version is incompatible with installed cuda version (#47743) * fix cudnn error * fix * fix * fix
-
- 04 11月, 2022 1 次提交
-
-
由 pangyoki 提交于
-
- 01 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 16 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* support int64 non-broadcast * support broadcast case for int64 index * fix bug * support more Arity * remove some codes * upgrade patchelf to v0.15.0 to pass CI build * fix bug * fix patchelf installation * add debug flags * remove useless codes * fix viterbi_decode and set_value op uts * remove always enable int64
-
- 06 9月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 05 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
-
- 24 8月, 2022 1 次提交
-
-
由 Rayman 提交于
* 【Hackathon No.34】优化 poisson op * [poisson] code style fix * modify code style * prevent from big number * modify code style * modify code style * modify import * modify import * modify code style
-
- 10 8月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* set cuda device before run * add header file * fix compile
-
- 05 8月, 2022 1 次提交
-
-
由 Qi Li 提交于
-