- 31 8月, 2023 1 次提交
-
-
由 Tian Zheng 提交于
* Add fused_scale_bias_relu_conv_bnstats op * Review changes * Fix no CUDNN Frontend build * Fix PADDLE_ENFORCE format * Fix PADDLE_ENFORCE CI error * Rename kernel filename * Refactor unittest to use paddle eager_op_test * Fix padding bugs * Review changes * test=cuda117 * test=cuda117
-
- 30 8月, 2023 2 次提交
-
-
由 huangjiyi 提交于
* update * repalce gflags header * replace DEFINE_<type> with PD_DEFINE_<type> * fix bug * fix bug * fix bug * update cmake * add :: before some paddle namespace * fix link error * fix CI-Py3 * allow commandline parse * fix SetFlagsFromEnv * fix bug * fix bug * fix CI-CINN * fix CI-Coverage-build * fix CI-Windows-build * fix CI-Inference * fix bug * fix bug * fix CI-CINN * fix inference api test * fix infer_ut test * revert infer_ut gflags usage * update * fix inference * remove flags export macro * revert inference demo_ci gflags usage * update * update * update * update * update * update * update * update * fix bug when turn on WITH_GFLAGS * turn on WITH_GFLAGS * fix bug when turn on WITH_GFLAGS * fix bug when turn on WITH_GFLAGS * update * update and add unittest * add unittest * fix conflict * rerun ci * update * resolve conflict
-
由 ronnywang 提交于
-
- 14 7月, 2023 1 次提交
-
-
由 Tian Zheng 提交于
* Update CUDNN Frontend API to v0.9.1 - Remove old patches - Remove workarounds that are no longer needed * Fix test_switch_autotune
-
- 30 5月, 2023 1 次提交
-
-
由 shaojie_wang 提交于
* softmax fwd: force vec size to 1 when dtype is float * use 1024 as threshold to use cudnn
-
- 10 5月, 2023 1 次提交
-
-
由 Zhang Zheng 提交于
-
- 13 4月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h * Add logging.h for profiler.cc * Add logging.h for gloo_utils.h * Add logging.h for addmm_kernel_impl.h * Add logging.h for addmm_grad_kernel_impl.h * Add logging.h for p_send_kernel.cu * Add logging.h for determinant_grad_kernel_impl.h * Add logging.h for p_recv_kernel.cu * Add logging.h for elementwise_grad_base.h * Add logging.h for transfer_layout_kernel.cc * Add logging.h for eigvals_kernel.cc and index_select_impl.h * Add logging.h for all files in kernel directory * Add logging.h for xpu_info.cc * Add logging.h for xpu
-
- 06 4月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
* Rename conv2d transpose grad grad * fix
-
- 31 3月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
-
- 28 3月, 2023 1 次提交
-
-
由 wangxinxin08 提交于
* add unittest for conv2d/depthwise_conv2d/conv2d_transpose * add bf16 for DWConv and ConvTranspose * fix unitest of conv2d_transpose * modify DWConv2d op and unittest * fix unittest of conv2d_transpose_bf16 * modify unittest name according to review * modify atol of DWConv2D unittest
-
- 21 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
- 16 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* remove fluid thread_data_registry * update * fix bug
-
- 06 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
- 03 3月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs
-
- 06 2月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
-
- 20 12月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* move dropout_impl from fluid to phi * move cuda_graph_with_memory_pool from fluid to phi * update namespace * remove cuad_graph in fluid * fix mac-build * fix bugs * correct CodeStyle * fix mac-build * fix mutable_data * fix stl include * fix copy param
-
- 09 12月, 2022 1 次提交
-
-
由 PuQing 提交于
-
- 08 12月, 2022 2 次提交
- 28 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* decouple cudnn_desc.h from fluid * move cudnn_desc.h from fluid to phi * fix bugs * decouple cudnn_helper.h from fluid * fix bugs * move cudnn_helper.h from fluid to phi * add fluid cudnn_helper.h * move miopen_desc.h from fluid to phi * move miopen_helper.h from fluid to phi * fix bugs * move gpu_dnn.h from fluid to phi * fix bugs * update copyright year * simplify gpu_dnn.h in fluid * fix bugs * fix xpu build bug * fix compile bug * fix bug
-
- 24 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* rm dependence to "convert_utils.h" in some files * fix bugs * replace DataType2String with DataTypeToString * replace framework::DataTypeSize with phi::SizeOf * mv convert_function from fluid to phi and rm old map * recommit with pre-commit * repalce ProtoVarType with ProtoDataType and update comment. * fix error about include "dnnl.hpp" * revert add dep mkldnn to convert_utils in phi * add mkldnn deps in convert_utils.h in phi * move deps to convert_utils.h in phi
-
- 22 11月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 18 11月, 2022 2 次提交
-
-
由 huangjiyi 提交于
* move "paddle/phi/backends/gpu/gpu_device_function.h" to phi * update copyright years * rm "fluid/platform/device/gpu/gpu_device_function.h" in phi * fix rocm-complie bugs
-
由 Tian Zheng 提交于
* Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation * Fix macro * Add implementation for conv_kernel and conv_grad_kernel * Modification after rebase onto latest develop * Modify plan cache to comply with the API of phi::autotune * Refactor to reduce duplicate code * Review fix: - move functions in conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernelk.cu - add const specifier for input tensor - add logging when plans fail to execute - move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h * - move plan building outside of cache * Fix ROCM build
-
- 16 11月, 2022 1 次提交
-
-
由 Wang Xin 提交于
-
- 11 11月, 2022 1 次提交
-
-
由 Wang Xin 提交于
-
- 10 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and "paddle/fluid/platform/for_range.h" in phi. (#47817) * rm "paddle/fluid/platform/float16.h" in phi * rm "paddle/fluid/platform/for_range.h" in phi
-
- 09 11月, 2022 1 次提交
-
-
由 huangjiyi 提交于
* rm "paddle/fluid/platform/dynload/cudnn.h" in phi * rm "paddle/fluid/platform/dynload/mklml.h" in phi * rm "paddle/fluid/platform/dynload/rocblas.h" in phi * replace "paddle::platform::dynload::" with "phi::dynload::" in phi * revert "blas_impl.cu.h"
-
- 08 11月, 2022 1 次提交
-
-
由 jzhang533 提交于
* removing dependent to fluid/framework/eigen.h in phi * more fix according to PR-CI-Py3 fail
-
- 07 11月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
* Define ConvRunner to wrapper the call of cudnn conv functions. * Use ConvKind in SearchAlgorithm.
-
- 02 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* Standardise batch norm * standardize conv3d and depwise_conv2d * fix ci bugs
-
- 01 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 25 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 24 10月, 2022 2 次提交
- 19 10月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
Enable to record whether the conv algo is got by exhaustive search to fix autotune cache bug. (#47065)
-
- 13 10月, 2022 1 次提交
-
-
由 carryyu 提交于
-
- 29 9月, 2022 1 次提交
-
-
由 carryyu 提交于
-
- 09 9月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* fix softmax int64 * follow comments
-
- 07 9月, 2022 1 次提交
-
-
由 WangZhen 提交于
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
-