- 18 11月, 2022 6 次提交
-
-
由 james 提交于
* correct sync behavior for XPU distributed training XPU support event mechanism similar to cuda event, so it is advisable to use an event to sync compute/comm streams for performance. However this mechanism is never fully tested, and inconsistent loss/ending_epochs are reported. Therefore, this PR replaces event sync with stream waiting as a temporary solution. * remove compile warning
-
由 Tian Zheng 提交于
* Refactor conv_kernel and conv_grad_kernel to provide interface for CUDNNv8 implementation * Fix macro * Add implementation for conv_kernel and conv_grad_kernel * Modification after rebase onto latest develop * Modify plan cache to comply with the API of phi::autotune * Refactor to reduce duplicate code * Review fix: - move functions in conv_kernel_impl_v8.h and conv_grad_kernel_impl_v8.h to conv_kernel.cu and conv_grad_kernelk.cu - add const specifier for input tensor - add logging when plans fail to execute - move CudnnConvBwdFilterV8 and CudnnConvBwdDataV8 to conv_cudnn_frontend.h * - move plan building outside of cache * Fix ROCM build
-
由 Yuang Liu 提交于
-
由 zhangyikun02 提交于
-
由 Sylwester Fraczek 提交于
-
由 huangjiyi 提交于
-
- 17 11月, 2022 10 次提交
-
-
由 Qi Li 提交于
* [NPU] add _npu_identity op and api, test=develop * fix doc * address comments
-
由 xiongkun 提交于
-
由 huangjiyi 提交于
-
由 YuanRisheng 提交于
* standard api * fix xpu bugs
-
由 taixiurong 提交于
-
由 Wang Xin 提交于
-
由 Yiqun Liu 提交于
* Implement a common dims simplifier. * Fix the include position error. * Reduce the cpu overhead of broadcast computing.
-
由 huangjiyi 提交于
-
由 huangjiyi 提交于
* rm "paddle/fluid/operators/math.h" in phi * rm "paddle/fluid/operators/math.h" in fluit
-
由 Yuang Liu 提交于
Support bfloat16 for adamw and adam optimizer. Fit the lr for pure bf16 training with tensor fusion. (#48041) * add bfloat16 for adamw * set lr not to bfloat16 for pure bf16 training * update the logic * update the adamw optimizer * support bfloat for adam
-
- 16 11月, 2022 4 次提交
-
-
由 huangjiyi 提交于
-
由 Piotr Paturej 提交于
* Enable bf16 in oneDNN bilinear_interp kernel * Fix bilinear_interp_v2 not enabled in models * Remove unnecessary checks
-
由 ykkk2333 提交于
* add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun * embedding and embedding_grad add int32 input, test=kunlun
-
由 Wang Xin 提交于
-
- 15 11月, 2022 6 次提交
-
-
由 sneaxiy 提交于
-
由 zhouweiwei2014 提交于
-
由 Sławomir Siwek 提交于
* cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency
-
由 huangjiyi 提交于
* rm "paddle/fluid/platform/complex.h" in phi * fix codestyle with pre-commit
-
由 Wang Xin 提交于
-
由 huangjiyi 提交于
* rm "paddle/fluid/operators/elementwise/xxx.h" in phi * fix bugs * add LaunchElementwiseCudaKernel in phi * Revert "add LaunchElementwiseCudaKernel in phi" This reverts commit 588f45bbdad2372ec7bff0c567a29bff675d22e1. * rm indirect dependence to "elementwise_op_impl.cu.h" rm indirect dependence to "elementwise_op_impl.cu.h" Revert "add LaunchElementwiseCudaKernel in phi" This reverts commit 588f45bbdad2372ec7bff0c567a29bff675d22e1. add LaunchElementwiseCudaKernel in phi fix bugs * rm LaunchSameDimsElementwiseCudaKernel and LaunchElementwiseCudaKernel in phi
-
- 14 11月, 2022 1 次提交
-
-
由 cyber-pioneer 提交于
-
- 11 11月, 2022 7 次提交
-
-
由 zhouweiwei2014 提交于
-
由 Wang Xin 提交于
-
由 Wang Xin 提交于
-
由 Yiqun Liu 提交于
-
由 zhangkaihuo 提交于
-
由 huangjiyi 提交于
* rm "paddle/fluid/operators/eigen/eigen_function.h" in phi * rm "paddle/fluid/operators/elementwise/elementwise_op_function.h" in phi * Revert "rm "paddle/fluid/operators/elementwise/elementwise_op_function.h" in phi" This reverts commit c4ba51225e3652f1d80925afba406612968f0ee9.
-
由 PuQing 提交于
-
- 10 11月, 2022 6 次提交
-
-
由 zhangyikun02 提交于
-
由 Sylwester Fraczek 提交于
* migrate prelu * remove cache * review fixes
-
由 YuanRisheng 提交于
* standard api * fix sparse bugs * fix xpu bugs, test=kunlun * remove hard code for custom unittest * open ci, test=kunlun * deal with conflict
-
由 Wang Xin 提交于
* remove fluid/framework/generator.h from phi * fix PR-CI-Kunlun-KP-Build fail
-
由 PuQing 提交于
* rm fluid gpu_launch_config * fix type
-
由 huangjiyi 提交于
[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and "paddle/fluid/platform/for_range.h" in phi. (#47817) * rm "paddle/fluid/platform/float16.h" in phi * rm "paddle/fluid/platform/for_range.h" in phi
-