Name Last commit Last update
blas unify fluid::CUDADeviceContext and phi::GpuContext (#44723)
detail unify gpu context (#44740)
eigen Einsum grad complex (#44598)
lapack change svd_cpu_kernel from Eigen to Lapack, speed up the compile from 120s -> 20s (#43784)
sparse [Sparse] optimize sparse attention (#44743)
CMakeLists.txt transfer op multiclass_nms3 to phi (#44765)
activation_functor.h 【PFCC operator performance optimization】 SeluKernel Optimization (#44490)
adam_functors.h 【code format check upgrade】 step2:clang-format (#42840)
algorithm.h [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
aligned_vector.h 【code format check upgrade】 step2:clang-format (#42840)
axis_utils.h [Phi] Support cudnn kernel moving & move softmax kernels (#39547)
batch_norm_utils.h Change bn muable data to phi (#40748)
bitwise_functors.h 【Phi】Migrate bitwise_and/bitwise_or/bitwise_xor/bitwise_not op into phi (#40031)
broadcast_function.h Replace ReduceAmax/Amax.part.cu with KP (#43202)
common_shape.h 【Phi】Migrate triangular_solve op into phi (#40093)
compare_functors.h Move compare OPs to phi (#39970)
complex_functors.h [Phi] Unify complex type trait and fix real imag bug (#40036)
compound_functors.h [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
concat_and_split_functor.cc [phi] refine code of randint, randperm, unbind kernel (#39909)
concat_and_split_functor.cu Enable inference multi stream ci test (#44275)
concat_and_split_functor.h [phi] refine code of randint, randperm, unbind kernel (#39909)
concat_funcs.h [Phi] Support cudnn kernel moving & move softmax kernels (#39547)
cpu_vec.h [phi] move cpu_vec (#39714)
cumprod.h [phi] Transfer lgamma, kldiv_loss, isclose, cumprod kernels into phi and pass the tests of these four kernels (#39770)
data_type_transform.h [PHI] Clean glog header in public header (#44216)
deformable_conv_functor.cc
deformable_conv_functor.cu
deformable_conv_functor.h
diag_functor.h
diagonal.h
distribution_helper.h
elementwise_base.h
elementwise_functor.h
elementwise_grad_base.h
elementwise_utils.h
embedding_util.h
fc_functor.cc
fc_functor.cu
fc_functor.h
for_range.h
frame_functor.h
functors.h
gather.cu.h
gather.h
gpc.cc
gpc.h
gru_compute.cc
gru_compute.cu
gru_compute.h
inclusive_scan.h
index_impl.cu.h
interpolate_function.h
isfinite_functor.h
layer_norm_util.h
logical_functor.h
lstm_compute.cc
lstm_compute.cu
lstm_compute.h
math_cuda_utils.h
math_function.cc
math_function.cu
math_function.h
math_function_impl.h
matrix_inverse.cc
matrix_inverse.cu.cc
matrix_inverse.h
matrix_reduce.cc
matrix_reduce.cu
matrix_reduce.h
matrix_solve.cc
matrix_solve.cu
matrix_solve.h
mode.h
multinomial_functor.h
norm_utils.h
overlap_add_functor.h
padding.h
parse_qr_mode.h
pooling.cc
pooling.cu
pooling.h
range_function.h
reduce_function.h
reduce_functor.h
reduce_grad_functions.h
scatter.cu.h
scatter.h
segment_pooling.cc
segment_pooling.cu
segment_pooling.h
select_impl.cu.h
seq2col.h
sequence2batch.cc
sequence2batch.cu
sequence2batch.h
slice.h
slice_utils.h
squared_l2_norm.h
stack_functor.h
strided_slice.h
tril_triu_compute.h
unfold_functor.h
unique_functor.h
unsqueeze.h
values_vectors_functor.h

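Project description

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle 『飞桨』 core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)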

🚀 GitHub mirror repository 🚀

Source project URL

https://github.com/paddlepaddle/paddle

Releases

This project has no releases

Contributors 228

Languages

  • C++ 45.8 %
  • Python 45.5 %
  • Cuda 6.4 %
  • CMake 1.1 %
  • Shell 0.7 %