名称 最后提交 最后更新
..
mkldnn [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
CMakeLists.txt [pten] remove deprecated fluid op kernel for pten (#38842)
check_reduce_rank_test.cu [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
frobenius_norm_op.cc [Pten] Replace platform::Place to pten::Place. (#38899)
frobenius_norm_op.cu Delete cub_reduce.h and modified the TensorReduce to TensorReduceFunctorImpl (#38197)
frobenius_norm_op.h Add new norm api, support frobenius norm and p-order vector norm. (#23716)
logsumexp_op.cc [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
logsumexp_op.cu restruct logsumexp to speed up compiling (#27191)
logsumexp_op.h [pnorm] fix bug in fp16 & optimize memory (#39011)
logsumexp_op.part.cu restruct logsumexp to speed up compiling (#27191)
logsumexp_op_xpu.cc [XPU] Reorganize xpu device codes in platform, test=develop (#37428)
reduce_all_op.cc [Pten] Replace platform::Place to pten::Place. (#38899)
reduce_all_op.cu Add the transformop parameter in TensorReduceFunctorImpl (#38135)
reduce_all_op.h test=develop
reduce_amax_op.cc Add Amax and Amin API (#38417)
reduce_amax_op.cu Add Amax and Amin API (#38417)
reduce_amax_op.part.cu Add Amax and Amin API (#38417)
reduce_amin_op.cc Add Amax and Amin API (#38417)
reduce_amin_op.cu Add Amax and Amin API (#38417)
reduce_amin_op.part.cu Add Amax and Amin API (#38417)
reduce_any_op.cc [Pten] Replace platform::Place to pten::Place. (#38899)
reduce_any_op.cu Add the transformop parameter in TensorReduceFunctorImpl (#38135)
reduce_any_op.h test=develop
reduce_any_op_npu.cc [NPU] reorganization for device API abstraction (#37110)
reduce_any_op_npu_test.cc [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_max_op.cc Refine operator cmake (#14413)
reduce_max_op.cu Add the transformop parameter in TensorReduceFunctorImpl (#38135)
reduce_max_op.part.cu Refine operator cmake (#14413)
reduce_max_op_mlu.cc add reduce_min and reduce_max (#39899)
reduce_max_op_npu.cc [PTen]Migrate proto::VarType outside of Pten (#39411)
reduce_max_op_xpu.cc [XPU] Reorganize xpu device codes in platform, test=develop (#37428)
reduce_mean_op.cc [Phi]rm reduce infershape (#39820)
reduce_mean_op.h Support FP16 mean (#38289)
reduce_mean_op.part.cu Replace EigenBroadcast with ElementwiseBroadcast in ReduceGrad (#39255)
reduce_mean_op_mlu.cc [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_mean_op_npu.cc [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_mean_op_xpu.cc add reduce_prod_xpu. fix reduce_mean_xpu bug. (#38481)
reduce_min_max_op.h reduce compile time of amax and amin (#38534)
reduce_min_op.cc Refine operator cmake (#14413)
reduce_min_op.cu Add the transformop parameter in TensorReduceFunctorImpl (#38135)
reduce_min_op.part.cu Refine operator cmake (#14413)
reduce_min_op_mlu.cc add reduce_min and reduce_max (#39899)
reduce_min_op_npu.cc [PTen]Migrate proto::VarType outside of Pten (#39411)
reduce_op.cu.h [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_op.h [Pten->Phi PR4] Rename pten in funcs to phi (#39961)
reduce_op_function.h [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_op_xpu.h [XPU] Reorganize xpu device codes in platform, test=develop (#37428)
reduce_prod_op.cc [Pten] Replace platform::Place to pten::Place. (#38899)
reduce_prod_op.cu Add the transformop parameter in TensorReduceFunctorImpl (#38135)
reduce_prod_op.h Refine operator cmake (#14413)
reduce_prod_op.part.cu Refine operator cmake (#14413)
reduce_prod_op_npu.cc [PTen]Migrate proto::VarType outside of Pten (#39411)
reduce_prod_op_xpu.cc add reduce_prod_xpu. fix reduce_mean_xpu bug. (#38481)
reduce_sum_op.cc [Phi]rm reduce infershape (#39820)
reduce_sum_op.h [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_sum_op.part.cu [bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843)
reduce_sum_op_npu.cc [PTen->Phi PR1] Change pten dirname and namespace to phi (#39748)
reduce_sum_op_xpu.cc [XPU] Reorganize xpu device codes in platform, test=develop (#37428)
unity_build_rule.cmake [pten]rm reduce_sum and reduce_mean raw kernel (#39484)

项目简介

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

:rocket: Github 镜像仓库 :rocket:

源项目地址 :arrow_down: :arrow_down: :arrow_down:

https://github.com/paddlepaddle/paddle

发行版本

当前项目没有发行版本

贡献者 233

全部贡献者

开发语言

  • C++ 47.1 %
  • Python 43.6 %
  • Cuda 7.0 %
  • CMake 1.1 %
  • Shell 0.7 %
反馈
建议
客服 返回
顶部