C
chenzhiyu 推送
fix format 016ee9aa
28276次提交
名称 最后提交 最后更新
..
details Document transform.h and fix cpplint errors (#9913)
dynload Add padding cudnn interface (#26370)
stream Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
CMakeLists.txt Add bfloat16 data type (#25402)
bfloat16.h Add bfloat16 data type (#25402)
bfloat16_test.cc Add bfloat16 data type (#25402)
collective_helper.cc fix PADDLE_ENFORCE (#25297)
collective_helper.h refine PADDLE_ENFORCE (#25456)
cpu_helper.cc refine PADDLE_ENFORCE (#25456)
cpu_helper.h move SetNumThreads to platform
cpu_helper_test.cc move SetNumThreads to platform
cpu_info.cc Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
cpu_info.h support build on arm. test=develop (#25212)
cpu_info_test.cc Fix cpplint errors with paddle/fluid/platform/cpu_info* (#9708)
cuda_device_function.h Refine elementwise kernel. (#16952)
cuda_device_guard.cc Refine code
cuda_device_guard.h Refine code
cuda_error.proto Optimize the error messages of paddle CUDA API (#23816)
cuda_helper.h Fix index overflow bug of the CUDA kernel loop increment (#25435)
cuda_helper_test.cu Fix index overflow bug of the CUDA kernel loop increment (#25435)
cuda_primitives.h Fix/float16 style (#12446)
cuda_profiler.h Refine PADDLE_ENFORCE (#25369)
cuda_resource_pool.cc refine PADDLE_ENFORCE (#25456)
cuda_resource_pool.h add cuda resource pool for BufferedReader, test=develop (#23152)
cudnn_desc.h replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109)
cudnn_desc_test.cc polish cudnn related code and fix bug. (#15164)
cudnn_helper.h Add padding cudnn interface (#26370)
cudnn_helper_test.cc "fix link error" (#13545)
cudnn_workspace_helper.cc make_conv_workspace_size_configurable, test=develop (#20662)
cudnn_workspace_helper.h make_conv_workspace_size_configurable, test=develop (#20662)
device_code.cc refine PADDLE_ENFORCE (#25456)
device_code.h Add some check for CUDA Driver API and NVRTC (#22719)
device_code_test.cc Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144)
device_context.cc Add mechanism for blocking oneDNN cache clearing (#26502)
device_context.h Add mechanism for blocking oneDNN cache clearing (#26502)
device_context_test.cu Revert "Revert "Remove op handle lock""
device_context_xpu_test.cc support Baidu Kunlun AI Accelerator (#25959)
device_memory_aligment.cc fix PADDLE_ENFORCE (#25297)
device_memory_aligment.h Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
device_tracer.cc Refine PADDLE_ENFORCE (#25369)
device_tracer.h fix the print error of PE record_event and framework overhead in profiler test=develop (#24744)
enforce.cc Fix the grammar in copyright. (#8403)
enforce.h fix windows no execinfo.h
enforce_test.cc Refine PADDLE_ENFORCE (#25369)
error_codes.proto Enrich the type of error and declare the error type interfaces (#21024)
errors.cc Enrich the type of error and declare the error type interfaces (#21024)
errors.h polish default error msg & cublas error hint, test=develop (#22032)
errors_test.cc Enrich the type of error and declare the error type interfaces (#21024)
event.h Add pe profiler Event (#24611)
flags.cc Update the demo code and the doc of varbase.backward. (#26506)
float16.h
float16_test.cc
float16_test.cu
for_range.h
gloo_context.cc
gloo_context.h
gpu_info.cc
gpu_info.h
gpu_launch_config.h
gpu_launch_param_config.h
hostdevice.h
init.cc
init.h
init_test.cc
lock_guard_ptr.h
lodtensor_printer.cc
lodtensor_printer.h
lodtensor_printer_test.cc
macros.h
mkldnn_helper.h
mkldnn_reuse.h
monitor.cc
monitor.h
nccl_helper.h
place.cc
place.h
place_test.cc
port.h
profiler.cc
profiler.cu
profiler.h
profiler.proto
profiler_helper.h
profiler_test.cc
resource_pool.h
stream_callback_manager.cc
stream_callback_manager.h
test_limit_gpu_memory.cu
timer.cc
timer.h
timer_test.cc
transform.h
transform_test.cu
variant.h
xpu_header.h
xpu_info.cc
xpu_info.h

项目简介

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

:rocket: Github 镜像仓库 :rocket:

源项目地址 :arrow_down: :arrow_down: :arrow_down:

https://github.com/paddlepaddle/paddle

发行版本

当前项目没有发行版本

贡献者 228

全部贡献者

开发语言

  • C++ 45.8 %
  • Python 45.5 %
  • Cuda 6.4 %
  • CMake 1.1 %
  • Shell 0.7 %
反馈
建议
客服 返回
顶部