Release Log

Major Features

  • Enhanced the inference library. Better memory buffer. Added several demos.
  • Inference library added support for Anakin engine, TensorRT engine.
  • ParallelExecutor supports multi-threaded CPU training. (In addition to multi-GPU training)
  • Added mean IOU operator, argsort operator, etc. Improved L2norm operator. Added crop API.
  • Released pre-trained ResNet50, Se-Resnext50, AlexNet, etc, Enahanced Transformer, etc.
  • New data augmentation operators.
  • Major documentation and API comment improvements.
  • Enhance the continuous evaluation system.

Performance Improvements

  • More overlap of distributed training network operation with computation. ~10% improvements
  • CPU performance improvements with more MKLDNN support.

Major Bug Fixes

  • Fix memory leak issues.
  • Fix concat operator.
  • Fix ParallelExecutor input data memcpy issue.
  • Fix ParallelExecutor deadlock issue.
  • Fix distributed training client timeout.
  • Fix distributed training pserver side learning rate decay.
  • Thread-safe Scope implementation.
  • Fix some issue using memory optimizer and parallelexecutor together.

Known Issues

  • IfElse has some bugs.
  • BatchNorm is not stable if batch_size=1

项目简介

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

🚀 Github 镜像仓库 🚀

源项目地址

https://github.com/paddlepaddle/paddle

发行版本 60

PaddlePaddle 2.5.0 Release Note

全部发行版

贡献者 246

全部贡献者

开发语言

  • C++ 49.8 %
  • Python 41.0 %
  • Cuda 7.0 %
  • CMake 1.1 %
  • Shell 0.6 %