Add Release note for v0.11.0
Created by: reyoung
We will release v0.11.0 within this week.
The difference between master
and develop
can be shown at this link. Or by the following command
git log --author='Yu Yang' master..develop
The release note draft is shown as below. Please edit this issue directly.
New Features
- Release
Fluid
API. TODO: need an introduction tofluid
. - Add C-API for model inference
- Use fluid API to create a simple GAN demo.
- Add develop guide about performance tunning.
- Add retry when download
paddle.v2.dataset
. - Linking protobuf-lite not protobuf in C++. Reduce the binary size.
- Feature Elastic Deep Learning (EDL) released.
- A new style cmake functions for Paddle. It is based on Bazel API.
- Automatically download and compile with Intel® MKLML library as CBLAS when build
WITH_MKL=ON
. -
Intel® MKL-DNN on PaddlePaddle:
- Complete 11 MKL-DNN layers: Convolution, Fully connectivity, Pooling, ReLU, Tanh, ELU, Softmax, BatchNorm, AddTo, Concat, LRN.
- Complete 3 MKL-DNN networks: VGG-19, ResNet-50, GoogleNet
- Benchmark on Intel Skylake 6148 CPU: 2~3x training speedup compared with MKLML.
- Add the
softsign
activation. - Add the dot product layer.
- Add the L2 distance layer.
- Add the sub-nested sequence layer.
- Add the kmax sequence score layer.
- Add the sequence slice layer.
- Add the row convolution layer
- Add mobile friendly webpages.
Improvements
- Build and install using a single
whl
package. - Custom evaluating in V2 API.
- Change
PADDLE_ONLY_CPU
toPADDLE_WITH_GPU
, since we will support many kinds of devices. - Remove buggy BarrierStat.
- Clean and remove unused functions in paddle::Parameter.
- Remove ProtoDataProvider.
- Huber loss supports both regression and classification.
- Add the
stride
parameter for sequence pooling layers. - Enable v2 API use cudnn batch normalization automatically.
- The BN layer's parameter can be shared by a fixed the parameter name.
- Support variable-dimension input feature for 2D convolution operation.
- Refine cmake about CUDA to automatically detect GPU architecture.
- Improved website navigation.
Bug Fixes
- Fix bug in ROI pooling. cc9a761a
- Fix AUC is zero when label is dense vector. #5274
- Fix bug in WarpCTC layer.