2018 milestones
Created by: reyoung
Fluid supports multi-GPUs and cluster, and high usability
Deadline:
KPI:
- Make fluid supports all models in PaddlePaddle/Book, PaddlePaddle/models. Complete the inference framework of Fluid on linux and mobile.
- Make Baidu teams (Speech, NLP, Image, Abacus) use fluid to train and inference models.
- The speed of training models in PaddlePaddle/book is not slower than TF in MultiGPUs and cluster.
- The memory consumption of training models in PaddlePaddle/book is not larger than TF in MultiGPUs and a cluster.
Fluid distributed computing
Deadline:
KPI:
- Make Fluid support EDL (Elastic Deep Learning). Make cluster training of Fluid can adapt to workload changes by provisioning and de-provisioning resources in an autonomic manner.
- Support model parallelism as well as data parallelism.
- Make Fluid support OpenMPI APIs to do distributed all-reduce.
- Make Fluid support GPU direct when possible.
Compatible with ONNX
Deadline:
KPI:
- Make
ProgramDesc
can be converted to ONNX model files. - Make ONNX can be converted into
ProgramDesc
, and make Fluid can train ONNX model.
Support CSP program model and imperative programming
Deadline:
KPI:
- Users only use Python as a compiler frontend and produce the
ProgramDesc
, and an interpreter will execute theProgramDesc
. - The
ProgramDesc
includesIfElse
operator andWhile
, and supportsauto diff
. - Saving and loading model, printing metrics are all configured in
ProgramDesc
. Deeply integrate withVisualDL
to give a GUI. - Support to configure CSP(
coroutines
,channel
,select
) inProgramDesc
. Use CSP to implement multi GPUs and cluster training.