Background¶
PaddlePaddle divides the description of neural network computation into two stages: compile time and runtime. At compile time, the neural network computation is described as a ProgramDesc
whereas at runtime an Executor
interprets the ProgramDesc
to compute the operations.
PaddlePaddle uses proto message to describe compile time program because :
- The computation program description must be serializable and saved in a file.
- During distributed training, the serialized program will be sent to multiple workers. It should also be possible to break the program into different components, each of which can be executed on a different worker.
The computation Program
consists of nested Blocks
. Each Block
will consist of data(i.e. Variable
) and Operations
. The concept to represent them is in the table below.
| |compile time|runtime| |—|—|—| |Data|VarDesc(proto)|Variable(cpp)| |Operation|OpDesc(proto)|Operator(cpp)|
Definition of VarType¶
A VarDesc should have a name, type and whether or not it is persistable. The are different kinds of variable types supported in PaddlePaddle, apart from the POD_Types like: LOD_TENSOR
, SELECTED_ROWS
, FEED_MINIBATCH
, FETCH_LIST
, STEP_SCOPES
, LOD_RANK_TABLE
, LOD_TENSOR_ARRAY
, PLACE_LIST
, READER
and CHANNEL
. These are declared inside VarType
. A VarDesc
then looks as the following:
message VarDesc {
required string name = 1;
required VarType type = 2;
optional bool persistable = 3 [ default = false ];
}
Definition of TensorDesc¶
message TensorDesc {
// Should only be PODType. Is enforced in C++
required Type data_type = 1;
repeated int64 dims = 2; // [UNK, 640, 480] is saved as [-1, 640, 480]
}
The Type
here comes from the enum defined inside of VarType
:
enum Type {
// Pod Types
BOOL = 0;
INT16 = 1;
INT32 = 2;
INT64 = 3;
FP16 = 4;
FP32 = 5;
FP64 = 6;
// Other types that may need additional descriptions
LOD_TENSOR = 7;
SELECTED_ROWS = 8;
FEED_MINIBATCH = 9;
FETCH_LIST = 10;
STEP_SCOPES = 11;
LOD_RANK_TABLE = 12;
LOD_TENSOR_ARRAY = 13;
PLACE_LIST = 14;
READER = 15;
CHANNEL = 16;
}
A TensorDesc describes SelectedRows
and LoDTensor
. For details of SelectedRows
, please reference SelectedRows
.
Definition of LodTensorDesc¶
message LoDTensorDesc {
required TensorDesc tensor = 1;
optional int32 lod_level = 2 [ default = 0 ];
}
A LoDTensorDesc contains a tensor and a lod_level.
Definition of Variable in Python¶
For Variable in Python, please reference Python API
.