program.md 3.3 KB
Newer Older
Y
Yi Wang 已提交
1
# Design Doc: PaddlePaddle Programs
Y
Yi Wang 已提交
2

Y
Yi Wang 已提交
3 4 5
## Compile and Execution

A PaddlePaddle program consists of two parts -- the first generates a `ProgramDesc` protobuf message that describes the program, and the second runs this message using a C++ class `Executor`.
Y
Yi Wang 已提交
6

W
weixing02 已提交
7
A simple example PaddlePaddle program can be found in [graph.md](../others/graph.md):
Y
Yi Wang 已提交
8 9 10 11 12 13 14 15 16 17

```python
x = layer.data("images")
l = layer.data("label")
y = layer.fc(x)
cost = layer.mse(y, l)
optimize(cost)
train(cost, reader=mnist.train())
```

Y
Yi Wang 已提交
18
The first five lines of the following PaddlePaddle program generates, or, compiles, the `ProgramDesc` message.  The last line runs it.
Y
Yi Wang 已提交
19

Y
Yi Wang 已提交
20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38
## Programs and Blocks

The basic structure of a PaddlePaddle program is some nested blocks, as a C++ or Java program.

- program: some nested blocks
- [block](./block.md):
  - some local variable definitions, and
  - a sequence of operators

The concept of block comes from usual programs.  For example, the following C++ program has three blocks:

```c++
int main() { // block 0
  int i = 0;
  if (i < 10) { // block 1
    for (int j = 0; j < 10; j++) { // block 2
    }
  }
  return 0;
Y
Yi Wang 已提交
39
}
Y
Yi Wang 已提交
40 41 42 43 44 45 46 47 48 49 50
```

The following PaddlePaddle program has three blocks:

```python
import paddle as pd  // block 0

x = minibatch([10, 20, 30]) # shape=[None, 1]
y = var(1) # shape=[1], value=1
z = minibatch([10, 20, 30]) # shape=[None, 1]
cond = larger_than(x, 15) # [false, true, true]
Y
Yi Wang 已提交
51

Y
Yi Wang 已提交
52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68
ie = pd.ifelse()
with ie.true_block():  // block 1
    d = pd.layer.add_scalar(x, y)
    ie.output(d, pd.layer.softmax(d))
with ie.false_block():  // block 2
    d = pd.layer.fc(z)
    ie.output(d, d+1)
o1, o2 = ie(cond)
```

## `BlockDesc` and `ProgramDesc`

All protobuf messages are defined in `framework.proto`.

`BlockDesc` is straight-forward -- it includes local variable definitions, `vars`, and a sequence of operators, `ops`.

```protobuf
Y
Yi Wang 已提交
69
message BlockDesc {
Y
Update  
Yi Wang 已提交
70 71 72
  required int32 parent = 1;
  repeated VarDesc vars = 2;
  repeated OpDesc ops = 3;
Y
Yi Wang 已提交
73
}
Y
Yi Wang 已提交
74 75 76 77 78 79 80 81 82 83 84 85 86 87
```

The parent ID indicates the parent block so that operators in a block can refer to variables defined locally and also those defined in their ancestor blocks.

All hierarchical blocks in a program are flattened and stored in an array. The block ID is the index of the block in this array.

```protobuf
message ProgramDesc {
  repeated BlockDesc blocks = 1;
}
```


### Global Block
Y
Yi Wang 已提交
88

Y
Yi Wang 已提交
89 90 91 92 93 94 95 96 97
The global block is the first one in the above array.

## Operators that Use Blocks

In the above example, the operator `IfElseOp` has two blocks -- the true branch and the false branch.

The definition of `OpDesc` shows that an operator could have some attributes:

```protobuf
Y
Yi Wang 已提交
98 99 100 101
message OpDesc {
  AttrDesc attrs = 1;
  ...
}
Y
Yi Wang 已提交
102 103 104
```

and an attribute could be of type block, which is, in fact, a block ID as described above:
Y
Yi Wang 已提交
105

Y
Yi Wang 已提交
106
```
Y
Yi Wang 已提交
107
message AttrDesc {
Y
Yi Wang 已提交
108
  required string name = 1;
Y
Yi Wang 已提交
109

Y
Yi Wang 已提交
110 111 112 113 114 115 116 117 118
  enum AttrType {
    INT = 1,
    STRING = 2,
    ...
    BLOCK = ...
  }
  required AttrType type = 2;

  optional int32 block = 10; // when type == BLOCK
Y
Yi Wang 已提交
119 120 121 122
  ...
}
```

Y
Yi Wang 已提交
123
## InferShape
Y
Yi Wang 已提交
124 125 126 127

With this design, the InferShape function should take the following parameters:

```c++
Y
Update  
Yi Wang 已提交
128 129 130 131
void InferShape(int current_block,
                int current_operator,
                ProgramDesc* program // might change VarDesc values.
                ) {
Y
Yi Wang 已提交
132 133 134 135 136 137 138 139
  ...
}
```

where

- `current_block` indices into `ProgramDesc::blocks`,
- `current_operator` indices into `BlockDesc::ops`.