draft.md 3.0 KB
Newer Older
X
Xin Pan 已提交
1 2 3 4 5 6 7 8 9 10 11
## Motivation

There is a ```gap``` between the ```Program``` defined by
user and the ```Executable``` that can be scheduled
efficiently on heterogeneous hardware, either locally
or distributedly.

Usually, the ```gap``` is bridged by

* A serious transformations with defined order.

X
Xin Pan 已提交
12
* These transformations usually involve
X
Xin Pan 已提交
13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
```insert, delete, clustering, split, dependency analysis```.

* Has a simple way to verify and debug each transformation.

* Flexible to add, remove or customize transformations to fit
the requirements of various algorithms (models) and hardware secenarios.

Some other events also push us to a better unified pattern.

* The deep learning framework is built around the concepts of graphs.
To leverage tools such as compilation (e.g. TVM and nGraph) or
cross-framework conversion (e.g. ONNX), we also need a intermediate
representation that can be connected to the rest of the ecosystem.


We need a unified pattern to naturally support the requirements
described above. The pattern should fit both training, inference
and other offline serielized model transformations.
Learned from LLVM and other deep learning framework, we draft the
design below.


## Design

### Major Concepts

#### Node

```Node``` represents an operation that performs some computation or
a variable that is input or output of operation.

```Node```s are connected to other ```Node```s via inputs and outputs.

X
Xin Pan 已提交
46 47 48 49 50 51
Other properties (maybe device placement information) can be added
to ```Node``` in the future if it's a
common requirement of many other ```Pass```es. Otherwise, it should live
in a ```Node``` wrapper class that is private to some ```Pass``` or be
a local member of a ```Pass```.

X
Xin Pan 已提交
52 53
#### Graph

X
Xin Pan 已提交
54 55
```Graph``` contains a list of ```Node```s, which are connected to
each other via inputs and outputs.
X
Xin Pan 已提交
56 57 58 59 60 61

TODO: Better definitions for the graph.

```Graph``` can also contain ```Attribute```s. ```Attribute```s
can be ``any`` thing. For example, it can be a list of "wraper"
nodes. The ```wrapper``` nodes compose ```Node```s and provide
X
Xin Pan 已提交
62 63 64 65
helper method for execution or transformation. ```Attribute```
can also contain other things that describe some properties of
the ```Graph``` or ```Graph``` nodes. ```Attribute``` can be passed
across ```Pass```. However, it should be used with care.
X
Xin Pan 已提交
66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81

#### Pass

```Pass``` represents a transformation of ```Graph```. Its input
is a ```Graph``` and its output is also a ```Graph```. For example,
a ```Pass``` can simply print out the ```Graph```. A ```Pass```
can also fuse some ```Graph```'s ```Node```s.

#### Optimize

```Optimize``` contains a series of ```Pass``` with defined order.
```Optimize``` transforms a ```Graph``` that only contains raw
modeling logic to a ```Graph``` that can be run efficiently while
maintaining the original modeling logic.


X
Xin Pan 已提交
82
### Optimize Process
X
Xin Pan 已提交
83 84 85 86 87 88

* Program is first converted to Graph.
* Graph goes through a series of Pass
* Graph is transformed from raw model logic to a
form that is efficient to execute.

X
Xin Pan 已提交
89
Program->ProgramToGraph->Graph->Pass1->Graph->Pass2->Graph->Pass3->Graph->Executor