# Guide to Fluid Programming This document will instruct you to program and create a simple nueral network with Fluid API. From this guide, you will get the hang of: - Core concepts of Fluid - How to define computing process in Fluid - How to run fluid operators with executor - How to model practical problems logically - How to call API(layers, datasets, loss functions, optimization methods and so on) Before building model, you need to figure out several core concepts of Fluid at first: ## Express data with Tensor Like other mainstream frameworks, Fluid uses Tensor to hold data. All data transferred in neural network are Tensor which can simply be regarded as a multi-dimensional array. In general, the number of dimensions can be any. Tensor features its own data type and shape. Data type of each element in single Tensor is the same. And **the shape of Tensor** refers to the dimensions of Tensor. Picture below visually shows Tensor with dimension from one to six:
There are three special kinds of Tensor in Fluid: **1. Learnable parameters of models** The lifetime of learnable parameters (such as network weight, bias and so on) of model is equal to the time of training task. The parameters will be updated by optimization algorithms. We use Parameter, the derived class of Variable, to express parameters. We can create learnable parameters with `fluid.layers.create_parameter` in Fluid: ```python w = fluid.layers.create_parameter(name="w",shape=[1],dtype='float32') ``` In general, you don't need to explicitly create learnable parameters of network. Fluid encapsulates most fundamental computing modules in common networks. Take the fully connected model as a simplest example, The codes below create connection weight(W) and bias(bias) for fully connected layer with no need to explicitly call associated APIs of Parameter. ```python import paddle.fluid as fluid y = fluid.layers.fc(input=x, size=128, bias_attr=True) ``` **2. Input and Output Tensor** The input data of the whole neural network is also a special Tensor in which the sizes of some dimensions can not be decided at the definition time of models. Such dimensions usually includes batch size, or width and height of image when such data formats in a mini-batch are not constant. Placeholders for these uncertain dimension are necessary at the definition phase of model. `fluid.layers.data` is used to receive input data in Fluid, and it needs to be provided with the shape of input Tensor. When the shape is not certain, the correspondent dimension is defined as None. The code below exemplifies the usage of `fluid.layers.data` : ```python import paddle.fluid as fluid #Define the dimension of x : [3,None]. What we could make sure is that the first dimension of x is 3. #The second dimension is unknown and can only be known at runtime. x = fluid.layers.data(name="x", shape=[3,None], dtype="int64") #batch size doesn't have to be defined explicitly. #Fluid will automatically assign zeroth dimension as batch size dimension and fill right number at runtime. a = fluid.layers.data(name="a",shape=[3,4],dtype='int64') #If the width and height of image are variable, we can define the width and height as None. #The meaning of three dimensions of shape is channel, width of image, height of image respectively. b = fluid.layers.data(name="image",shape=[3,None,None],dtype="float32") ``` dtype=“int64” indicates signed int 64 bits data. For more data types supported by Fluid, please refer to [Data types currently supported by Fluid](../../user_guides/howto/prepare_data/feeding_data_en.html#fluid). **3. Constant Tensor** `fluid.layers.fill_constant` is used to define constant Tensor in Fluid. You can define the shape, data type and value of Constant Tensor. Code is as follows: ```python import paddle.fluid as fluid data = fluid.layers.fill_constant(shape=[1], value=0, dtype='int64') ``` Notice that the tensor defined above is not assigned with values. It merely represents the operation to perform. If you print data directly, you will get information about the description of this data: ```python print data ``` Output: ``` name: "fill_constant_0.tmp_0" type { type: LOD_TENSOR lod_tensor { tensor { data_type: INT64 dims: 1 } } } persistable: false ``` Specific output value will be shown at the runtime of Executor. Detailed process will be explained later. ## Feed data The method to feed data in Fluid: You need to use `fluid.layers.data` to configure data input layer and use ``executor.run(feed=...)`` to feed training data into `fluid.Executor` or `fluid.ParallelExecutor` . For specific preparation for data, please refer to [Preparation for data](../../user_guides/howto/prepare_data/index_en.html). ## Operators -- operations on data All operations on data are achieved by Operators in Fluid. To facilitate development, on Python end, Operators in Fluid are further encapsulated into `paddle.fluid.layers` , `paddle.fluid.nets` and other modules. It is because some common operations for Tensor may be composed of many fundamental operations. To make it more convenient, fundamental Operators are encapsulated in Fluid to reduce repeated coding, including the creation of learnable parameters which Operator relies on, details about initialization of learnable parameters and so on. For example, you can use `paddle.fluid.layers.elementwise_add()` to add up two input Tensor: ```python #Define network import paddle.fluid as fluid a = fluid.layers.data(name="a",shape=[1],dtype='float32') b = fluid.layers.data(name="b",shape=[1],dtype='float32') result = fluid.layers.elementwise_add(a,b) #Define Exector cpu = fluid.core.CPUPlace() #define computing place. Here we choose to train on CPU exe = fluid.Executor(cpu) #create executor exe.run(fluid.default_startup_program()) #initialize network parameters #Prepare data import numpy data_1 = int(input("Please enter an integer: a=")) data_2 = int(input("Please enter an integer: b=")) x = numpy.array([[data_1]]) y = numpy.array([[data_2]]) #Run computing outs = exe.run( feed={'a':x,'b':y}, fetch_list=[result.name]) #Verify result print "%d+%d=%d" % (data_1,data_2,outs[0][0]) ``` Output: ``` a=7 b=3 7+3=10 ``` At runtime, input a=7,b=3, and you will get output=10. You can copy the code, run it locally, input different numbers following the prompt instructions and check the computed result. If you want to get the specific value of a,b at the runtime of network, you can add variables you want to check into ``fetch_list`` . ```python ... #Run computing outs = exe.run( feed={'a':x,'b':y}, fetch_list=[a,b,result.name] #Check output print outs ``` Output: ``` [array([[7]]), array([[3]]), array([[10]])] ``` ## Use Program to describe neural network model Fluid is different from most other deep learning frameworks. In Fluid, static computing map is replaced by Program to dynamically describe the network. This dynamic method delivers both flexible modifications to network structure and convenience to build model. Moreover, the capability of expressing a model is enhanced significantly while the performance is guaranteed. All Operators will be written into Program, which will be automatically transformed into a descriptive language named ProgramDesc in Fluid. It's like to write a general program to define Program. If you are an experienced developer, you can naturally apply the knowledge you have acquired on Fluid programming. You can describe any complex model by combining sequential processes, branches and loops supported by Fluid. **Sequential Process** You can use sequential structure to build network: ```python x = fluid.layers.data(name='x',shape=[13], dtype='float32') y_predict = fluid.layers.fc(input=x, size=1, act=None) y = fluid.layers.data(name='y', shape=[1], dtype='float32') cost = fluid.layers.square_error_cost(input=y_predict, label=y) ``` **Conditional branch——switch,if else:** Switch and if-else class are used to implement conditional branch in Fluid. You can use the structure to adjust learning rate in learning rate adapter or perform other operations : ```python lr = fluid.layers.tensor.create_global_var( shape=[1], value=0.0, dtype='float32', persistable=True, name="learning_rate") one_var = fluid.layers.fill_constant( shape=[1], dtype='float32', value=1.0) two_var = fluid.layers.fill_constant( shape=[1], dtype='float32', value=2.0) with fluid.layers.control_flow.Switch() as switch: with switch.case(global_step == zero_var): fluid.layers.tensor.assign(input=one_var, output=lr) with switch.default(): fluid.layers.tensor.assign(input=two_var, output=lr) ``` For detailed design principles of Program, please refer to [Design principle of Fluid](../../advanced_usage/design_idea/fluid_design_idea_en.html). For more about control flow in Fluid, please refer to [Control Flow](../../api/layers.html#control-flow). ## Use Executor to run Program The design principle of Fluid is similar to C++, JAVA and other advanced programming language. The execution of program is divided into two steps: compile and run. Executor accepts the defined Program and transforms it to a real executable Fluid Program at the back-end of C++. This process performed automatically is the compilation. After compilation, it needs Executor to run the compiled Fluid Program. Take add operator above as an example, you need to create an Executor to initialize and train Program after the construction of Program: ```python #define Executor cpu = fluid.core.CPUPlace() #define computing place. Here we choose training on CPU exe = fluid.Executor(cpu) #create executor exe.run(fluid.default_startup_program()) #initialize Program #train Program and start computing #feed defines the order of data transferred to network in the form of dict #fetch_list defines the output of network outs = exe.run( feed={'a':x,'b':y}, fetch_list=[result.name]) ``` ## Code example So far, you have got a primary knowledge of core concepts in Fluid. Why not try to configure a simple network ? You can finish a very simple data prediction under the guide of the part if you are interested. If you have learned this part, you can skip this section and read [What's next](#what_next). Firstly, define input data format, model structure,loss function and optimized algorithm logically. Then you need to use PaddlePaddle APIs and operators to implement the logic of model. A typical model mainly contains four parts. They are: definition of input data format; forward computing logic; loss function; optimization algorithm. 1. Problem Given a pair of data $