Update the markdown again.

8ec007be · Wang,Jeff · 8db3055f · 8ec007be · 8ec007be
隐藏空白更改
内联并排

Showing with 148 addition and 76 deletion

02.recognize_digits/README.md 02.recognize_digits/README.md +74 -38

02.recognize_digits/index.html 02.recognize_digits/index.html +74 -38

未找到文件。
--- a/02.recognize_digits/README.md
+++ b/02.recognize_digits/README.md
 # Recognize Digits

-The source code for this tutorial is here:  [book/recognize_digits](https://github.com/PaddlePaddle/book/tree/develop/02.recognize_digits). For instructions on getting started with Paddle, please refer to [installation instructions](https://github.com/PaddlePaddle/book/blob/develop/README.md#running-the-book).
+The source code for this tutorial is here:  [book/recognize_digits](https://github.com/PaddlePaddle/book/tree/develop/02.recognize_digits).
+For instructions on getting started with Paddle, please refer to [installation instructions](https://github.com/PaddlePaddle/book/blob/develop/README.md#running-the-book).

 ## Introduction
-When one learns to program, the first task is usually to write a program that prints "Hello World!". In Machine Learning or Deep Learning, an equivalent task is to train a model to recognize hand-written digits using the [MNIST](http://yann.lecun.com/exdb/mnist/) dataset. Handwriting recognition is a classic image classification problem. The problem is relatively easy and MNIST is a complete dataset. As a simple Computer Vision dataset, MNIST contains images of handwritten digits and their corresponding labels (Fig. 1). The input image is a $28\times28$ matrix, and the label is one of the digits from $0$ to $9$. All images are normalized, meaning that they are both rescaled and centered.
+When one learns to program, the first task is usually to write a program that prints "Hello World!".
+In Machine Learning or Deep Learning, an equivalent task is to train a model to recognize hand-written digits using the [MNIST](http://yann.lecun.com/exdb/mnist/) dataset.
+Handwriting recognition is a classic image classification problem. The problem is relatively easy and MNIST is a complete dataset.
+As a simple Computer Vision dataset, MNIST contains images of handwritten digits and their corresponding labels (Fig. 1).
+The input image is a $28\times28$ matrix, and the label is one of the digits from $0$ to $9$. All images are normalized, meaning that they are both rescaled and centered.

 <p align="center">
 <img src="image/mnist_example_image.png" width="400"><br/>
 Fig. 1. Examples of MNIST images
 </p>

-The MNIST dataset is from the [NIST](https://www.nist.gov/srd/nist-special-database-19) Special Database 3 (SD-3) and the Special Database 1 (SD-1). The SD-3 is labeled by the staff of the U.S. Census Bureau, while SD-1 is labeled by high school students. Therefore the SD-3 is cleaner and easier to recognize than the SD-1 dataset. Yann LeCun et al. used half of the samples from each of SD-1 and SD-3 to create the MNIST training set of 60,000 samples and test set of 10,000 samples. 250 annotators labeled the training set, thus guaranteed that there wasn't a complete overlap of annotators of training set and test set.
+The MNIST dataset is from the [NIST](https://www.nist.gov/srd/nist-special-database-19) Special Database 3 (SD-3) and the Special Database 1 (SD-1).
+The SD-3 is labeled by the staff of the U.S. Census Bureau, while SD-1 is labeled by high school students. Therefore the SD-3 is cleaner and easier to recognize than the SD-1 dataset.
+Yann LeCun et al. used half of the samples from each of SD-1 and SD-3 to create the MNIST training set of 60,000 samples and test set of 10,000 samples.
+250 annotators labeled the training set, thus guaranteed that there wasn't a complete overlap of annotators of training set and test set.

-The MNIST dataset has been used for evaluating many image recognition algorithms such as a single layer linear classifier, Multilayer Perceptron (MLP) and Multilayer CNN LeNet\[[1](#references)\], K-Nearest Neighbors (k-NN) \[[2](#references)\], Support Vector Machine (SVM) \[[3](#references)\], Neural Networks \[[4-7](#references)\], Boosting \[[8](#references)\] and preprocessing methods like distortion removal, noise removal, and blurring.  Among these algorithms, the *Convolutional Neural Network* (CNN) has achieved a series of impressive results in Image Classification tasks, including VGGNet, GoogLeNet, and ResNet (See [Image Classification](https://github.com/PaddlePaddle/book/tree/develop/03.image_classification) tutorial).
+The MNIST dataset has been used for evaluating many image recognition algorithms such as a single layer linear classifier,
+Multilayer Perceptron (MLP) and Multilayer CNN LeNet\[[1](#references)\], K-Nearest Neighbors (k-NN) \[[2](#references)\], Support Vector Machine (SVM) \[[3](#references)\],
+Neural Networks \[[4-7](#references)\], Boosting \[[8](#references)\] and preprocessing methods like distortion removal, noise removal, and blurring.
+Among these algorithms, the *Convolutional Neural Network* (CNN) has achieved a series of impressive results in Image Classification tasks, including VGGNet, GoogLeNet,
+and ResNet (See [Image Classification](https://github.com/PaddlePaddle/book/tree/develop/03.image_classification) tutorial).

 In this tutorial, we start with a simple **softmax** regression model and go on with MLP and CNN.  Readers will see how these methods improve the recognition accuracy step-by-step.

@@ -124,6 +136,24 @@ PaddlePaddle provides a Python module, `paddle.dataset.mnist`, which downloads a
 |t10k-labels-idx1-ubyte |  Evaluation labels | 10,000 |


+## Fluid API Overview
+
+The demo will be using the latest paddle fluid API. Fluid API is the latest Paddle API. It simplifies the model configurations without sacrifice the performance.
+We recommend using Fluid API as it is much easier to pick up.
+Here are the quick overview on the major fluid API complements.
+
+1. `inference_program`: A function that specify how to get the prediction from the data input.
+This is where you specify the network flow.
+1. `train_program`: A function that specify how to get avg_cost from `inference_program` and labels.
+This is where you specify the loss calculations.
+1. `optimizer`: Configure how to minimize the loss. Paddle supports most major optimization methods.
+1. `Trainer`: Fluid trainer manages the training process specified by the `train_program` and `optimizer`. Users can monitor the training
+progress through the `event_handler` callback function.
+1. `Inferencer`: Fluid inferencer loads the `inference_program` and the parameters trained by the Trainer.
+It then can infer the data and return prediction
+
+We will go though all of them and dig more on the configurations in this demo.
+
 ## Model Configuration

 A PaddlePaddle program starts from importing the API package:
@@ -132,8 +162,11 @@ A PaddlePaddle program starts from importing the API package:
 import paddle.fluid as fluid
 ```

-We want to use this program to demonstrate three different classifiers, each defined as a Python function. We need to feed image data to the classifier.
-PaddlePaddle provides a special layer `layer.data` for reading data. Let us create a data layer for reading images and connect it to a classification network.
+### Program Functions Configuration
+
+First, We need to setup the `inference_program` function. We want to use this program to demonstrate three different classifiers, each defined as a Python function.
+We need to feed image data to the classifier. PaddlePaddle provides a special layer `layer.data` for reading data.
+Let us create a data layer for reading images and connect it to the classification network.

 - Softmax regression: the network has a fully-connection layer with softmax activation:

@@ -146,12 +179,14 @@ def softmax_regression():
    return predict
 ```

- Multi-Layer Perceptron: this network has two hidden fully-connected layers, one with ReLU and the other with softmax activation:
+- Multi-Layer Perceptron: this network has two hidden fully-connected layers, both are using ReLU as activation functino. The output layer is using softmax activation:

 ```python
 def multilayer_perceptron():
    img = fluid.layers.data(name='img', shape=[1, 28, 28], dtype='float32')
+    # first fully-connected layer, using ReLu as its activation function
    hidden = fluid.layers.fc(input=img, size=200, act='relu')
+    # second fully-connected layer, using ReLu as its activation function
    hidden = fluid.layers.fc(input=hidden, size=200, act='relu')
    prediction = fluid.layers.fc(input=hidden, size=10, act='softmax')
    return prediction
@@ -162,6 +197,7 @@ def multilayer_perceptron():
 ```python
 def convolutional_neural_network():
    img = fluid.layers.data(name='img', shape=[1, 28, 28], dtype='float32')
+    # first conv pool
    conv_pool_1 = fluid.nets.simple_img_conv_pool(
        input=img,
        filter_size=5,
@@ -170,6 +206,7 @@ def convolutional_neural_network():
        pool_stride=2,
        act="relu")
    conv_pool_1 = fluid.layers.batch_norm(conv_pool_1)
+    # second conv pool
    conv_pool_2 = fluid.nets.simple_img_conv_pool(
        input=conv_pool_1,
        filter_size=5,
@@ -177,11 +214,14 @@ def convolutional_neural_network():
        pool_size=2,
        pool_stride=2,
        act="relu")
+    # output layer with softmax activation function. size = 10 since there are only 10 possible digits.
    prediction = fluid.layers.fc(input=conv_pool_2, size=10, act='softmax')
    return prediction
 ```

+#### Train Program Configuration
 Then we need to setup the the `train_program`. It takes the prediction from the classifier first. During the training, it will calculate the `avg_loss` from the prediction.
+Please feel free to modify the code to test different results between `softmax regression`, `mlp`, and `convolutional neural network` classifier.

 ```python
 def train_program():
@@ -190,26 +230,15 @@ def train_program():
    # predict = softmax_regression(images) # uncomment for Softmax
    # predict = multilayer_perceptron() # uncomment for MLP
    predict = convolutional_neural_network() # uncomment for LeNet5
+
+    # Calculate the cost from the prediction and label.
    cost = fluid.layers.cross_entropy(input=predict, label=label)
    avg_cost = fluid.layers.mean(cost)
    acc = fluid.layers.accuracy(input=predict, label=label)
    return [avg_cost, acc]
 ```

-Now, we need to setup the trainer. The trainer need to take in `train_program`, `place`, and `optimizer`.
-In the following `Momentum` optimizer, `momentum=0.9` means that 90% of the current momentum comes from that of the previous iteration. The learning rate relates to the speed at which the network training converges. Regularization is meant to prevent over-fitting; here we use the L2 regularization.
-
- ```python
- use_cude = False # set to True if training with GPU
- place = fluid.CUDAPlace(0) if use_cuda else fluid.CPUPlace()
- optimizer = paddle.optimizer.Momentum(
-     learning_rate=0.1 / 128.0,
-     momentum=0.9,
-     regularization=paddle.optimizer.L2Regularization(rate=0.0005 * 128))
-
-trainer = fluid.Trainer(
-    train_func=train_program, place=place, optimizer=optimizer)
- ```
+### Data Feeders Configuration

 Then we specify the training data `paddle.dataset.mnist.train()` and testing data `paddle.dataset.mnist.test()`. These two methods are *reader creators*. Once called, a reader creator returns a *reader*.  A reader is a Python method, which, once called, returns a Python generator, which yields instances of data.

@@ -227,30 +256,37 @@ test_reader = paddle.batch(
            paddle.dataset.mnist.test(), batch_size=64)
 ```

-`event_handler` is used to plot some text data when training.
+### Trainer Configuration
+
+Now, we need to setup the trainer. The trainer need to take in `train_program`, `place`, and `optimizer`.
+In the following `Momentum` optimizer, `momentum=0.9` means that 90% of the current momentum comes from that of the previous iteration. The learning rate relates to the speed at which the network training converges. Regularization is meant to prevent over-fitting; here we use the L2 regularization.

 ```python
-lists = []
+ use_cude = False # set to True if training with GPU
+ place = fluid.CUDAPlace(0) if use_cuda else fluid.CPUPlace()
+ optimizer = paddle.optimizer.Momentum(
+     learning_rate=0.1 / 128.0,
+     momentum=0.9,
+     regularization=paddle.optimizer.L2Regularization(rate=0.0005 * 128))
+
+trainer = fluid.Trainer(
+    train_func=train_program, place=place, optimizer=optimizer)
+ ```

+#### Event Handler
+
+`event_handler` is used to plot some text data when training.
+
+```python
 # Save the parameter into a directory. The Inferencer can load the parameters from it to do infer
 params_dirname = "recognize_digits_network.inference.model"

-# event handler to print the progress
 def event_handler(event):
-    if isinstance(event, paddle.event.EndIteration):
-        if event.batch_id % 100 == 0:
-            print "Pass %d, Batch %d, Cost %f, %s" % (
-                event.pass_id, event.batch_id, event.cost, event.metrics)
-    if isinstance(event, paddle.event.EndPass):
-        # save parameters
-        with open('params_pass_%d.tar' % event.pass_id, 'w') as f:
-            trainer.save_parameter_to_tar(f)
-
-        result = trainer.test(reader=train_reader)
-        print "Test with Pass %d, Cost %f, %s\n" % (
-            event.pass_id, result.cost, result.metrics)
-        lists.append((event.pass_id, result.cost,
-                      result.metrics['classification_error_evaluator']))
+    if isinstance(event, fluid.EndEpochEvent):
+        avg_cost, acc = trainer.test(
+            reader=test_reader, feed_order=['img', 'label'])
+        print("avg_cost: %s, acc: %s" % (avg_cost, acc))
+        trainer.save_params(params_dirname)
 ```

 Now that we setup the event_handler and the reader, we can start training the model. `feed_order` is used to map the data dict to the train_program

--- a/02.recognize_digits/index.html
+++ b/02.recognize_digits/index.html
@@ -42,19 +42,31 @@
 <div id="markdown" style='display:none'>
 # Recognize Digits

-The source code for this tutorial is here:  [book/recognize_digits](https://github.com/PaddlePaddle/book/tree/develop/02.recognize_digits). For instructions on getting started with Paddle, please refer to [installation instructions](https://github.com/PaddlePaddle/book/blob/develop/README.md#running-the-book).
+The source code for this tutorial is here:  [book/recognize_digits](https://github.com/PaddlePaddle/book/tree/develop/02.recognize_digits).
+For instructions on getting started with Paddle, please refer to [installation instructions](https://github.com/PaddlePaddle/book/blob/develop/README.md#running-the-book).

 ## Introduction
-When one learns to program, the first task is usually to write a program that prints "Hello World!". In Machine Learning or Deep Learning, an equivalent task is to train a model to recognize hand-written digits using the [MNIST](http://yann.lecun.com/exdb/mnist/) dataset. Handwriting recognition is a classic image classification problem. The problem is relatively easy and MNIST is a complete dataset. As a simple Computer Vision dataset, MNIST contains images of handwritten digits and their corresponding labels (Fig. 1). The input image is a $28\times28$ matrix, and the label is one of the digits from $0$ to $9$. All images are normalized, meaning that they are both rescaled and centered.
+When one learns to program, the first task is usually to write a program that prints "Hello World!".
+In Machine Learning or Deep Learning, an equivalent task is to train a model to recognize hand-written digits using the [MNIST](http://yann.lecun.com/exdb/mnist/) dataset.
+Handwriting recognition is a classic image classification problem. The problem is relatively easy and MNIST is a complete dataset.
+As a simple Computer Vision dataset, MNIST contains images of handwritten digits and their corresponding labels (Fig. 1).
+The input image is a $28\times28$ matrix, and the label is one of the digits from $0$ to $9$. All images are normalized, meaning that they are both rescaled and centered.

 <p align="center">
 <img src="image/mnist_example_image.png" width="400"><br/>
 Fig. 1. Examples of MNIST images
 </p>

-The MNIST dataset is from the [NIST](https://www.nist.gov/srd/nist-special-database-19) Special Database 3 (SD-3) and the Special Database 1 (SD-1). The SD-3 is labeled by the staff of the U.S. Census Bureau, while SD-1 is labeled by high school students. Therefore the SD-3 is cleaner and easier to recognize than the SD-1 dataset. Yann LeCun et al. used half of the samples from each of SD-1 and SD-3 to create the MNIST training set of 60,000 samples and test set of 10,000 samples. 250 annotators labeled the training set, thus guaranteed that there wasn't a complete overlap of annotators of training set and test set.
+The MNIST dataset is from the [NIST](https://www.nist.gov/srd/nist-special-database-19) Special Database 3 (SD-3) and the Special Database 1 (SD-1).
+The SD-3 is labeled by the staff of the U.S. Census Bureau, while SD-1 is labeled by high school students. Therefore the SD-3 is cleaner and easier to recognize than the SD-1 dataset.
+Yann LeCun et al. used half of the samples from each of SD-1 and SD-3 to create the MNIST training set of 60,000 samples and test set of 10,000 samples.
+250 annotators labeled the training set, thus guaranteed that there wasn't a complete overlap of annotators of training set and test set.

-The MNIST dataset has been used for evaluating many image recognition algorithms such as a single layer linear classifier, Multilayer Perceptron (MLP) and Multilayer CNN LeNet\[[1](#references)\], K-Nearest Neighbors (k-NN) \[[2](#references)\], Support Vector Machine (SVM) \[[3](#references)\], Neural Networks \[[4-7](#references)\], Boosting \[[8](#references)\] and preprocessing methods like distortion removal, noise removal, and blurring.  Among these algorithms, the *Convolutional Neural Network* (CNN) has achieved a series of impressive results in Image Classification tasks, including VGGNet, GoogLeNet, and ResNet (See [Image Classification](https://github.com/PaddlePaddle/book/tree/develop/03.image_classification) tutorial).
+The MNIST dataset has been used for evaluating many image recognition algorithms such as a single layer linear classifier,
+Multilayer Perceptron (MLP) and Multilayer CNN LeNet\[[1](#references)\], K-Nearest Neighbors (k-NN) \[[2](#references)\], Support Vector Machine (SVM) \[[3](#references)\],
+Neural Networks \[[4-7](#references)\], Boosting \[[8](#references)\] and preprocessing methods like distortion removal, noise removal, and blurring.
+Among these algorithms, the *Convolutional Neural Network* (CNN) has achieved a series of impressive results in Image Classification tasks, including VGGNet, GoogLeNet,
+and ResNet (See [Image Classification](https://github.com/PaddlePaddle/book/tree/develop/03.image_classification) tutorial).

 In this tutorial, we start with a simple **softmax** regression model and go on with MLP and CNN.  Readers will see how these methods improve the recognition accuracy step-by-step.

@@ -166,6 +178,24 @@ PaddlePaddle provides a Python module, `paddle.dataset.mnist`, which downloads a
 |t10k-labels-idx1-ubyte |  Evaluation labels | 10,000 |


+## Fluid API Overview
+
+The demo will be using the latest paddle fluid API. Fluid API is the latest Paddle API. It simplifies the model configurations without sacrifice the performance.
+We recommend using Fluid API as it is much easier to pick up.
+Here are the quick overview on the major fluid API complements.
+
+1. `inference_program`: A function that specify how to get the prediction from the data input.
+This is where you specify the network flow.
+1. `train_program`: A function that specify how to get avg_cost from `inference_program` and labels.
+This is where you specify the loss calculations.
+1. `optimizer`: Configure how to minimize the loss. Paddle supports most major optimization methods.
+1. `Trainer`: Fluid trainer manages the training process specified by the `train_program` and `optimizer`. Users can monitor the training
+progress through the `event_handler` callback function.
+1. `Inferencer`: Fluid inferencer loads the `inference_program` and the parameters trained by the Trainer.
+It then can infer the data and return prediction
+
+We will go though all of them and dig more on the configurations in this demo.
+
 ## Model Configuration

 A PaddlePaddle program starts from importing the API package:
@@ -174,8 +204,11 @@ A PaddlePaddle program starts from importing the API package:
 import paddle.fluid as fluid
 ```

-We want to use this program to demonstrate three different classifiers, each defined as a Python function. We need to feed image data to the classifier.
-PaddlePaddle provides a special layer `layer.data` for reading data. Let us create a data layer for reading images and connect it to a classification network.
+### Program Functions Configuration
+
+First, We need to setup the `inference_program` function. We want to use this program to demonstrate three different classifiers, each defined as a Python function.
+We need to feed image data to the classifier. PaddlePaddle provides a special layer `layer.data` for reading data.
+Let us create a data layer for reading images and connect it to the classification network.

 - Softmax regression: the network has a fully-connection layer with softmax activation:

@@ -188,12 +221,14 @@ def softmax_regression():
    return predict
 ```

- Multi-Layer Perceptron: this network has two hidden fully-connected layers, one with ReLU and the other with softmax activation:
+- Multi-Layer Perceptron: this network has two hidden fully-connected layers, both are using ReLU as activation functino. The output layer is using softmax activation:

 ```python
 def multilayer_perceptron():
    img = fluid.layers.data(name='img', shape=[1, 28, 28], dtype='float32')
+    # first fully-connected layer, using ReLu as its activation function
    hidden = fluid.layers.fc(input=img, size=200, act='relu')
+    # second fully-connected layer, using ReLu as its activation function
    hidden = fluid.layers.fc(input=hidden, size=200, act='relu')
    prediction = fluid.layers.fc(input=hidden, size=10, act='softmax')
    return prediction
@@ -204,6 +239,7 @@ def multilayer_perceptron():
 ```python
 def convolutional_neural_network():
    img = fluid.layers.data(name='img', shape=[1, 28, 28], dtype='float32')
+    # first conv pool
    conv_pool_1 = fluid.nets.simple_img_conv_pool(
        input=img,
        filter_size=5,
@@ -212,6 +248,7 @@ def convolutional_neural_network():
        pool_stride=2,
        act="relu")
    conv_pool_1 = fluid.layers.batch_norm(conv_pool_1)
+    # second conv pool
    conv_pool_2 = fluid.nets.simple_img_conv_pool(
        input=conv_pool_1,
        filter_size=5,
@@ -219,11 +256,14 @@ def convolutional_neural_network():
        pool_size=2,
        pool_stride=2,
        act="relu")
+    # output layer with softmax activation function. size = 10 since there are only 10 possible digits.
    prediction = fluid.layers.fc(input=conv_pool_2, size=10, act='softmax')
    return prediction
 ```

+#### Train Program Configuration
 Then we need to setup the the `train_program`. It takes the prediction from the classifier first. During the training, it will calculate the `avg_loss` from the prediction.
+Please feel free to modify the code to test different results between `softmax regression`, `mlp`, and `convolutional neural network` classifier.

 ```python
 def train_program():
@@ -232,26 +272,15 @@ def train_program():
    # predict = softmax_regression(images) # uncomment for Softmax
    # predict = multilayer_perceptron() # uncomment for MLP
    predict = convolutional_neural_network() # uncomment for LeNet5
+
+    # Calculate the cost from the prediction and label.
    cost = fluid.layers.cross_entropy(input=predict, label=label)
    avg_cost = fluid.layers.mean(cost)
    acc = fluid.layers.accuracy(input=predict, label=label)
    return [avg_cost, acc]
 ```

-Now, we need to setup the trainer. The trainer need to take in `train_program`, `place`, and `optimizer`.
-In the following `Momentum` optimizer, `momentum=0.9` means that 90% of the current momentum comes from that of the previous iteration. The learning rate relates to the speed at which the network training converges. Regularization is meant to prevent over-fitting; here we use the L2 regularization.
-
- ```python
- use_cude = False # set to True if training with GPU
- place = fluid.CUDAPlace(0) if use_cuda else fluid.CPUPlace()
- optimizer = paddle.optimizer.Momentum(
-     learning_rate=0.1 / 128.0,
-     momentum=0.9,
-     regularization=paddle.optimizer.L2Regularization(rate=0.0005 * 128))
-
-trainer = fluid.Trainer(
-    train_func=train_program, place=place, optimizer=optimizer)
- ```
+### Data Feeders Configuration

 Then we specify the training data `paddle.dataset.mnist.train()` and testing data `paddle.dataset.mnist.test()`. These two methods are *reader creators*. Once called, a reader creator returns a *reader*.  A reader is a Python method, which, once called, returns a Python generator, which yields instances of data.

@@ -269,30 +298,37 @@ test_reader = paddle.batch(
            paddle.dataset.mnist.test(), batch_size=64)
 ```

-`event_handler` is used to plot some text data when training.
+### Trainer Configuration
+
+Now, we need to setup the trainer. The trainer need to take in `train_program`, `place`, and `optimizer`.
+In the following `Momentum` optimizer, `momentum=0.9` means that 90% of the current momentum comes from that of the previous iteration. The learning rate relates to the speed at which the network training converges. Regularization is meant to prevent over-fitting; here we use the L2 regularization.

 ```python
-lists = []
+ use_cude = False # set to True if training with GPU
+ place = fluid.CUDAPlace(0) if use_cuda else fluid.CPUPlace()
+ optimizer = paddle.optimizer.Momentum(
+     learning_rate=0.1 / 128.0,
+     momentum=0.9,
+     regularization=paddle.optimizer.L2Regularization(rate=0.0005 * 128))
+
+trainer = fluid.Trainer(
+    train_func=train_program, place=place, optimizer=optimizer)
+ ```

+#### Event Handler
+
+`event_handler` is used to plot some text data when training.
+
+```python
 # Save the parameter into a directory. The Inferencer can load the parameters from it to do infer
 params_dirname = "recognize_digits_network.inference.model"

-# event handler to print the progress
 def event_handler(event):
-    if isinstance(event, paddle.event.EndIteration):
-        if event.batch_id % 100 == 0:
-            print "Pass %d, Batch %d, Cost %f, %s" % (
-                event.pass_id, event.batch_id, event.cost, event.metrics)
-    if isinstance(event, paddle.event.EndPass):
-        # save parameters
-        with open('params_pass_%d.tar' % event.pass_id, 'w') as f:
-            trainer.save_parameter_to_tar(f)
-
-        result = trainer.test(reader=train_reader)
-        print "Test with Pass %d, Cost %f, %s\n" % (
-            event.pass_id, result.cost, result.metrics)
-        lists.append((event.pass_id, result.cost,
-                      result.metrics['classification_error_evaluator']))
+    if isinstance(event, fluid.EndEpochEvent):
+        avg_cost, acc = trainer.test(
+            reader=test_reader, feed_order=['img', 'label'])
+        print("avg_cost: %s, acc: %s" % (avg_cost, acc))
+        trainer.save_params(params_dirname)
 ```

 Now that we setup the event_handler and the reader, we can start training the model. `feed_order` is used to map the data dict to the train_program