add 2x model (#1450)

* add 2x model * refine README

add 2x model (#1450)
* add 2x model * refine README
d8544499 · jerrywgz · qingqing01 · dcea1e9f · d8544499 · d8544499
5 changed file
--- a/fluid/PaddleCV/faster_rcnn/README.md
+++ b/fluid/PaddleCV/faster_rcnn/README.md
@@ -90,21 +90,11 @@ To train the model, [cocoapi](https://github.com/cocodataset/cocoapi) is needed.

 *  Use momentum optimizer with momentum=0.9.
 *  Weight decay is 0.0001.
-*  In first 500 iteration, the learning rate increases linearly from 0.00333 to 0.01. Then lr is decayed at 120000, 160000 iteration with multiplier 0.1, 0.01. The maximum iteration is 180000.
+*  In first 500 iteration, the learning rate increases linearly from 0.00333 to 0.01. Then lr is decayed at 120000, 160000 iteration with multiplier 0.1, 0.01. The maximum iteration is 180000. Also, we released a 2x model which has 360000 iterations and lr is decayed at 240000, 320000. These configuration can be set by max_iter and lr_steps in config.py.
 *  Set the learning rate of bias to two times as global lr in non basic convolutional layers.
 *  In basic convolutional layers, parameters of affine layers and res body do not update.
 *  Use Nvidia Tesla V100 8GPU, total time for training is about 40 hours.

-Training result is shown as below：
-<p align="center">
-<img src="image/train_loss.jpg" height=500 width=650 hspace='10'/> <br />
-Faster RCNN train loss
-</p>
-
-* Fluid RoIPool minibatch padding: Use RoIPool. Images in one batch padding to the same size. This method is same as detectron.
-* Fluid RoIpool no padding: Use RoIPool. Images without padding.
-* Fluid RoIAlign no padding: Use RoIAlign. Images without padding.
-
 ## Evaluation

 Evaluation is to evaluate the performance of a trained model. This sample provides `eval_coco_map.py` which uses a COCO-specific mAP metric defined by [COCO committee](http://cocodataset.org/#detections-eval).
@@ -118,20 +108,18 @@ Evaluation is to evaluate the performance of a trained model. This sample provid
 - Set ```export CUDA_VISIBLE_DEVICES=0``` to specifiy one GPU to eval.

 Evalutaion result is shown as below:
-<p align="center">
-<img src="image/mAP.jpg" height=500 width=650 hspace='10'/> <br />
-Faster RCNN mAP
-</p>

 | Model              | RoI function    | Batch size     | Max iteration    | mAP  |
 | :--------------- | :--------: | :------------:    | :------------------:    |------: |
 | [Fluid RoIPool minibatch padding](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_pool_minibatch_padding.tar.gz) | RoIPool | 8   |    180000        | 0.314 |
 | [Fluid RoIPool no padding](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_pool_no_padding.tar.gz)  | RoIPool | 8   |    180000        | 0.316 |
 | [Fluid RoIAlign no padding](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_align_no_padding.tar.gz)  | RoIAlign | 8   |    180000        | 0.345 |
+| [Fluid RoIAlign no padding 2x](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_align_no_padding_2x.tar.gz)  | RoIAlign | 8   |    360000        | 0.364 |

 * Fluid RoIPool minibatch padding: Use RoIPool. Images in one batch padding to the same size. This method is same as detectron.
 * Fluid RoIPool no padding: Images without padding.
 * Fluid RoIAlign no padding: Images without padding.
+* Fluid RoIAlign no padding 2x: Images without padding, train for 360000 iterations, learning rate is decayed at 240000, 320000.

 ## Inference and Visualization


--- a/fluid/PaddleCV/faster_rcnn/README_cn.md
+++ b/fluid/PaddleCV/faster_rcnn/README_cn.md
@@ -81,20 +81,10 @@ Faster RCNN 目标检测模型
 * RPN选择anchor时，rpn\_fg\_fraction=0.5，rpn\_positive\_overlap=0.7，rpn\_negative\_overlap=0.3


-下图为模型训练结果：
-<p align="center">
-<img src="image/train_loss.jpg" height=500 width=650 hspace='10'/> <br />
-Faster RCNN 训练loss
-</p>
-
-* Fluid RoIPool minibatch padding: 使用RoIPool，同一个batch内的图像填充为相同尺寸。该方法与detectron处理相同。
-* Fluid RoIPool no padding: 使用RoIPool，不对图像做填充处理。
-* Fluid RoIAlign no padding: 使用RoIAlign，不对图像做填充处理。
-
 **训练策略：**

 *  采用momentum优化算法训练Faster RCNN，momentum=0.9。
-*  权重衰减系数为0.0001，前500轮学习率从0.00333线性增加至0.01。在120000，160000轮时使用0.1,0.01乘子进行学习率衰减，最大训练180000轮。
+*  权重衰减系数为0.0001，前500轮学习率从0.00333线性增加至0.01。在120000，160000轮时使用0.1,0.01乘子进行学习率衰减，最大训练180000轮。同时我们也提供了2x模型，该模型采用更多的迭代轮数进行训练，训练360000轮，学习率在240000，320000轮衰减，其他参数不变，训练最大轮数和学习率策略可以在config.py中对max_iter和lr_steps进行设置。
 *  非基础卷积层卷积bias学习率为整体学习率2倍。
 *  基础卷积层中，affine_layers参数不更新，res2层参数不更新。
 *  使用Nvidia Tesla V100 8卡并行，总共训练时长大约40小时。
@@ -111,24 +101,21 @@ Faster RCNN 训练loss

 - 通过设置export CUDA\_VISIBLE\_DEVICES=0指定单卡GPU评估。

-下图为模型评估结果：
-<p align="center">
-<img src="image/mAP.jpg" height=500 width=650 hspace='10'/> <br />
-Faster RCNN mAP
-</p>
+下表为模型评估结果：

 | 模型                   |   RoI处理方式  | 批量大小   | 迭代次数   | mAP  |
 | :--------------- | :--------: | :------------:    | :------------------:    |------: |
 | [Fluid RoIPool minibatch padding](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_pool_minibatch_padding.tar.gz) | RoIPool | 8   |    180000        | 0.314 |
 | [Fluid RoIPool no padding](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_pool_no_padding.tar.gz)  | RoIPool | 8   |    180000        | 0.316 |
 | [Fluid RoIAlign no padding](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_align_no_padding.tar.gz)  | RoIAlign | 8   |    180000        | 0.345 |
-
+| [Fluid RoIAlign no padding 2x](http://paddlemodels.bj.bcebos.com/faster_rcnn/model_align_no_padding_2x.tar.gz)  | RoIAlign | 8   |    360000        | 0.364 |



 * Fluid RoIPool minibatch padding: 使用RoIPool，同一个batch内的图像填充为相同尺寸。该方法与detectron处理相同。
 * Fluid RoIPool no padding: 使用RoIPool，不对图像做填充处理。
 * Fluid RoIAlign no padding: 使用RoIAlign，不对图像做填充处理。
+* Fluid RoIAlign no padding 2x: 使用RoIAlign，不对图像做填充处理。训练360000轮，学习率在240000，320000轮衰减。

 ## 模型推断及可视化


--- a/fluid/PaddleCV/faster_rcnn/config.py
+++ b/fluid/PaddleCV/faster_rcnn/config.py
@@ -163,15 +163,17 @@ _C.spatial_scale = 1. / 16.
 # derived learning rate the to get the final learning rate.
 _C.learning_rate = 0.01

-# maximum number of iterations
+# maximum number of iterations, 1x: 180000, 2x:360000
 _C.max_iter = 180000
+#_C.max_iter = 360000

 # warm up to learning rate 
 _C.warm_up_iter = 500
 _C.warm_up_factor = 1. / 3.

-# lr steps_with_decay
+# lr steps_with_decay, 1x: [120000, 160000], 2x: [240000, 320000]
 _C.lr_steps = [120000, 160000]
+#_C.lr_steps = [240000, 320000]
 _C.lr_gamma = 0.1

 # L2 regularization hyperparameter

--- a/fluid/PaddleCV/faster_rcnn/image/mAP.jpg
+++ b/fluid/PaddleCV/faster_rcnn/image/mAP.jpg
--- a/fluid/PaddleCV/faster_rcnn/image/train_loss.jpg
+++ b/fluid/PaddleCV/faster_rcnn/image/train_loss.jpg