INSTALL.md 5.7 KB
Newer Older
Q
qingqing01 已提交
1 2
English | [简体中文](INSTALL_cn.md)

3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
# Installation

---
## Table of Contents

- [Introduction](#introduction)
- [PaddlePaddle](#paddlepaddle)
- [Other Dependencies](#other-dependencies)
- [PaddleDetection](#paddle-detection)
- [Datasets](#datasets)


## Introduction

This document covers how to install PaddleDetection, its dependencies
Q
qingqing01 已提交
18
(including PaddlePaddle), together with COCO and Pascal VOC dataset.
19

G
Guanghua Yu 已提交
20
For general information about PaddleDetection, please see [README.md](https://github.com/PaddlePaddle/PaddleDetection/blob/master/).
21 22 23 24


## PaddlePaddle

W
wangguanzhong 已提交
25
Running PaddleDetection requires PaddlePaddle Fluid v.1.6 and later. please follow the instructions in [installation document](http://www.paddlepaddle.org.cn/).
26 27 28 29 30

Please make sure your PaddlePaddle installation was successful and the version
of your PaddlePaddle is not lower than required. Verify with the following commands.

```
K
Kaipeng Deng 已提交
31
# To check PaddlePaddle installation in your Python interpreter
W
wangguanzhong 已提交
32
>>> import paddle.fluid as fluid
K
Kaipeng Deng 已提交
33
>>> fluid.install_check.run_check()
34 35 36 37 38 39 40

# To check PaddlePaddle version
python -c "import paddle; print(paddle.__version__)"
```

### Requirements:

41
- Python2 or Python3 (Only support Python3 for windows)
42 43 44 45 46 47 48 49 50
- CUDA >= 8.0
- cuDNN >= 5.0
- nccl >= 2.1.2


## Other Dependencies

[COCO-API](https://github.com/cocodataset/cocoapi):

K
Kaipeng Deng 已提交
51
COCO-API is needed for running. Installation is as follows:
52 53 54 55 56 57 58 59 60 61 62

    git clone https://github.com/cocodataset/cocoapi.git
    cd cocoapi/PythonAPI
    # if cython is not installed
    pip install Cython
    # Install into global site-packages
    make install
    # Alternatively, if you do not have permissions or prefer
    # not to install the COCO API into global site-packages
    python setup.py install --user

63 64 65 66 67 68
**Installation of COCO-API in windows:**

    # if cython is not installed
    pip install Cython
    # Because the origin version of cocoapi does not support windows, another version is used which only supports Python3
    pip install git+https://github.com/philferriere/cocoapi.git#subdirectory=PythonAPI
69 70 71 72 73

## PaddleDetection

**Clone Paddle models repository:**

74
You can clone PaddleDetection with the following commands:
75 76

```
77 78
cd <path/to/clone/PaddleDetection>
git clone https://github.com/PaddlePaddle/PaddleDetection.git
79 80 81 82
```

**Install Python dependencies:**

G
Guanghua Yu 已提交
83
Required python packages are specified in [requirements.txt](https://github.com/PaddlePaddle/PaddleDetection/blob/master/requirements.txt), and can be installed with:
84 85 86 87 88

```
pip install -r requirements.txt
```

89
**Specify the current Python path:**
90

91 92 93 94 95
```shell
# In Linux/Mac
export PYTHONPATH=$PYTHONPATH:.
# In windows
set PYTHONPATH=%PYTHONPATH%;.
96
```
97 98 99 100

**Make sure the tests pass:**

```shell
101 102 103 104 105
python ppdet/modeling/tests/test_architectures.py
```

## Datasets

Q
qingqing01 已提交
106
PaddleDetection includes support for [COCO](http://cocodataset.org) and [Pascal VOC](http://host.robots.ox.ac.uk/pascal/VOC/) by default, please follow these instructions to set up the dataset.
107 108 109

**Create symlinks for local datasets:**

Q
qingqing01 已提交
110
Default dataset path in config files is `dataset/coco` and `dataset/voc`, if the
111 112 113 114
datasets are already available on disk, you can simply create symlinks to
their directories:

```
Q
qingqing01 已提交
115 116
ln -sf <path/to/coco> <path/to/paddle_detection>/dataset/coco
ln -sf <path/to/voc> <path/to/paddle_detection>/dataset/voc
117 118
```

K
Kaipeng Deng 已提交
119 120 121 122 123 124 125
For Pascal VOC dataset, you should create file list by:

```
export PYTHONPATH=$PYTHONPATH:.
python dataset/voc/create_list.py
```

126 127 128 129
**Download datasets manually:**

On the other hand, to download the datasets, run the following commands:

Q
qingqing01 已提交
130
- COCO
131 132

```
K
Kaipeng Deng 已提交
133 134
export PYTHONPATH=$PYTHONPATH:.
python dataset/coco/download_coco.py
135 136
```

K
Kaipeng Deng 已提交
137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157
`COCO` dataset with directory structures like this:

  ```
  dataset/coco/
  ├── annotations
  │   ├── instances_train2014.json
  │   ├── instances_train2017.json
  │   ├── instances_val2014.json
  │   ├── instances_val2017.json
  │   |   ...
  ├── train2017
  │   ├── 000000000009.jpg
  │   ├── 000000580008.jpg
  │   |   ...
  ├── val2017
  │   ├── 000000000139.jpg
  │   ├── 000000000285.jpg
  │   |   ...
  |   ...
  ```

Q
qingqing01 已提交
158
- Pascal VOC
159 160

```
K
Kaipeng Deng 已提交
161 162
export PYTHONPATH=$PYTHONPATH:.
python dataset/voc/download_voc.py
K
Kaipeng Deng 已提交
163
python dataset/voc/create_list.py
164 165
```

K
Kaipeng Deng 已提交
166 167 168 169 170 171 172 173 174 175 176 177
`Pascal VOC` dataset with directory structure like this:

  ```
  dataset/voc/
  ├── train.txt
  ├── val.txt
  ├── test.txt
  ├── label_list.txt (optional)
  ├── VOCdevkit/VOC2007
  │   ├── Annotations
  │       ├── 001789.xml
  │       |   ...
W
wangguanzhong 已提交
178
  │   ├── JPEGImages
K
Kaipeng Deng 已提交
179 180 181 182 183 184 185 186
  │       ├── 001789.xml
  │       |   ...
  │   ├── ImageSets
  │       |   ...
  ├── VOCdevkit/VOC2012
  │   ├── Annotations
  │       ├── 003876.xml
  │       |   ...
W
wangguanzhong 已提交
187
  │   ├── JPEGImages
K
Kaipeng Deng 已提交
188 189 190 191 192 193 194 195 196
  │       ├── 003876.xml
  │       |   ...
  │   ├── ImageSets
  │       |   ...
  |   ...
  ```

**NOTE:** If you set `use_default_label=False` in yaml configs, the `label_list.txt`
of Pascal VOC dataset will be read, otherwise, `label_list.txt` is unnecessary and
W
wangguanzhong 已提交
197
the default Pascal VOC label list which defined in
G
Guanghua Yu 已提交
198
[voc\_loader.py](https://github.com/PaddlePaddle/PaddleDetection/blob/master/ppdet/data/source/voc.py) will be used.
K
Kaipeng Deng 已提交
199

200 201 202
**Download datasets automatically:**

If a training session is started but the dataset is not setup properly (e.g,
Q
qingqing01 已提交
203 204
not found in `dataset/coco` or `dataset/voc`), PaddleDetection can automatically
download them from [COCO-2017](http://images.cocodataset.org) and
205 206 207 208 209
[VOC2012](http://host.robots.ox.ac.uk/pascal/VOC), the decompressed datasets
will be cached in `~/.cache/paddle/dataset/` and can be discovered automatically
subsequently.


G
Guanghua Yu 已提交
210
**NOTE:** For further informations on the datasets, please see [READER.md](../advanced_tutorials/READER.md)