| [pytorch-faster-rcnn](https://github.com/ruotianluo/pytorch-faster-rcnn) | TITAN Xp | NA | 5-6fps |
| [pytorch-faster-rcnn](https://github.com/ruotianluo/pytorch-faster-rcnn) | TITAN Xp | NA | 6fps |
[^1]:make sure you install cupy correctly and only one program run on the GPU.
It could be even faster by removing visualization, logging, averaging loss etc.
## Install dependencies
requires python3 and PyTorch 0.3
- install PyTorch >=0.3 with GPU (code are GPU-only), refer to [official website](http://pytorch.org)
- install cupy, you can install via `pip install` but it's better to read the [docs](https://docs-cupy.chainer.org/en/latest/install.html#install-cupy-with-cudnn-and-nccl) and make sure the environ is correctly set
- install cupy, you can install via `pip install` but it's better to read the [docs](https://docs-cupy.chainer.org/en/latest/install.html#install-cupy-with-cudnn-and-nccl) and make sure the environ is correctly set
- install other dependencies: `pip install -r requirements.txt `
- Optional, but strongly recommended: build cython code `nms_gpu_post`:
```Python
cd model/utils/nms/
python3 build.py build_ext --inplace
```
- start vidom for visualize
```
nohup python3 -m visdom.server &
```
If you're in China and have encounter problem with visdom (i.e. timeout, blank screen), you may refer to [visdom issue](https://github.com/facebookresearch/visdom/issues/111#issuecomment-321743890), ~~and a temporary solution provided by me~~
If you're in China and have encounter problem with visdom (i.e. timeout, blank screen), you may refer to [visdom issue](https://github.com/facebookresearch/visdom/issues/111#issuecomment-321743890), ~~and a temporary and fast solution provided by me~~
## Demo
...
...
@@ -111,7 +126,7 @@ If you want to use torchvision pretrained model, you may skip this step.
**NOTE**, caffe pretrained model has shown slight better performance.
**NOTE**: caffe model require images in BGR 0-255, while torchvision model requires images in RGB and 0-1. See `data/dataset.py`for more detail.
**NOTE**: caffe model require images in BGR 0-255, while torchvision model requires images in RGB and 0-1. See `data/dataset.py`for more detail.
### begin training
...
...
@@ -127,7 +142,7 @@ you may refer to `config.py` for more argument.
Some Key arguments:
-`--caffe-pretrain`=True: use caffe pretrain model or use torchvision pretrained model (Default: torchvison)
-`--caffe-pretrain`: use caffe pretrain model or use torchvision pretrained model (Default: torchvison)
-`--plot-every=n`: visualize predict, loss etc every n batches.