## Reproduce DQN with PARL
This example is a simple demonstration of DQN built on PARL.

+ Paper: DQN in [Human-level Control Through Deep Reinforcement Learning](http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html)
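
As a rough illustration of the idea in the paper, the one-step target that DQN regresses its Q-values toward can be sketched in plain Python. This is a minimal sketch and not PARL's implementation; the function name `dqn_targets` and its arguments are hypothetical, and `next_q_values` is assumed to hold the target network's Q-values for each next state.

```python
# Illustrative sketch only; names here are hypothetical and not part of PARL.
def dqn_targets(rewards, next_q_values, dones, gamma=0.99):
    """Return one-step TD targets: r + gamma * max_a' Q_target(s', a').

    For terminal transitions (done is True) the bootstrap term is dropped,
    so the target is just the reward.
    """
    targets = []
    for reward, q_next, done in zip(rewards, next_q_values, dones):
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(reward + bootstrap)
    return targets


# Two transitions with two actions each (CartPole has two actions).
targets = dqn_targets(
    rewards=[1.0, 1.0],
    next_q_values=[[0.5, 2.0], [3.0, 1.0]],
    dones=[False, True],
    gamma=0.9,
)
print(targets)
```

The first transition bootstraps from the larger next-state Q-value, while the second is terminal and keeps only its reward.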

### Result

Performance of DQN playing CartPole-v0

<p align="left">
<img src="../QuickStart/performance.gif" alt="result" height="175"/>
<img src="cartpole.jpg" alt="result" height="175"/>
</p>

## How to use
### Dependencies:
+ [paddlepaddle>=1.6.1](https://github.com/PaddlePaddle/Paddle)
+ [parl](https://github.com/PaddlePaddle/PARL)
+ gym
+ tqdm
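
Before training, it can help to confirm that the dependencies above are importable. The following is a small sketch, assuming the usual import names (`paddle` for the paddlepaddle package, plus `parl`, `gym`, and `tqdm`); the helper `find_missing` is hypothetical, and it does not check the `paddlepaddle>=1.6.1` version requirement.

```python
import importlib.util

# Import names assumed for the packages listed above
# (the paddlepaddle package is imported as `paddle`).
REQUIRED_MODULES = ["paddle", "parl", "gym", "tqdm"]


def find_missing(modules):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All dependencies found.")
```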


### Start Training:
```
# Train a model on the CartPole-v0 game
python train.py
```

## DQN-Variants

For DQN variants such as Double DQN and Dueling DQN, see the [DQN_variant examples](https://github.com/PaddlePaddle/PARL/tree/develop/examples/DQN_variant).