README.md 825 字节
Newer Older
H
Hongsheng Zeng 已提交
1
## Reproduce DQN with PARL
L
LI Yunxiang 已提交
2
Based on PARL, we provide a simple demonstration of DQN.
H
Hongsheng Zeng 已提交
3

R
rical730 已提交
4
> DQN in
H
Hongsheng Zeng 已提交
5 6
[Human-level Control Through Deep Reinforcement Learning](http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html)

L
LI Yunxiang 已提交
7
### Result
H
Hongsheng Zeng 已提交
8

L
LI Yunxiang 已提交
9
Performance of DQN playing CartPole-v0
10

L
LI Yunxiang 已提交
11 12 13
<p align="left">
<img src="../QuickStart/performance.gif" alt="result" height="175"/>
<img src="cartpole.jpg" alt="result" height="175"/>
L
LI Yunxiang 已提交
14
</p>
H
Hongsheng Zeng 已提交
15 16 17

## How to use
### Dependencies:
B
Bo Zhou 已提交
18
+ [paddlepaddle>=1.6.1](https://github.com/PaddlePaddle/Paddle)
H
Hongsheng Zeng 已提交
19
+ [parl](https://github.com/PaddlePaddle/PARL)
H
Hongsheng Zeng 已提交
20 21 22 23 24 25
+ gym
+ tqdm


### Start Training:
```
L
LI Yunxiang 已提交
26
# To train a model for CartPole-v0 game
L
LI Yunxiang 已提交
27
python train.py
H
Hongsheng Zeng 已提交
28
```
L
LI Yunxiang 已提交
29 30 31 32

## DQN-Variants

For DQN variants such as Double DQN and Dueling DQN, please check [here](https://github.com/PaddlePaddle/PARL/tree/develop/examples/DQN_variant)