提交 · de84714ee580ab70d661d1505ca4dbc7b7d159a9 · PaddlePaddle / models

28 6月, 2018 2 次提交
- Z
  
  Update README · de84714e
  由 zenghsh3 提交于 6月 28, 2018
  
  de84714e
- Z
  
  update README · 4a9aed3d
  由 zenghsh3 提交于 6月 28, 2018
  
  4a9aed3d
26 6月, 2018 2 次提交
- Z
  
  Fix with pre-commit · 2448dd5e
  由 zenghsh3 提交于 6月 26, 2018
  
  2448dd5e
- Z
  
  update README · 2121b938
  由 zenghsh3 提交于 6月 26, 2018
  
  2121b938
25 6月, 2018 3 次提交
- Z
  
  Update README of English version · d8ec4424
  由 zenghsh3 提交于 6月 25, 2018
  
  d8ec4424
- T
  
  Update README.md · 202a48ec
  由 TomorrowIsAnOtherDay 提交于 6月 25, 2018
  
  202a48ec
- Z
  
  Update README · f6472390
  由 zenghsh3 提交于 6月 25, 2018
  
  f6472390
19 6月, 2018 3 次提交
- Z
  
  update README.MD · 948553b4
  由 zenghsh3 提交于 6月 19, 2018
  
  948553b4
- Z
  
  update README.MD · cbe14ab7
  由 zenghsh3 提交于 6月 19, 2018
  
  cbe14ab7
- R
  
  add saved model of pong and breakout, update README.MD · 7cd8ee2d
  由 robot 提交于 6月 19, 2018
  
  7cd8ee2d
14 6月, 2018 1 次提交

Update fluid/DeepQNetwork models with Atari environment (#981) · 3ccb855b

由 Hongsheng Zeng 提交于 6月 14, 2018

* DQN Atari based on RLLab

* DQN Atari based on RLLab

* DQN Atari based on RLLab

* refactor code without RLLab framework

* add module of saving and loading policy model

* refactor code structure, add DoubleDQN and DuelingDQN modules

* add fluid argmax, flatten utils

* update README.md

* udpdate replay memory

* udpdate replay memory

* update readme, code clean

* clean code and fix codestyle

* fix codestyle

* update README.md

* revisions|history->context, randint->random

* revisions| add comment for max-pooling operation in atari

3ccb855b

15 5月, 2018 1 次提交

【Fluid models】implement DQN model (#889) · bdc13f2f

由 TomorrowIsAnOtherDay 提交于 5月 15, 2018

* [DQN]source code commit

* Update README.md

* Update README.md

* add mountain-car curve

* Update README.md

* Update README.md

* clean code

* fix code style

* [fix code style]/2

* remove some tensorflow package

* a better way to sample from replay memory

* code style

bdc13f2f

PaddlePaddle / models 大约 1 年 前同步成功

PaddlePaddle / models
大约 1 年前同步成功