-
由 Hongsheng Zeng 提交于
* DQN Atari based on RLLab * DQN Atari based on RLLab * DQN Atari based on RLLab * refactor code without RLLab framework * add module of saving and loading policy model * refactor code structure, add DoubleDQN and DuelingDQN modules * add fluid argmax, flatten utils * update README.md * udpdate replay memory * udpdate replay memory * update readme, code clean * clean code and fix codestyle * fix codestyle * update README.md * revisions|history->context, randint->random * revisions| add comment for max-pooling operation in atari
3ccb855b