1. 25 6月, 2018 2 次提交
  2. 19 6月, 2018 3 次提交
  3. 14 6月, 2018 1 次提交
    • H
      Update fluid/DeepQNetwork models with Atari environment (#981) · 3ccb855b
      Hongsheng Zeng 提交于
      * DQN Atari based on RLLab
      
      * DQN Atari based on RLLab
      
      * DQN Atari based on RLLab
      
      * refactor code without RLLab framework
      
      * add module of saving and loading policy model
      
      * refactor code structure, add DoubleDQN and DuelingDQN modules
      
      * add fluid argmax, flatten utils
      
      * update README.md
      
      * udpdate replay memory
      
      * udpdate replay memory
      
      * update readme, code clean
      
      * clean code and fix codestyle
      
      * fix codestyle
      
      * update README.md
      
      * revisions|history->context, randint->random
      
      * revisions| add comment for max-pooling operation in atari
      3ccb855b
  4. 15 5月, 2018 1 次提交
    • T
      【Fluid models】implement DQN model (#889) · bdc13f2f
      TomorrowIsAnOtherDay 提交于
      * [DQN]source code commit
      
      * Update README.md
      
      * Update README.md
      
      * add mountain-car curve
      
      * Update README.md
      
      * Update README.md
      
      * clean code
      
      * fix code style
      
      * [fix code style]/2
      
      * remove some tensorflow package
      
      * a better way to sample from replay memory
      
      * code style
      bdc13f2f