From b29c45a6d2a187d1cfa8b5fe38e9c81b99ef37f2 Mon Sep 17 00:00:00 2001 From: Bo Zhou <2466956298@qq.com> Date: Tue, 13 Aug 2019 19:59:48 +0800 Subject: [PATCH] Zhoubo01 papers (#132) * Update model-based.md * Update and rename model-based.md to archive.md --- papers/{model-based.md => archive.md} | 38 +++++++++++++++++++++++++++ 1 file changed, 38 insertions(+) rename papers/{model-based.md => archive.md} (54%) diff --git a/papers/model-based.md b/papers/archive.md similarity index 54% rename from papers/model-based.md rename to papers/archive.md index d0bfedf..a0924df 100644 --- a/papers/model-based.md +++ b/papers/archive.md @@ -46,3 +46,41 @@ 12. **Learning and Policy Search in Stochastic Dynamical Systems with Bayesian Neural Networks** ICLR 2017. [paper](https://arxiv.org/abs/1605.07127) *Stefan Depeweg, José Miguel Hernández-Lobato, Finale Doshi-Velez, Steffen Udluft* + + +### Distributed Training +1. **Asynchronous Methods for Deep Reinforcement Learning** ICML 2016. [paper](https://arxiv.org/pdf/1602.01783.pdf) + + *Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu* + +2. **GA3C: GPU-based A3C for Deep Reinforcement Learning** NIPS 2016. [paper](https://www.researchgate.net/publication/310610848_GA3C_GPU-based_A3C_for_Deep_Reinforcement_Learning) + + *Iuri Frosio, Stephen Tyree Jason Clemons Jan Kautz* + +3. **Distributed Prioritized Experience Replay** ICLR 2018. [paper](https://openreview.net/pdf?id=H1Dy---0Z) + + *Dan Horgan, John Quan, David Budden, Gabriel Barth-Maron, Matteo Hessel, Hado van Hasselt, David Silver* + +4. **IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures** ICML 2018. [paper](https://arxiv.org/abs/1802.01561) + + *Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu* + +5. **Distributed Distributional Deterministic Policy Gradients** ICLR 2018. [paper](https://arxiv.org/pdf/1804.08617.pdf) + + *Gabriel Barth-Maron, Matthew W. Hoffman, David Budden, Will Dabney, Dan Horgan, Dhruva TB, Alistair Muldal, Nicolas Heess, Timothy Lillicrap* + +6. **Emergence of Locomotion Behaviours in Rich Environments** arXiv. [paper](https://arxiv.org/abs/1707.02286) + + *Nicolas Heess, Dhruva TB, Srinivasan Sriram, Jay Lemmon, Josh Merel, Greg Wayne, Yuval Tassa, Tom Erez, Ziyu Wang, S. M. Ali Eslami, Martin Riedmiller, David Silver* + +7. **Recurrent Experience Replay in Distributed Reinforcement Learning** ICLR 2019. [paper](https://openreview.net/pdf?id=r1lyTjAqYX) + + *Steven Kapturowski, Georg Ostrovski, John Quan, Remi Munos, Will Dabney* + +8. **GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning** CoRL 2018. [paper](https://arxiv.org/abs/1810.05762) + + *Jacky Liang, Viktor Makoviychuk, Ankur Handa, Nuttapong Chentanez, Miles Macklin, Dieter Fox* + +9. **SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark** CoRL 2018. [paper](https://surreal.stanford.edu/img/surreal-corl2018.pdf) + + *Linxi Fan, Yuke Zhu, Jiren Zhu, Zihua Liu, Orien Zeng, Anchit Gupta, Joan Creus-Costa, Silvio Savarese, Li Fei-Fei* -- GitLab