Distributed prioritized experience replay D Horgan, J Quan, D Budden, G Barth-Maron, M Hessel, H Van Hasselt, ... arXiv preprint arXiv:1803.00933, 2018 | 785 | 2018 |
Distributed distributional deterministic policy gradients G Barth-Maron, MW Hoffman, D Budden, W Dabney, D Horgan, D Tb, ... arXiv preprint arXiv:1804.08617, 2018 | 549 | 2018 |
A generalist agent S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 2022 | 490 | 2022 |
Data-efficient deep reinforcement learning for dexterous manipulation I Popov, N Heess, T Lillicrap, R Hafner, G Barth-Maron, M Vecerik, ... arXiv preprint arXiv:1704.03073, 2017 | 286 | 2017 |
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 215 | 2020 |
Observe and look further: Achieving consistent performance on atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018 | 125 | 2018 |
Making efficient use of demonstrations to solve hard exploration problems TL Paine, C Gulcehre, B Shahriari, M Denil, M Hoffman, H Soyer, ... arXiv preprint arXiv:1909.01387, 2019 | 69 | 2019 |
Goal-based action priors D Abel, D Hershkowitz, G Barth-Maron, S Brawner, K O'Farrell, ... Proceedings of the International Conference on Automated Planning and …, 2015 | 56 | 2015 |
One-shot high-fidelity imitation: Training large-scale deep nets with rl TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ... arXiv preprint arXiv:1810.05017, 2018 | 22 | 2018 |
Reverb: A framework for experience replay A Cassirer, G Barth-Maron, E Brevdo, S Ramos, T Boyd, T Sottiaux, ... arXiv preprint arXiv:2102.04736, 2021 | 19 | 2021 |
Launchpad: a programming model for distributed machine learning research F Yang, G Barth-Maron, P Stańczyk, M Hoffman, S Liu, M Kroiss, A Pope, ... arXiv preprint arXiv:2106.04516, 2021 | 16 | 2021 |
Toward affordance-aware planning D Abel, G Barth-Maron, J MacGlashan, S Tellex First Workshop on Affordances: Affordances in Vision for Cognitive Robotics, 2014 | 15 | 2014 |
QuaRL: Quantization for fast and environmentally sustainable reinforcement learning S Krishnan, M Lam, S Chitlangia, Z Wan, G Barth-Maron, A Faust, ... arXiv preprint arXiv:1910.01055, 2019 | 14 | 2019 |
Diego de Las Casas, Andreas Fidjeland, Tim Green, Adrià Puigdomènech, Sébastien Racanière, Jack Rae, and Fabio Viola. Open sourcing Sonnet-a new library for constructing neural … M Reynolds, G Barth-Maron, F Besse | 9 | 2017 |
Data-efficient reinforcement learning for continuous control tasks M Riedmiller, R Hafner, M Vecerik, TP Lillicrap, T Lampe, I Popov, ... US Patent 10,664,725, 2020 | 7 | 2020 |
Affordances as transferable knowledge for planning agents G Barth-Maron, D Abel, J MacGlashan, S Tellex 2014 AAAI Fall Symposium Series, 2014 | 7 | 2014 |
Quantized reinforcement learning (quarl) Z Wan | 4 | 2019 |
Learning deep state representations with convolutional autoencoders G Barth-Maron PhD thesis, Master’s thesis, Brown University, 2015 | 4 | 2015 |
Distributional reinforcement learning for continuous control tasks D Budden, MW Hoffman, G Barth-Maron US Patent 11,481,629, 2022 | 3 | 2022 |
Affordance-Aware Planning D Abel, G Barth-Maron, J MacGlashan, S Tellex | 2 | 2015 |