Dex-net 1.0: A cloud-based network of 3d objects for robust grasp planning using a multi-armed bandit model with correlated rewards J Mahler, FT Pokorny, B Hou, M Roderick, M Laskey, M Aubry, K Kohlhoff, ... 2016 IEEE international conference on robotics and automation (ICRA), 1957-1964, 2016 | 443 | 2016 |
Implementing the deep q-network M Roderick, J MacGlashan, S Tellex arXiv preprint arXiv:1711.07478, 2017 | 109 | 2017 |
Enforcing robust control guarantees within neural network policies PL Donti, M Roderick, M Fazlyab, JZ Kolter arXiv preprint arXiv:2011.08105, 2020 | 79 | 2020 |
Deep abstract q-networks M Roderick, C Grimm, S Tellex arXiv preprint arXiv:1710.00459, 2017 | 42 | 2017 |
Mean actor critic C Allen, K Asadi, M Roderick, A Mohamed, G Konidaris, M Littman arXiv preprint arXiv:1709.00503, 2017 | 37* | 2017 |
Provably safe pac-mdp exploration using analogies M Roderick, V Nagarajan, Z Kolter International Conference on Artificial Intelligence and Statistics, 1216-1224, 2021 | 12 | 2021 |
Implementing the deep q-network. arXiv M Roderick, J MacGlashan, S Tellex arXiv preprint arXiv:1711.07478, 2017 | 7 | 2017 |
Implementing the deep q-network. arXiv 2017 M Roderick, J MacGlashan, S Tellex arXiv preprint arXiv:1711.07478, 0 | 7 | |
The AmphibiaWeb app and use of mobile devices in research and outreach M Roderick, J Gross Herpetology Notes 7, 109-113, 2014 | 2 | 2014 |
Systems and methods for estimating input certainty for a neural network using generative modeling M Roderick, F Berkenkamp, F Sheikholeslami, J Kolter US Patent App. 17/488,096, 2023 | 1 | 2023 |
Device and method for improved policy learning for robots F Berkenkamp, G Manek, JZ Kolter, M Roderick US Patent App. 18/589,910, 2024 | | 2024 |
Generative Posterior Networks for Approximately Bayesian Epistemic Uncertainty Estimation M Roderick, F Berkenkamp, F Sheikholeslami, Z Kolter arXiv preprint arXiv:2312.17411, 2023 | | 2023 |
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning M Roderick, G Manek, F Berkenkamp, JZ Kolter arXiv preprint arXiv:2311.14885, 2023 | | 2023 |
Ensuring the Safety of Reinforcement Learning Algorithms at Training and Deployment M Roderick Carnegie Mellon University, 2023 | | 2023 |
Ensuring Safety at Every Stage of the Reinforcement Learning Pipeline M Roderick Carnegie Mellon University Pittsburgh, PA, 2022 | | 2022 |
Controller with neural network and improved stability JZ Kolter, M Roderick, PL Donti, J Vinogradska US Patent App. 17/184,995, 2021 | | 2021 |
Interacting with an unsafe physical environment D Reeb, JZ Kolter, M Roderick, V Nagarajan US Patent App. 17/121,237, 2021 | | 2021 |
2023 Theses by Author JT BLANE, P CASANOVA, V DWIVEDI, TJ GLAZIER, J LACOMIS, ... | | |
DWIVEDI, VISHAL CMU-S3D-22-110 GLAZIER, Thomas J. CMU-S3D-23-110 LACOMIS, Jeremy CMU-S3D-23-103 MAGELINSKI, Thomas CMU-S3D-23-101 M RODERICK, ZR SHI, J SHIN, W DIVENCENZO, DG WIDDER | | |