Jumanji: Industry-Driven Hardware-Accelerated RL Environments.(2022) C Bonnet, D Byrne, V Le, L Midgley, D Luo, C Waters, S Abramowitz, ... URL https://github. com/instadeepai/jumanji, 2022 | 7 | 2022 |
A sequence modelling approach to question answering in text-based games G Furman, E Toledo, J Shock, J Buys Association for Computational Linguistics, 2022 | 3 | 2022 |
Policy-based reinforcement learning for generalisation in interactive text-based environments E Toledo, J Buys, J Shock Proceedings of the 17th Conference of the European Chapter of the …, 2023 | 1 | 2023 |
RepGraph: Visualising and analysing meaning representation graphs J Cohen, R Cohen, E Toledo, J Buys Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 1 | 2021 |
SMX: Sequential Monte Carlo Planning for Expert Iteration MV Macfarlane, E Toledo, D Byrne, S Singh, P Duckworth, A Laterre arXiv preprint arXiv:2402.07963, 2024 | | 2024 |
Stoix: Distributed Single-Agent Reinforcement Learning End-to-End in JAX E Toledo https://github.com/EdanToledo/Stoix, 2024 | | 2024 |
Flashbax: Streamlining Experience Replay Buffers for Reinforcement Learning with JAX E Toledo, L Midgley, D Byrne, CR Tilbury, M Macfarlane, C Courtot, ... https://github.com/instadeepai/flashbax/, 2023 | | 2023 |