CORL: Research-oriented deep offline reinforcement learning library D Tarasov, A Nikulin, D Akimov, V Kurenkov, S Kolesnikov Advances in Neural Information Processing Systems 36, 2024 | 42 | 2024 |
Distributed soft actor-critic with multivariate reward representation and knowledge distillation D Akimov arXiv preprint arXiv:1911.13056, 2019 | 10 | 2019 |
Deep reinforcement learning with vizdoomfirst-person shooter D Akimov, I Makarov | 9 | 2019 |
Q-ensemble for offline rl: Don't scale the ensemble, scale the batch size A Nikulin, V Kurenkov, D Tarasov, D Akimov, S Kolesnikov arXiv preprint arXiv:2211.11092, 2022 | 8 | 2022 |
Let offline rl flow: Training conservative agents in the latent space of normalizing flows D Akimov, V Kurenkov, A Nikulin, D Tarasov, S Kolesnikov arXiv preprint arXiv:2211.11096, 2022 | 8 | 2022 |