Transfer reinforcement learning with shared dynamics R Laroche, M Barlier Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017 | 65 | 2017 |
Human-machine dialogue as a stochastic game M Barlier, J Perolat, R Laroche, O Pietquin 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015 | 31 | 2015 |
A simple and efficient smoothing method for faster optimization and local exploration K Scaman, L Dos Santos, M Barlier, I Colin Advances in Neural Information Processing Systems 33, 6503-6513, 2020 | 7 | 2020 |
Training dialogue systems with human advice M Barlier, R Laroche, O Pietquin AAMAS 2018-the 17th International Conference on Autonomous Agents and …, 2018 | 7 | 2018 |
Enhancing reinforcement learning agents with local guides P Daoudi, B Robu, C Prieur, LD Santos, M Barlier arXiv preprint arXiv:2402.13930, 2024 | 6 | 2024 |
Multi-agent best arm identification with private communications A Rio, M Barlier, I Colin, M Soare International Conference on Machine Learning, 29082-29102, 2023 | 6 | 2023 |
Density Estimation For Conservative Q-Learning P Daoudi, L Dos Santos, M Barlier, A Virmaux | 5 | 2022 |
A stochastic model for computer-aided human-human dialogue M Barlier, R Laroche, O Pietquin Interspeech 2016, 2051-2055, 2016 | 4 | 2016 |
Improving a proportional integral controller with reinforcement learning on a throttle valve benchmark P Daoudi, B Mavkov, B Robu, C Prieur, E Witrant, M Barlier, L Dos Santos 2024 IEEE Conference on Control Technology and Applications (CCTA), 217-222, 2024 | 3 | 2024 |
Price of safety in linear best arm identification X Shang, I Colin, M Barlier, H Cherkaoui arXiv preprint arXiv:2309.08709, 2023 | 2 | 2023 |
Learning dialogue dynamics with the method of moments M Barlier, R Laroche, O Pietquin 2016 IEEE Spoken Language Technology Workshop (SLT), 98-105, 2016 | 2 | 2016 |
Human-machine dialogue as a stochastic game B Merwan, P Julien, L Romain, P Olivier 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015 | 2 | 2015 |
Density estimation for conservative q-learning, 2022 P Daoudi, M Barlier, L Dos Santos, A Virmaux URL https://openreview.net/forum, 0 | 2 | |
A trust region approach for few-shot sim-to-real reinforcement learning P Daoudi, B Robu, C Prieur, L Dos Santos, M Barlier | 1 | 2023 |
Differentially Private Policy Gradient A Rio, M Barlier, I Colin arXiv preprint arXiv:2501.19080, 2025 | | 2025 |
Measures of diversity and space-filling designs for categorical data C Malherbe, E Domínguez-Sánchez, M Barlier, I Colin, HB Ammar, ... Forty-first International Conference on Machine Learning, 2024 | | 2024 |
Differentially Private Deep Model-Based Reinforcement Learning A Rio, M Barlier, I Colin, A Thomas arXiv preprint arXiv:2402.05525, 2024 | | 2024 |
Differentially Private Model-Based Offline Reinforcement Learning A Rio, M Barlier, I Colin, A Thomas arXiv e-prints, arXiv:2402.05525, 2024 | | 2024 |
A conservative approach for few-shot transfer in off-dynamics reinforcement learning P Daoudi, C Prieur, B Robu, M Barlier, LD Santos arXiv preprint arXiv:2312.15474, 2023 | | 2023 |
Clustered Multi-Agent Linear Bandits H Cherkaoui, M Barlier, I Colin arXiv preprint arXiv:2309.08710, 2023 | | 2023 |