Follow
Haitham Bou-Ammar
Haitham Bou-Ammar
RL-Team Leader, BO-Team Leader, MAS-Team Leader @ Huawei London & H. Assistant Professor @ UCL
Verified email at huawei.com
Title
Cited by
Cited by
Year
Online multi-task learning for policy gradient methods
HB Ammar, E Eaton, P Ruvolo, M Taylor
International conference on machine learning, 1206-1214, 2014
1852014
Smarts: Scalable multi-agent reinforcement learning training school for autonomous driving
M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ...
arXiv preprint arXiv:2010.09776, 2020
1382020
Controller design for quadrotor uavs using reinforcement learning
H Bou-Ammar, H Voos, W Ertel
2010 IEEE International Conference on Control Applications, 2130-2135, 2010
1252010
Hebo: Pushing the limits of sample-efficient hyper-parameter optimisation
AI Cowen-Rivers, W Lyu, R Tutunov, Z Wang, A Grosnit, RR Griffiths, ...
Journal of Artificial Intelligence Research 74, 1269-1349, 2022
116*2022
An automated measure of mdp similarity for transfer in reinforcement learning
HB Ammar, E Eaton, ME Taylor, DC Mocanu, K Driessens, G Weiss, ...
Workshops at the twenty-eighth AAAI conference on artificial intelligence 1, 2014
1042014
Autonomous cross-domain knowledge transfer in lifelong policy gradient reinforcement learning
HB Ammar, E Eaton, JM Luna, P Ruvolo
Twenty-fourth international joint conference on artificial intelligence, 2015
972015
Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment
HB Ammar, E Eaton, P Ruvolo, M Taylor
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
872015
Reinforcement learning transfer via sparse coding
HB Ammar, K Tuyls, ME Taylor, K Driessens, G Weiss
Proceedings of the 11th international conference on autonomous agents and …, 2012
732012
Safe policy search for lifelong reinforcement learning with sublinear regret
HB Ammar, R Tutunov, E Eaton
International Conference on Machine Learning, 2361-2369, 2015
702015
Wasserstein robust reinforcement learning
MA Abdullah, H Ren, HB Ammar, V Milenkovic, R Luo, M Zhang, J Wang
arXiv preprint arXiv:1907.13196, 2019
652019
Distributed newton method for large-scale consensus optimization
R Tutunov, H Bou-Ammar, A Jadbabaie
IEEE Transactions on Automatic Control 64 (10), 3983-3994, 2019
652019
Theoretically-grounded policy advice from multiple teachers in reinforcement learning settings with applications to negative transfer
Y Zhan, HB Ammar
arXiv preprint arXiv:1604.03986, 2016
612016
Balancing two-player stochastic games with soft q-learning
J Grau-Moya, F Leibfried, H Bou-Ammar
arXiv preprint arXiv:1802.03216, 2018
542018
Nonlinear tracking and landing controller for quadrotor aerial robots
H Voos, H Bou-Ammar
Control Applications (CCA), 2010 IEEE International Conference on, 2136-2141, 2010
512010
Evolution of cooperation in arbitrary complex networks
B Ranjbar-Sahraei, H Bou Ammar, D Bloembergen, K Tuyls, G Weiss
Proceedings of the 2014 international conference on Autonomous agents and …, 2014
492014
Factored four way conditional restricted boltzmann machines for activity recognition
DC Mocanu, HB Ammar, D Lowet, K Driessens, A Liotta, G Weiss, K Tuyls
Pattern Recognition Letters 66, 100-108, 2015
482015
High-dimensional Bayesian optimisation with variational autoencoders and deep metric learning
A Grosnit, R Tutunov, AM Maraval, RR Griffiths, AI Cowen-Rivers, L Yang, ...
arXiv preprint arXiv:2106.03609, 2021
462021
Reduced reference image quality assessment via boltzmann machines
DC Mocanu, G Exarchakos, HB Ammar, A Liotta
2015 IFIP/IEEE International Symposium on Integrated Network Management (IM …, 2015
342015
Automatically mapped transfer between reinforcement learning tasks via three-way restricted boltzmann machines
HB Ammar, DC Mocanu, ME Taylor, K Driessens, K Tuyls, G Weiss
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2013
342013
Reinforcement learning transfer via common subspaces
HB Ammar, ME Taylor
Adaptive and Learning Agents: International Workshop, ALA 2011, Held at …, 2012
33*2012
The system can't perform the operation now. Try again later.
Articles 1–20