Follow
Stephane Hatgis-Kessell
Title
Cited by
Cited by
Year
Models of human preference for learning reward functions
WB Knox, S Hatgis-Kessell, S Booth, S Niekum, P Stone, A Allievi
arXiv preprint arXiv:2206.02231, 2022
192022
Learning optimal advantage from preferences and mistaking it for reward
WB Knox, S Hatgis-Kessell, SO Adalgeirsson, S Booth, A Dragan, P Stone, ...
Proceedings of the AAAI Conference on Artificial Intelligence 38 (9), 10066 …, 2024
22024
The system can't perform the operation now. Try again later.
Articles 1–2