Sledovat
Erdem Bıyık
Název
Citace
Citace
Rok
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ...
arXiv preprint arXiv:2307.15217, 2023
1372023
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning
E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh
arXiv preprint arXiv:1910.04365, 2019
1092019
Batch Active Preference-Based Learning of Reward Functions
E Bıyık, D Sadigh
Proceedings of the 2nd Conference on Robot Learning 87 (Proceedings of …, 2018
952018
When humans aren't optimal: Robots that collaborate with risk-aware humans
M Kwon, E Biyik, A Talati, K Bhasin, DP Losey, D Sadigh
Proceedings of the 2020 ACM/IEEE international conference on human-robot …, 2020
942020
Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences
E Bıyık, DP Losey, M Palan, NC Landolfi, G Shevchuk, D Sadigh
The International Journal of Robotics Research 41 (1), 45-67, 2022
812022
Active preference-based gaussian process regression for reward learning
E Bıyık, N Huynh, MJ Kochenderfer, D Sadigh
arXiv preprint arXiv:2005.02575, 2020
812020
Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving
Z Cao, E Bıyık, WZ Wang, A Raventos, A Gaidon, G Rosman, D Sadigh
arXiv preprint arXiv:2007.00178, 2020
602020
Batch Active Learning Using Determinantal Point Processes
E Bıyık, K Wang, N Anari, D Sadigh
arXiv preprint arXiv:1906.07975, 2019
542019
Learning how to dynamically route autonomous vehicles on shared roads
DA Lazar, E Bıyık, D Sadigh, R Pedarsani
Transportation Research Part C: Emerging Technologies 130, 103258, 2021
402021
Profile‐encoding reconstruction for multiple‐acquisition balanced steady‐state free precession imaging
E Ilicak, LK Senel, E Biyik, T Çukur
Magnetic resonance in medicine 78 (4), 1316-1329, 2017
382017
Learning multimodal rewards from rankings
V Myers, E Biyik, N Anari, D Sadigh
Conference on Robot Learning, 342-352, 2022
372022
Roial: Region of interest active learning for characterizing exoskeleton gait preference landscapes
K Li, M Tucker, E Bıyık, E Novoseller, JW Burdick, Y Sui, D Sadigh, Y Yue, ...
2021 IEEE International Conference on Robotics and Automation (ICRA), 3212-3218, 2021
352021
Active learning of reward dynamics from hierarchical queries
C Basu, E Bıyık, Z He, M Singhal, D Sadigh
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
352019
The green choice: Learning and influencing human decisions on shared roads
E Bıyık, DA Lazar, D Sadigh, R Pedarsani
2019 IEEE 58th Conference on Decision and Control (CDC), 347-354, 2019
322019
Reconstruction by calibration over tensors for multi‐coil multi‐acquisition balanced SSFP imaging
E Biyik, E Ilicak, T Cukur
Magnetic resonance in medicine 79 (5), 2542-2554, 2018
302018
Altruistic autonomy: Beating congestion on shared roads
E Bıyık, DA Lazar, R Pedarsani, D Sadigh
International Workshop on the Algorithmic Foundations of Robotics, 887-904, 2018
272018
Emergent Prosociality in Multi-Agent Games Through Gifting
WZ Wang, M Beliaev, E Bıyık, DA Lazar, R Pedarsani, D Sadigh
arXiv preprint arXiv:2105.06593, 2021
242021
Real-Time Detection, Tracking and Classification of Multiple Moving Objects in UAV Videos
HC Baykara, E Bıyık, G Gül, D Onural, AS Öztürk, İ Yıldız
Tools with Artificial Intelligence (ICTAI), 2017 IEEE 29th International …, 2017
242017
Learning Reward Functions from Scale Feedback
N Wilde, E Bıyık, D Sadigh, SL Smith
arXiv preprint arXiv:2110.00284, 2021
222021
Efficient and safe exploration in deterministic markov decision processes with unknown transition models
E Biyik, J Margoliash, SR Alimo, D Sadigh
2019 American Control Conference (ACC), 1792-1799, 2019
222019
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–20