Mridul Agarwal
Title
Cited by
Year
Achieving zero constraint violation for constrained reinforcement learning via primal-dual approach
Q Bai, AS Bedi, M Agarwal, A Koppel, V Aggarwal
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3682-3689, 2022
Cited by: 52, Year: 2022
Multi-agent multi-armed bandits with limited communication
M Agarwal, V Aggarwal, K Azizzadenesheli
Journal of Machine Learning Research 23 (212), 1-24, 2022
Cited by: 32, Year: 2022
Stochastic Top K-Subset Bandits with Linear Space and Non-Linear Feedback with Applications to Social Influence Maximization
M Agarwal, V Aggarwal, AK Umrawal, CJ Quinn
ACM/IMS Transactions on Data Science (TDS) 2 (4), 1-39, 2022
Cited by: 23*, Year: 2022
On the approximation of cooperative heterogeneous multi-agent reinforcement learning (MARL) using mean field control (MFC)
WU Mondal, M Agarwal, V Aggarwal, SV Ukkusuri
Journal of Machine Learning Research 23 (129), 1-46, 2022
Cited by: 23, Year: 2022
Transferring dexterous surgical skill knowledge between robots for semi-autonomous teleoperation
MM Rahman, N Sanchez-Tamayo, G Gonzalez, M Agarwal, V Aggarwal, ...
2019 28th IEEE International Conference on Robot and Human Interactive …, 2019
Cited by: 23, Year: 2019
Multi-objective reinforcement learning with non-linear scalarization
M Agarwal, V Aggarwal, T Lan
Proceedings of the 21st International Conference on Autonomous Agents and …, 2022
Cited by: 19, Year: 2022
DESERTS: Delay-tolerant semi-autonomous robot teleoperation for surgery
G Gonzalez, M Agarwal, MV Balakuntala, MM Rahman, U Kaur, ...
2021 IEEE International Conference on Robotics and Automation (ICRA), 12693 …, 2021
Cited by: 19, Year: 2021
Regret guarantees for model-based reinforcement learning with long-term average constraints
M Agarwal, Q Bai, V Aggarwal
Uncertainty in Artificial Intelligence, 22-31, 2022
Cited by: 18*, Year: 2022
Blind decision making: Reinforcement learning with delayed observations
M Agarwal, V Aggarwal
Pattern Recognition Letters 150, 176-182, 2021
Cited by: 16, Year: 2021
Reinforcement learning for joint optimization of multiple rewards
M Agarwal, V Aggarwal
arXiv preprint arXiv:1909.02940, 2019
Cited by: 16*, Year: 2019
Communication efficient parallel reinforcement learning
M Agarwal, B Ganguly, V Aggarwal
Uncertainty in Artificial Intelligence, 247-256, 2021
Cited by: 13, Year: 2021
SARTRES: a semi-autonomous robot teleoperation environment for surgery
MM Rahman, MV Balakuntala, G Gonzalez, M Agarwal, U Kaur, ...
Computer Methods in Biomechanics and Biomedical Engineering: Imaging …, 2021
Cited by: 12, Year: 2021
An explore-then-commit algorithm for submodular maximization under full-bandit feedback
G Nie, M Agarwal, AK Umrawal, V Aggarwal, CJ Quinn
Uncertainty in Artificial Intelligence, 1541-1551, 2022
Cited by: 11, Year: 2022
Reinforcement learning for mean-field game
M Agarwal, V Aggarwal, A Ghosh, N Tiwari
Algorithms 15 (3), 73, 2022
Cited by: 10, Year: 2022
Concave utility reinforcement learning with zero-constraint violations
M Agarwal, Q Bai, V Aggarwal
arXiv preprint arXiv:2109.05439, 2021
Cited by: 10, Year: 2021
ASAP: A semi-autonomous precise system for telesurgery during communication delays
G Gonzalez, M Balakuntala, M Agarwal, T Low, B Knoth, AW Kirkpatrick, ...
IEEE Transactions on Medical Robotics and Bionics 5 (1), 66-78, 2023
Cited by: 8, Year: 2023
Joint optimization of multi-objective reinforcement learning with policy gradient based algorithm
Q Bai, M Agarwal, V Aggarwal
arXiv preprint arXiv:2105.14125, 2021
Cited by: 8, Year: 2021
DART: Adaptive accept reject algorithm for non-linear combinatorial bandits
M Agarwal, V Aggarwal, AK Umrawal, C Quinn
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6557-6565, 2021
Cited by: 8*, Year: 2021
Escaping saddle points for zeroth-order non-convex optimization using estimated gradient descent
Q Bai, M Agarwal, V Aggarwal
2020 54th Annual Conference on Information Sciences and Systems (CISS), 1-6, 2020
Cited by: 7, Year: 2020
Grasping region identification in novel objects using Microsoft Kinect
A Rai, PK Patchaikani, M Agarwal, R Gupta, L Behera
Neural Information Processing: 19th International Conference, ICONIP 2012 …, 2012
Cited by: 6, Year: 2012