Jian Qian
Verified email at mit.edu
Title
Cited by
Year
The statistical complexity of interactive decision making
DJ Foster, SM Kakade, J Qian, A Rakhlin
arXiv preprint arXiv:2112.13487, 2021
Cited by: 224 | Year: 2021
Convex and non-convex optimization under generalized smoothness
H Li, J Qian, Y Tian, A Rakhlin, A Jadbabaie
Advances in Neural Information Processing Systems 36, 40238-40271, 2023
Cited by: 55 | Year: 2023
Exploration bonus for regret minimization in discrete and continuous average reward MDPs
J Qian, R Fruit, M Pirotta, A Lazaric
Advances in Neural Information Processing Systems 32, 2019
Cited by: 46* | Year: 2019
Importance resampling for off-policy prediction
M Schlegel, W Chung, D Graves, J Qian, M White
Advances in Neural Information Processing Systems 32, 2019
Cited by: 46 | Year: 2019
Towards minimax optimal reinforcement learning in factored Markov decision processes
Y Tian, J Qian, S Sra
Advances in Neural Information Processing Systems 33, 19896-19907, 2020
Cited by: 29 | Year: 2020
Model-free reinforcement learning with the decision-estimation coefficient
DJ Foster, N Golowich, J Qian, A Rakhlin, A Sekhari
Thirty-seventh Conference on Neural Information Processing Systems, 2023
Cited by: 27* | Year: 2023
Concentration inequalities for multinoulli random variables
J Qian, R Fruit, M Pirotta, A Lazaric
arXiv preprint arXiv:2001.11595, 2020
Cited by: 23 | Year: 2020
Byzantine-robust federated linear bandits
A Jadbabaie, H Li, J Qian, Y Tian
2022 IEEE 61st Conference on Decision and Control (CDC), 5206-5213, 2022
Cited by: 15 | Year: 2022
Robust learning under clean-label attack
A Blum, S Hanneke, J Qian, H Shao
Conference on Learning Theory, 591-634, 2021
Cited by: 10 | Year: 2021
Online estimation via offline estimation: An information-theoretic framework
DJ Foster, Y Han, J Qian, A Rakhlin
Advances in Neural Information Processing Systems 37, 42840-42898, 2025
Cited by: 8 | Year: 2025
Bridging multiple worlds: multi-marginal optimal transport for causal partial-identification problem
Z Gao, S Ge, J Qian
arXiv preprint arXiv:2406.07868, 2024
Cited by: 3 | Year: 2024
How Does Variance Shape the Regret in Contextual Bandits?
Z Jia, J Qian, A Rakhlin, CY Wei
Advances in Neural Information Processing Systems 37, 83730-83785, 2024
Cited by: 1 | Year: 2024
Refined Risk Bounds for Unbounded Losses via Transductive Priors
J Qian, A Rakhlin, N Zhivotovskiy
arXiv preprint arXiv:2410.21621, 2024
Cited by: 1 | Year: 2024
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
J Qian, H Hu, D Simchi-Levi
arXiv preprint arXiv:2405.17796, 2024
Cited by: 1 | Year: 2024
The Non-linear F-Design and Applications to Interactive Learning
A Agarwal, J Qian, A Rakhlin, T Zhang
Forty-first International Conference on Machine Learning, 2024
Cited by: 1 | Year: 2024
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits
Y Gu, Y Han, J Qian
arXiv preprint arXiv:2503.00273, 2025
Year: 2025
Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability
F Chen, DJ Foster, Y Han, J Qian, A Rakhlin, Y Xu
Advances in Neural Information Processing Systems 37, 75585-75641, 2024
Year: 2024
To bootstrap or to rollout? An optimal and adaptive interpolation
W Mou, J Qian
arXiv preprint arXiv:2411.09731, 2024
Year: 2024