The statistical complexity of interactive decision making DJ Foster, SM Kakade, J Qian, A Rakhlin arXiv preprint arXiv:2112.13487, 2021 | 224 | 2021 |
Convex and non-convex optimization under generalized smoothness H Li, J Qian, Y Tian, A Rakhlin, A Jadbabaie Advances in Neural Information Processing Systems 36, 40238-40271, 2023 | 55 | 2023 |
Exploration bonus for regret minimization in discrete and continuous average reward mdps J Qian, R Fruit, M Pirotta, A Lazaric Advances in Neural Information Processing Systems 32, 2019 | 46* | 2019 |
Importance resampling for off-policy prediction M Schlegel, W Chung, D Graves, J Qian, M White Advances in Neural Information Processing Systems 32, 2019 | 46 | 2019 |
Towards minimax optimal reinforcement learning in factored markov decision processes Y Tian, J Qian, S Sra Advances in Neural Information Processing Systems 33, 19896-19907, 2020 | 29 | 2020 |
Model-free reinforcement learning with the decision-estimation coefficient DJ Foster, N Golowich, J Qian, A Rakhlin, A Sekhari Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 27* | 2023 |
Concentration inequalities for multinoulli random variables J Qian, R Fruit, M Pirotta, A Lazaric arXiv preprint arXiv:2001.11595, 2020 | 23 | 2020 |
Byzantine-robust federated linear bandits A Jadbabaie, H Li, J Qian, Y Tian 2022 IEEE 61st Conference on Decision and Control (CDC), 5206-5213, 2022 | 15 | 2022 |
Robust learning under clean-label attack A Blum, S Hanneke, J Qian, H Shao Conference on Learning Theory, 591-634, 2021 | 10 | 2021 |
Online estimation via offline estimation: An information-theoretic framework DJ Foster, Y Han, J Qian, A Rakhlin Advances in Neural Information Processing Systems 37, 42840-42898, 2025 | 8 | 2025 |
Bridging multiple worlds: multi-marginal optimal transport for causal partial-identification problem Z Gao, S Ge, J Qian arXiv preprint arXiv:2406.07868, 2024 | 3 | 2024 |
How Does Variance Shape the Regret in Contextual Bandits? Z Jia, J Qian, A Rakhlin, CY Wei Advances in Neural Information Processing Systems 37, 83730-83785, 2024 | 1 | 2024 |
Refined Risk Bounds for Unbounded Losses via Transductive Priors J Qian, A Rakhlin, N Zhivotovskiy arXiv preprint arXiv:2410.21621, 2024 | 1 | 2024 |
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff J Qian, H Hu, D Simchi-Levi arXiv preprint arXiv:2405.17796, 2024 | 1 | 2024 |
The Non-linear -Design and Applications to Interactive Learning A Agarwal, J Qian, A Rakhlin, T Zhang Forty-first International Conference on Machine Learning, 2024 | 1 | 2024 |
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits Y Gu, Y Han, J Qian arXiv preprint arXiv:2503.00273, 2025 | | 2025 |
Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability F Chen, DJ Foster, Y Han, J Qian, A Rakhlin, Y Xu Advances in Neural Information Processing Systems 37, 75585-75641, 2024 | | 2024 |
To bootstrap or to rollout? An optimal and adaptive interpolation W Mou, J Qian arXiv preprint arXiv:2411.09731, 2024 | | 2024 |