Zhengyuan Zhou

Citace

	Všechny	Od 2019
Citace	5678	5194
h-index	36	35
i10-index	73	65

1300

650

325

975

2014201520162017201820192020202120222023202417 20 52 87 188 398 678 901 977 1288 841

Veřejný přístup

Zobrazit všechny

34 článků

2 články

dostupné

nedostupné

Vychází ze zplnomocnění pro financování

Sledovat

Zhengyuan Zhou

Dept of Technology, Operations and Statistics at NYU Stern

E-mailová adresa ověřena na: stern.nyu.edu - Domovská stránka

contextual bandits learning in games stochastic optimization reinforcement learning operations management


Název Seřadit podle citací Seřadit podle roku Seřadit podle názvu	Citace Citace	Rok
MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels L Jiang, Z Zhou, T Leung, LJ Li, L Fei-Fei International Conference on Machine Learning, 2309-2318, 2018	1659	2018
Learning in games with continuous action sets and unknown payoff functions P Mertikopoulos, Z Zhou Mathematical Programming 173 (1-2), 465-507, 2019	274	2019
Estimation considerations in contextual bandits M Dimakopoulou, Z Zhou, S Athey, G Imbens arXiv preprint arXiv:1711.07077, 2017	230	2017
Balanced linear contextual bandits M Dimakopoulou, Z Zhou, S Athey, G Imbens Proceedings of the AAAI Conference on Artificial Intelligence 33, 3445-3453, 2019	209	2019
Robust low-rank tensor recovery with rectification and alignment X Zhang, D Wang, Z Zhou, Y Ma IEEE transactions on pattern analysis and machine intelligence 43 (1), 238-255, 2019	193	2019
Cooperative pursuit with Voronoi partitions Z Zhou, W Zhang, J Ding, H Huang, DM Stipanović, CJ Tomlin Automatica 72, 64-72, 2016	183*	2016
Offline multi-action policy learning: Generalization and optimization Z Zhou, S Athey, S Wager Operations Research, 2022	176	2022
Multiplayer reach-avoid games via pairwise outcomes M Chen, Z Zhou, CJ Tomlin IEEE Transactions on Automatic Control 62 (3), 1451-1457, 2016	151	2016
Distributional soft actor critic for risk sensitive learning X Ma, Q Zhang, L Xia, Z Zhou, J Yang, Q Zhao arXiv preprint arXiv:2004.14547, 2020	102	2020
Stochastic mirror descent in variationally coherent optimization problems Z Zhou, P Mertikopoulos, N Bambos, S Boyd, PW Glynn Advances in Neural Information Processing Systems 30, 2017	98	2017
Learning in generalized linear contextual bandits with stochastic delays Z Zhou, R Xu, J Blanchet Advances in Neural Information Processing Systems, 5197-5208, 2019	93	2019
A general, open-loop formulation for reach-avoid games Z Zhou, R Takei, H Huang, CJ Tomlin Decision and Control (CDC), 2012 IEEE 51st Annual Conference on, 6501-6506, 2012	86	2012
Infinite time horizon maximum causal entropy inverse reinforcement learning Z Zhou, M Bloem, N Bambos IEEE Transactions on Automatic Control 63 (9), 2787-2802, 2018	82	2018
Efficient path planning algorithms in reach-avoid problems Z Zhou, J Ding, H Huang, R Takei, C Tomlin Automatica 89, 28-36, 2018	77	2018
Finite-Sample Regret Bound for Distributionally Robust Offline Tabular Reinforcement Learning Z Zhou, Z Zhou, Q Bai, L Qiu, J Blanchet, P Glynn International Conference on Artificial Intelligence and Statistics, 3331-3339, 2021	73	2021
Multiplayer reach-avoid games via low dimensional solutions and maximum matching M Chen, Z Zhou, CJ Tomlin 2014 American control conference, 1444-1449, 2014	72	2014
Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go? Z Zhou, P Mertikopoulos, N Bambos, PW Glynn, Y Ye, LJ Li, FF Li ICML 2018-35th International Conference on Machine Learning, 1-10, 2018	71	2018
Evasion as a team against a faster pursuer SY Liu, Z Zhou, C Tomlin, K Hedrick 2013 American Control Conference, 5368-5373, 2013	68	2013
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State S Dong, B Van Roy, Z Zhou arXiv preprint arXiv:2102.05261, 2021	67*	2021
Distributionally robust policy evaluation and learning in offline contextual bandits N Si, F Zhang, Z Zhou, J Blanchet International Conference on Machine Learning, 8884-8894, 2020	62	2020

Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.

Články 1–20

Citace za rok

Duplicitní citace

Sloučené citace

Přidat spoluautorySpoluautoři

Sledovat

Citace