Shangtong Zhang

Cited by

	All	Since 2019
Citations	1316	1277
h-index	16	16
i10-index	25	24

300

150

225

201720182019202020212022202320246 27 62 161 229 283 296 242

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
Richard S. SuttonKeen, Amii, and University of AlbertaVerified email at richsutton.com
Bo LiuPhD, AAAI SM, IEEE SMVerified email at cs.umass.edu
Remi Tachet des CombesVerified email at alpacaml.com
Romain LarocheMicrosoft ResearchVerified email at polytechnique.org
Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, AmiiVerified email at ualberta.ca
Wendelin BöhmerSequential Decision Making Group, Delft University of TechnologyVerified email at tudelft.nl
Ray JiangResearch Scientist, DeepMindVerified email at google.com
Marcus EdelComputer Science, Free University of BerlinVerified email at fu-berlin.de
Ryan R. CurtinFree agentVerified email at ratml.org
Nando de FreitasCIFAR & DeepMindVerified email at google.com
Tom Le PaineStaff Research Scientist at Google DeepMindVerified email at google.com
Julian SchrittwieserDeepMindVerified email at furidamu.org
Roman RingGoogle DeepMindVerified email at deepmind.com
Petko GeorgievGoogle DeepMind, University of CambridgeVerified email at cam.ac.uk
Michael MathieuDeepMindVerified email at google.com
Aäron van den OordGoogle DeepMindVerified email at google.com
Caglar GulcehreAI Researcher, Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMindVerified email at google.com
Aja HuangDeepMindVerified email at google.com
Sherjil OzairTesla AIVerified email at tesla.com

Shangtong Zhang

University of Virginia

Verified email at virginia.edu - Homepage

reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A Deeper Look at Experience Replay S Zhang, RS Sutton Deep Reinforcement Learning Symposium, NIPS 2017, 2017	369	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values S Zhang, B Liu, S Whiteson ICML 2020, 2020	104	2020
Distributional Reinforcement Learning for Efficient Exploration B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu ICML 2019, 2019	101	2019
mlpack 3: a fast, flexible machine learning library R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang Journal of Open Source Software 3 (26), 726, 2018	91	2018
DAC: The Double Actor-Critic Architecture for Learning Options S Zhang, S Whiteson NeurIPS 2019, 2019	86	2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation S Zhang, B Liu, H Yao, S Whiteson ICML 2020, 2019	61	2019
Generalized Off-Policy Actor-Critic S Zhang, W Boehmer, S Whiteson NeurIPS 2019, 2019	55	2019
Breaking the Deadly Triad with a Target Network S Zhang, H Yao, S Whiteson ICML 2021, 2021	49	2021
Mean-variance policy iteration for risk-averse reinforcement learning S Zhang, B Liu, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	42	2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation S Zhang, Y Wan, RS Sutton, S Whiteson ICML 2021, 2021	38	2021
QUOTA: The Quantile Option Architecture for Reinforcement Learning S Zhang, B Mavrin, L Kong, B Liu, H Yao AAAI 2019, 2018	36	2018
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search S Zhang, H Chen, H Yao AAAI 2019, 2018	31	2018
Modularized Implementation of Deep RL Algorithms in PyTorch S Zhang	31*	2018
Deep Residual Reinforcement Learning S Zhang, W Boehmer, S Whiteson AAMAS 2020, 2019	29	2019
A deep neural network for modeling music P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	29	2015
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... arXiv preprint arXiv:2308.03526, 2023	28*	2023
Learning expected emphatic traces for deep RL R Jiang, S Zhang, V Chelu, A White, H van Hasselt Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	16	2022
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu AAAI 2020, 2019	15	2019
On the Convergence of SARSA with Linear Function Approximation S Zhang, RT Des Combes, R Laroche ICML 2023, 2023	13*	2023
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms S Zhang, R Laroche, H van Seijen, S Whiteson, RT Combes arXiv preprint arXiv:2010.01069, 2020	13	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors