| Title | Authors | Venue | Cited by | Year |
|---|---|---|---|---|
| Finite-sample analysis for SARSA with linear function approximation | S Zou, T Xu, Y Liang | Advances in Neural Information Processing Systems 32 | 117 | 2019 |
| Two time-scale off-policy TD learning: Non-asymptotic analysis over Markovian samples | T Xu, S Zou, Y Liang | Advances in Neural Information Processing Systems 32 | 64 | 2019 |
| Improving sample complexity bounds for actor-critic algorithms | T Xu, Z Wang, Y Liang | arXiv preprint arXiv:2004.12956 | 55* | 2020 |
| Non-asymptotic convergence analysis of two time-scale (natural) actor-critic algorithms | T Xu, Z Wang, Y Liang | arXiv preprint arXiv:2005.03557 | 37 | 2020 |
| A primal approach to constrained policy optimization: Global optimality and finite-time analysis | T Xu, Y Liang, G Lan | | 34* | 2020 |
| Algorithms for the estimation of transient surface heat flux during ultra-fast surface cooling | ZF Zhou, TY Xu, B Chen | International Journal of Heat and Mass Transfer 100, 1-10 | 34 | 2016 |
| Reanalysis of variance reduced temporal difference learning | T Xu, Z Wang, Y Zhou, Y Liang | arXiv preprint arXiv:2001.01898 | 30 | 2020 |
| Non-asymptotic convergence of Adam-type reinforcement learning algorithms under Markovian sampling | H Xiong, T Xu, Y Liang, W Zhang | Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10460… | 22 | 2021 |
| Enhanced first and zeroth order variance reduced algorithms for min-max optimization | T Xu, Z Wang, Y Liang, HV Poor | | 19* | 2020 |
| When Will Gradient Methods Converge to Max-margin Classifier under ReLU Models? | T Xu, Y Zhou, K Ji, Y Liang | arXiv preprint arXiv:1806.04339 | 17* | 2018 |
| Sample complexity bounds for two timescale value-based reinforcement learning algorithms | T Xu, Y Liang | International Conference on Artificial Intelligence and Statistics, 811-819 | 13 | 2021 |
| Doubly robust off-policy actor-critic: Convergence and optimality | T Xu, Z Yang, Z Wang, Y Liang | International Conference on Machine Learning, 11581-11591 | 12 | 2021 |
| Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry | Z Chen, Y Zhou, T Xu, Y Liang | arXiv preprint arXiv:2102.04653 | 8 | 2021 |
| Faster algorithm and sharper analysis for constrained Markov decision process | T Li, Z Guan, S Zou, T Xu, Y Liang, G Lan | arXiv preprint arXiv:2110.10351 | 5 | 2021 |
| When will generative adversarial imitation learning algorithms attain global convergence | Z Guan, T Xu, Y Liang | International Conference on Artificial Intelligence and Statistics, 1117-1125 | 5 | 2021 |
| A unified off-policy evaluation approach for general value function | T Xu, Z Yang, Z Wang, Y Liang | arXiv preprint arXiv:2107.02711 | 1 | 2021 |
| Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward | T Xu, Y Liang | arXiv preprint arXiv:2206.06426 | | 2022 |
| Deterministic Policy Gradient: Convergence Analysis | H Xiong, T Xu, L Zhao, Y Liang, W Zhang | The 38th Conference on Uncertainty in Artificial Intelligence | | 2022 |
| Model-Based Offline Meta-Reinforcement Learning with Regularization | S Lin, J Wan, T Xu, Y Liang, J Zhang | arXiv preprint arXiv:2202.02929 | | 2022 |
| PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method | Z Guan, T Xu, Y Liang | arXiv preprint arXiv:2110.06906 | | 2021 |