Follow
Zheng Tian
Title
Cited by
Cited by
Year
Thinking fast and slow with deep learning and tree search
T Anthony, Z Tian, D Barber
Advances in Neural Information Processing Systems, 5360-5370, 2017
3142017
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving
M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ...
Conference on Robot Learning, 264-285, 2021
114*2021
A regularized opponent model with maximum entropy objective
Z Tian, Y Wen, Z Gong, F Punakkath, S Zou, J Wang
arXiv preprint arXiv:1905.08087, 2019
302019
Learning to Communicate Implicitly by Actions.
Z Tian, S Zou, I Davies, T Warr, L Wu, H Bou-Ammar, J Wang
AAAI, 7261-7268, 2020
25*2020
Online Double Oracle
SM McAleer, Z Tian, N Perez-Nieves, O Slumbers, DH Mguni, J Wang, ...
25*
Multi-Agent Constrained Policy Optimisation
S Gu, JG Kuba, M Wen, R Chen, Z Wang, Z Tian, J Wang, A Knoll, Y Yang
arXiv preprint arXiv:2110.02793, 2021
172021
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
Y Wen, H Chen, Y Yang, Z Tian, M Li, X Chen, J Wang
arXiv preprint arXiv:2106.06828, 2021
72021
Learning to Model Opponent Learning (Student Abstract)
I Davies, Z Tian, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (10), 13771 …, 2020
62020
M2N: Mesh Movement Networks for PDE Solvers
W Song, M Zhang, JG Wallwork, J Gao, Z Tian, F Sun, MD Piggott, J Chen, ...
arXiv preprint arXiv:2204.11188, 2022
42022
Learning to Safely Exploit a Non-Stationary Opponent
Z Tian, H Ren, Y Yang, Y Sun, Z Han, I Davies, J Wang
22021
Multi-agent trust region learning
Y Wen, H Chen, Y Yang, Z Tian, M Li, X Chen, J Wang
22021
Order Matters: Agent-by-agent Policy Optimization
X Wang, Z Tian, Z Wan, Y Wen, J Wang, W Zhang
arXiv preprint arXiv:2302.06205, 2023
12023
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Y Wen, Z Wan, M Zhou, S Hou, Z Cao, C Le, J Chen, Z Tian, W Zhang, ...
arXiv preprint arXiv:2212.12669, 2022
12022
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem
C Yu, W Zhang, H Lai, Z Tian, L Kneip, J Wang
arXiv preprint arXiv:2212.09078, 2022
12022
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer
H Lai, W Zhang, X He, C Yu, Z Tian, Y Yu, J Wang
arXiv preprint arXiv:2212.07740, 2022
12022
Opponent Modelling in Multi-Agent Systems
Z Tian
UCL (University College London), 2021
12021
Time-Series Representation Learning in Topology Prediction for Passive Optical Network of Telecom Operators
H Zhao, Y Fang, Y Zhao, Z Tian, W Zhang, X Feng, L Yu, W Li, H Fan, ...
Sensors 23 (6), 3345, 2023
2023
Joint Perception and Control as Inference with an Object-based Implementation
M Li, Z Tian, P Nashikkar, I Davies, Y Wen, J Wang
arXiv preprint arXiv:1903.01385, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–18