Thinking fast and slow with deep learning and tree search T Anthony, Z Tian, D Barber Advances in Neural Information Processing Systems, 5360-5370, 2017 | 314 | 2017 |
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving M Zhou, J Luo, J Villella, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ... Conference on Robot Learning, 264-285, 2021 | 114* | 2021 |
A regularized opponent model with maximum entropy objective Z Tian, Y Wen, Z Gong, F Punakkath, S Zou, J Wang arXiv preprint arXiv:1905.08087, 2019 | 30 | 2019 |
Learning to Communicate Implicitly by Actions. Z Tian, S Zou, I Davies, T Warr, L Wu, H Bou-Ammar, J Wang AAAI, 7261-7268, 2020 | 25* | 2020 |
Online Double Oracle SM McAleer, Z Tian, N Perez-Nieves, O Slumbers, DH Mguni, J Wang, ... | 25* | |
Multi-Agent Constrained Policy Optimisation S Gu, JG Kuba, M Wen, R Chen, Z Wang, Z Tian, J Wang, A Knoll, Y Yang arXiv preprint arXiv:2110.02793, 2021 | 17 | 2021 |
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization Y Wen, H Chen, Y Yang, Z Tian, M Li, X Chen, J Wang arXiv preprint arXiv:2106.06828, 2021 | 7 | 2021 |
Learning to Model Opponent Learning (Student Abstract) I Davies, Z Tian, J Wang Proceedings of the AAAI Conference on Artificial Intelligence 34 (10), 13771 …, 2020 | 6 | 2020 |
M2N: Mesh Movement Networks for PDE Solvers W Song, M Zhang, JG Wallwork, J Gao, Z Tian, F Sun, MD Piggott, J Chen, ... arXiv preprint arXiv:2204.11188, 2022 | 4 | 2022 |
Learning to Safely Exploit a Non-Stationary Opponent Z Tian, H Ren, Y Yang, Y Sun, Z Han, I Davies, J Wang | 2 | 2021 |
Multi-agent trust region learning Y Wen, H Chen, Y Yang, Z Tian, M Li, X Chen, J Wang | 2 | 2021 |
Order Matters: Agent-by-agent Policy Optimization X Wang, Z Tian, Z Wan, Y Wen, J Wang, W Zhang arXiv preprint arXiv:2302.06205, 2023 | 1 | 2023 |
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective Y Wen, Z Wan, M Zhou, S Hou, Z Cao, C Le, J Chen, Z Tian, W Zhang, ... arXiv preprint arXiv:2212.12669, 2022 | 1 | 2022 |
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem C Yu, W Zhang, H Lai, Z Tian, L Kneip, J Wang arXiv preprint arXiv:2212.09078, 2022 | 1 | 2022 |
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer H Lai, W Zhang, X He, C Yu, Z Tian, Y Yu, J Wang arXiv preprint arXiv:2212.07740, 2022 | 1 | 2022 |
Opponent Modelling in Multi-Agent Systems Z Tian UCL (University College London), 2021 | 1 | 2021 |
Time-Series Representation Learning in Topology Prediction for Passive Optical Network of Telecom Operators H Zhao, Y Fang, Y Zhao, Z Tian, W Zhang, X Feng, L Yu, W Li, H Fan, ... Sensors 23 (6), 3345, 2023 | | 2023 |
Joint Perception and Control as Inference with an Object-based Implementation M Li, Z Tian, P Nashikkar, I Davies, Y Wen, J Wang arXiv preprint arXiv:1903.01385, 2019 | | 2019 |