Follow
Muning Wen
Title
Cited by
Cited by
Year
Trust region policy optimisation in multi-agent reinforcement learning
JG Kuba, R Chen, M Wen, Y Wen, F Sun, J Wang, Y Yang
10th International Conference on Learning Representations, 2021
1452021
Multi-agent reinforcement learning is a sequence modeling problem
M Wen, J Kuba, R Lin, W Zhang, Y Wen, J Wang, Y Yang
Advances in Neural Information Processing Systems 35, 16509-16521, 2022
992022
Offline pre-trained multi-agent decision transformer: One big sequence model tackles all smac tasks
L Meng, M Wen, Y Yang, C Le, X Li, W Zhang, Y Wen, H Zhang, J Wang, ...
arXiv preprint arXiv:2112.02845, 2021
73*2021
Settling the variance of multi-agent policy gradients
JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang
Advances in Neural Information Processing Systems 34, 13458-13470, 2021
432021
Multi-agent constrained policy optimisation
S Gu, JG Kuba, M Wen, R Chen, Z Wang, Z Tian, J Wang, A Knoll, Y Yang
arXiv preprint arXiv:2110.02793, 2021
402021
Malib: A parallel framework for population-based multi-agent reinforcement learning
M Zhou, Z Wan, H Wang, M Wen, R Wu, Y Wen, Y Yang, W Zhang, ...
JMLR, 2021
362021
Alphazero-like tree-search can guide large language model decoding and training
X Feng, Z Wan, M Wen, Y Wen, W Zhang, J Wang
arXiv preprint arXiv:2309.17179, 2023
132023
Large sequence models for sequential decision-making: a survey
M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ...
Frontiers of Computer Science 17 (6), 176349, 2023
112023
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision
R Zhou, Y Yang, M Wen, Y Wen, W Wang, C Xi, G Xu, Y Yu, W Zhang
arXiv preprint arXiv:2403.06221, 2024
2024
Entropy-Regularized Token-Level Policy Optimization for Large Language Models
M Wen, C Deng, J Wang, W Zhang, Y Wen
arXiv preprint arXiv:2402.06700, 2024
2024
RoMAT: Role-based multi-agent transformer for generalizable heterogeneous cooperation
D Wang, F Zhong, M Wen, M Li, Y Peng, T Li, Y Yang
Neural Networks, 106129, 2024
2024
Open-Ended Learning in General-Sum Games: The Role of Diversity in Correlated Equilibrium
Z Zhao, M Wen, Y Wen, Y Yang
2023
The system can't perform the operation now. Try again later.
Articles 1–12