Mind2Web: Towards a generalist agent for the web X Deng, Y Gu, B Zheng, S Chen, S Stevens, B Wang, H Sun, Y Su Advances in Neural Information Processing Systems 36, 2024 | 99 | 2024 |
MMMU: A massive multi-discipline multimodal understanding and reasoning benchmark for expert AGI X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ... arXiv preprint arXiv:2311.16502, 2023 | 71 | 2023 |
GPT-4V(ision) is a generalist web agent, if grounded B Zheng, B Gou, J Kil, H Sun, Y Su arXiv preprint arXiv:2401.01614, 2024 | 23 | 2024 |
Exploring generalization ability of pretrained language models on arithmetic and logical reasoning C Wang, B Zheng, Y Niu, Y Zhang Natural Language Processing and Chinese Computing: 10th CCF International …, 2021 | 15 | 2021 |
Learn to remember: Transformer with recurrent memory for document-level machine translation Y Feng, F Li, Z Song, B Zheng, P Koehn arXiv preprint arXiv:2205.01546, 2022 | 12 | 2022 |
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning B Zheng, X Yang, YP Ruan, Z Ling, Q Liu, S Wei, X Zhu The 15th International Workshop on Semantic Evaluation (SemEval-2021), 2021 | 11 | 2021 |
The language barrier: Dissecting safety challenges of LLMs in multilingual contexts L Shen, W Tan, S Chen, Y Chen, J Zhang, H Xu, B Zheng, P Koehn, ... arXiv preprint arXiv:2401.13136, 2024 | 6 | 2024 |
Dual-View Visual Contextualization for Web Navigation J Kil, CH Song, B Zheng, X Deng, Y Su, WL Chao arXiv preprint arXiv:2402.04476, 2024 | 1 | 2024 |
Multilingual Coreference Resolution in Multiparty Dialogue B Zheng, P Xia, M Yarmohammadi, B Van Durme Transactions of the Association for Computational Linguistics 11, 922-940, 2023 | 1 | 2023 |
Flatness-aware prompt selection improves accuracy and sample efficiency L Shen, W Tan, B Zheng, D Khashabi arXiv preprint arXiv:2305.10713, 2023 | 1 | 2023 |
An empirical study on finding spans W Gu, B Zheng, Y Chen, T Chen, B Van Durme arXiv preprint arXiv:2210.06824, 2022 | 1 | 2022 |
A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents L Mo, Z Liao, B Zheng, Y Su, C Xiao, H Sun arXiv preprint arXiv:2402.10196, 2024 | | 2024 |