Olmo: Accelerating the science of language models D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... ACL 2024 🏆 Best Theme Paper, 2024 | 215* | 2024 |
Dolma: An open corpus of three trillion tokens for language model pretraining research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... ACL 2024 🏆 Best Resource Paper, 2024 | 152* | 2024 |
What's In My Big Data? Y Elazar, A Bhagia, I Magnusson, A Ravichander, D Schwenk, A Suhr, ... ICLR 2024 (Spotlight; 5% papers), 2023 | 76 | 2023 |
HINT: Hypernetwork Instruction Tuning for Efficient Zero-& Few-Shot Generalisation H Ivison, A Bhagia, Y Wang, H Hajishirzi, M Peters ACL 2023, 2022 | 25 | 2022 |
Paloma: A benchmark for evaluating language model fit I Magnusson, A Bhagia, V Hofmann, L Soldaini, AH Jha, O Tafjord, ... NeurIPS 2024, 2023 | 22* | 2023 |
Findings of the WMT’22 shared task on large-scale machine translation evaluation for African languages D Adelani, MMI Alam, A Anastasopoulos, A Bhagia, MR Costa-jussą, ... Proceedings of the Seventh Conference on Machine Translation (WMT), 773-800, 2022 | 15 | 2022 |
Continued pretraining for better zero-and few-shot promptability Z Wu, RL Logan IV, P Walsh, A Bhagia, D Groeneveld, S Singh, I Beltagy EMNLP 2022, 2022 | 4 | 2022 |
Olmoe: Open mixture-of-experts language models N Muennighoff, L Soldaini, D Groeneveld, K Lo, J Morrison, S Min, W Shi, ... arXiv preprint arXiv:2409.02060, 2024 | 3 | 2024 |
Catwalk: A unified language model evaluation framework for many datasets D Groeneveld, A Awadalla, I Beltagy, A Bhagia, I Magnusson, H Peng, ... arXiv preprint arXiv:2312.10253, 2023 | 3 | 2023 |
On advances in text generation from images beyond captioning: A case study in self-rationalization S Palaskar, A Bhagia, Y Bisk, F Metze, AW Black, A Marasović Findings of EMNLP 2022, 2022 | 2 | 2022 |
2 OLMo 2 Furious T OLMo, P Walsh, L Soldaini, D Groeneveld, K Lo, S Arora, A Bhagia, ... arXiv preprint arXiv:2501.00656, 2024 | | 2024 |
Establishing Task Scaling Laws via Compute-Efficient Model Ladders A Bhagia, J Liu, A Wettig, D Heineman, O Tafjord, AH Jha, L Soldaini, ... arXiv preprint arXiv:2412.04403, 2024 | | 2024 |
Robust Tooling and New Resources for Large Language Model Evaluation via Catwalk K Richardson, I Magnusson, O Tafjord, A Bhagia, I Beltagy, A Cohan, ... | | |