MS MARCO: A human generated machine reading comprehension dataset DF Campos, T Nguyen, M Rosenberg, X Song, J Gao, S Tiwary, ... ArXiv, abs/1611.09268, 2016 | 1628* | 2016 |
Overview of the TREC 2019 deep learning track N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees arXiv preprint arXiv:2003.07820, 2020 | 317 | 2020 |
XGLUE: A new benchmark dataset for cross-lingual pre-training, understanding and generation Y Liang, N Duan, Y Gong, N Wu, F Guo, W Qi, M Gong, L Shou, D Jiang, ... Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 201 | 2020 |
Leading conversational search by suggesting useful questions C Rosset, C Xiong, X Song, D Campos, N Craswell, S Tiwary, P Bennett Proceedings of the web conference 2020, 1160-1170, 2020 | 66 | 2020 |
ORCAS: 20 million clicked query-document pairs for analyzing search N Craswell, D Campos, B Mitra, E Yilmaz, B Billerbeck Proceedings of the 29th ACM International Conference on Information …, 2020 | 52 | 2020 |
Open Domain Web Keyphrase Extraction Beyond Language Modeling L Xiong, C Hu, C Xiong, D Campos, A Overwijk EMNLP-IJCNLP 2019, 2019 | 40* | 2019 |
MS MARCO: Benchmarking ranking models in the large-data regime N Craswell, B Mitra, E Yilmaz, D Campos, J Lin Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 35 | 2021 |
TREC deep learning track: Reusable test collections in the large data regime N Craswell, B Mitra, E Yilmaz, D Campos, EM Voorhees, I Soboroff Proceedings of the 44th international ACM SIGIR conference on research and …, 2021 | 25 | 2021 |
The optimal BERT surgeon: Scalable and accurate second-order pruning for large language models E Kurtic, D Campos, T Nguyen, E Frantar, M Kurtz, B Fineran, M Goin, ... arXiv preprint arXiv:2203.07259, 2022 | 22 | 2022 |
On the reliability of test collections for evaluating systems of different types E Yilmaz, N Craswell, B Mitra, D Campos Proceedings of the 43rd International ACM SIGIR Conference on Research and …, 2020 | 18 | 2020 |
Significant improvements over the state of the art? A case study of the MS MARCO document ranking leaderboard J Lin, D Campos, N Craswell, B Mitra, E Yilmaz Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 16 | 2021 |
Keyphrase extraction beyond language modeling L Xiong, C Hu, A Overwijk, J Ahmed, DF Campos, C Xiong US Patent 11,250,214, 2022 | 11 | 2022 |
Curriculum learning for language modeling D Campos arXiv preprint arXiv:2108.02170, 2021 | 9 | 2021 |
Fostering coopetition while plugging leaks: The design and implementation of the MS MARCO leaderboards J Lin, D Campos, N Craswell, B Mitra, E Yilmaz Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 6 | 2022 |
GAIA at SMKBP 2020-a dockerlized multi-media multi-lingual knowledge extraction, clustering, temporal tracking and hypothesis generation system M Li, Y Lin, TM Lai, X Pan, H Wen, S Li, Z Wang, P Yu, L Huang, D Lu, ... Proceedings of Thirteenth Text Analysis Conference (TAC 2020), 2020 | 4 | 2020 |
IMG2SMI: Translating Molecular Structure Images to Simplified Molecular-input Line-entry System D Campos, H Ji arXiv preprint arXiv:2109.04202, 2021 | 3 | 2021 |
Sparse* BERT: Sparse Models are Robust D Campos, A Marques, T Nguyen, M Kurtz, CX Zhai arXiv preprint arXiv:2205.12452, 2022 | 2 | 2022 |
Using a Multi-Task-Trained Neural Network to Guide Interaction with a Query-Processing System via Useful Suggestions CL Rosset, C Xiong, PN Bennett, SK Tiwary, DF Campos, X Song, ... US Patent App. 16/850,886, 2021 | 1 | 2021 |
CAPOT: Creating Robust Dense Query Encoders using Post Training Contrastive Alignment D Campos, CX Zhai, A Magnani arXiv preprint arXiv:2304.03401, 2023 | | 2023 |
To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency D Campos, CX Zhai arXiv preprint arXiv:2304.02721, 2023 | | 2023 |