William Merrill
Title
Cited by
Year
CORD-19: The COVID-19 open research dataset
LL Wang, K Lo, Y Chandrasekhar, R Reas, J Yang, D Eide, K Funk, ...
Workshop on NLP for COVID-19, 2020
Cited by: 802*; Year: 2020
Competency problems: On finding and removing artifacts in language data
M Gardner, W Merrill, J Dodge, ME Peters, A Ross, S Singh, N Smith
Empirical Methods in Natural Language Processing, 2021
Cited by: 68; Year: 2021
A formal hierarchy of RNN architectures
W Merrill, G Weiss, Y Goldberg, R Schwartz, NA Smith, E Yahav
Association for Computational Linguistics, 2020
Cited by: 44; Year: 2020
Sequential neural networks as automata
W Merrill
Deep Learning and Formal Languages (ACL workshop), 2019
Cited by: 44; Year: 2019
Provable limitations of acquiring meaning from ungrounded form: What will future language models understand?
W Merrill, Y Goldberg, R Schwartz, NA Smith
Transactions of the Association for Computational Linguistics 9, 1047-1060, 2021
Cited by: 41; Year: 2021
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension
S Subramanian, W Merrill, T Darrell, M Gardner, S Singh, A Rohrbach
Empirical Methods in Natural Language Processing, 2022
Cited by: 30; Year: 2022
Context-free transductions with neural stacks
Y Hao, W Merrill, D Angluin, R Frank, N Amsel, A Benz, S Mendelsohn
BlackboxNLP, 2018
Cited by: 29; Year: 2018
How language model hallucinations can snowball
M Zhang, O Press, W Merrill, A Liu, NA Smith
arXiv preprint arXiv:2305.13534, 2023
Cited by: 23; Year: 2023
Saturated transformers are constant-depth threshold circuits
W Merrill, A Sabharwal, NA Smith
Transactions of the Association for Computational Linguistics 10, 843-856, 2022
Cited by: 23*; Year: 2022
Effects of parameter norm growth during transformer training: Inductive bias from gradient descent
W Merrill, V Ramanujan, Y Goldberg, R Schwartz, N Smith
Empirical Methods in Natural Language Processing, 2021
Cited by: 16*; Year: 2021
End-to-end graph-based TAG parsing with neural networks
J Kasai, R Frank, P Xu, W Merrill, O Rambow
NAACL, 2018
Cited by: 13; Year: 2018
Formal language theory meets modern NLP
W Merrill
arXiv preprint arXiv:2102.10094, 2021
Cited by: 9; Year: 2021
On the linguistic capacity of real-time counter automata
W Merrill
arXiv preprint arXiv:2004.06866, 2020
Cited by: 8; Year: 2020
Finding hierarchical structure in neural stacks using unsupervised parsing
W Merrill, L Khazan, N Amsel, Y Hao, S Mendelsohn, R Frank
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting …, 2019
Cited by: 7*; Year: 2019
The Parallelism Tradeoff: Limitations of Log-Precision Transformers
W Merrill, A Sabharwal
arXiv preprint arXiv:2207.00729, 2022
Cited by: 6*; Year: 2022
Entailment Semantics Can Be Extracted from an Ideal Language Model
W Merrill, A Warstadt, T Linzen
CoNLL 2022, 2022
Cited by: 4; Year: 2022
A Tale of Two Circuits: Grokking as Competition of Sparse and Dense Subnetworks
W Merrill, N Tsilivis, A Shukla
arXiv preprint arXiv:2303.11873, 2023
Cited by: 2; Year: 2023
Transformers Can Be Expressed In First-Order Logic with Majority
W Merrill, A Sabharwal
arXiv preprint arXiv:2210.02671, 2022
Cited by: 2*; Year: 2022
Extracting finite automata from rnns using state merging
W Merrill, N Tsilivis
arXiv preprint arXiv:2201.12451, 2022
Cited by: 2; Year: 2022
Detecting syntactic change using a neural part-of-speech tagger
W Merrill, GF Stark, R Frank
Proceedings of the 1st International Workshop on Computational Approaches to …, 2019
Cited by: 2; Year: 2019