Follow
Simeng Sun
Simeng Sun
Verified email at nvidia.com - Homepage
Title
Cited by
Cited by
Year
Hard-coded gaussian attention for neural machine translation
W You, S Sun, M Iyyer
ACL 2020, 2020
652020
Do Long-Range Language Models Actually Use Long-Range Context?
S Sun, K Krishna, A Mattarella-Micke, M Iyyer
EMNLP 2021, 2021
512021
How to compare summarizers without target length? pitfalls, solutions and re-examination of the neural summarization literature
S Sun, O Shapira, I Dagan, A Nenkova
Proceedings of the Workshop on Methods for Optimizing and Evaluating Neural …, 2019
492019
Energy-based reranking: Improving neural machine translation using energy-based models
S Bhattacharyya, A Rooshenas, S Naskar, S Sun, M Iyyer, A McCallum
ACL 2021, 2020
392020
The feasibility of embedding based automatic evaluation for single document summarization
S Sun, A Nenkova
Proceedings of the 2019 conference on empirical methods in natural language …, 2019
232019
Pearl: Prompting large language models to plan and execute actions over long documents
S Sun, Y Liu, S Wang, C Zhu, M Iyyer
arXiv preprint arXiv:2305.14564, 2023
212023
Revisiting simple neural probabilistic language models
S Sun, M Iyyer
NAACL 2021, 2021
142021
Alternative Input Signals Ease Transfer in Multilingual Machine Translation
S Sun, A Fan, J Cross, V Chaudhary, C Tran, P Koehn, F Guzmán
ACL 2022, 2022
112022
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of rlhf
S Sun, D Gupta, M Iyyer
arXiv preprint arXiv:2309.09055, 2023
102023
How does in-context learning help prompt tuning?
S Sun, Y Liu, D Iter, C Zhu, M Iyyer
arXiv preprint arXiv:2302.11521, 2023
102023
IGA: An intent-guided authoring assistant
S Sun, W Zhao, V Manjunatha, R Jain, V Morariu, F Dernoncourt, ...
EMNLP 2021, 2021
102021
Energy-based reranking: Improving neural machine translation using energy-based models
S Naskar, A Rooshenas, S Sun, M Iyyer, A McCallum
arXiv e-prints, arXiv: 2009.13267, 2020
102020
ChapterBreak: A Challenge Dataset for Long-Range Language Models
S Sun, K Thai, M Iyyer
NAACL 2022, 2022
92022
Name disambiguation for chinese scientific authors with multi-level clustering
S Sun, H Zhang, N Li, Y Chen
2017 IEEE International Conference on Computational Science and Engineering …, 2017
72017
TopicGPT: A prompt-based topic modeling framework
CM Pham, A Hoyle, S Sun, M Iyyer
arXiv preprint arXiv:2311.01449, 2023
52023
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages
S Sun, M Elbayad, A Sun, J Cross
EACL 2023, 2023
22023
RULER: What's the Real Context Size of Your Long-Context Language Models?
CP Hsieh, S Sun, S Kriman, S Acharya, D Rekesh, F Jia, B Ginsburg
arXiv preprint arXiv:2404.06654, 2024
2024
TOWARDS EFFECTIVE MODELING OF LONG-RANGE CONTEXT
S SUN
University of Massachusetts Amherst, 2024
2024
How Much Do Modifications to Transformer Language Models Affect Their Ability to Learn Linguistic Knowledge?
S Sun, BW Dillon, M Iyyer
Proceedings of the Third Workshop on Insights from Negative Results in NLP …, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–19