Shruti Bhosale
Facebook AI Research
Verified email at fb.com
Title
Cited by
Year
Llama 2: Open foundation and fine-tuned chat models
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 2023
Cited by: 13581; Year: 2023
The llama 3 herd of models
A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ...
arXiv preprint arXiv:2407.21783, 2024
Cited by: 3641; Year: 2024
Beyond english-centric multilingual machine translation
A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ...
Journal of Machine Learning Research 22 (107), 1-48, 2021
Cited by: 889; Year: 2021
No language left behind: Scaling human-centered machine translation
MR Costa-Jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ...
arXiv preprint arXiv:2207.04672, 2022
Cited by: 797; Year: 2022
BASE Layers: Simplifying Training of Large, Sparse Models
M Lewis, S Bhosale, T Dettmers, N Goyal, ...
International Conference on Machine Learning, 2021
Cited by: 263; Year: 2021
Effective long-context scaling of foundation models
W Xiong, J Liu, I Molybog, H Zhang, P Bhargava, R Hou, L Martin, ...
arXiv preprint arXiv:2309.16039, 2023
Cited by: 205; Year: 2023
Efficient Large Scale Language Modeling with Mixtures of Experts
M Artetxe, S Bhosale, N Goyal, T Mihaylov, M Ott, S Shleifer, XV Lin, J Du, ...
EMNLP 2022, 2021
Cited by: 171*; Year: 2021
Llama 2: Open foundation and fine-tuned chat models. arXiv 2023
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288 10, 2023
Cited by: 158; Year: 2023
Jingfei Du, et al. 2021. Few-shot learning with multilingual language models
XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ...
arXiv preprint arXiv:2112.10668, 35-40, 2021
Cited by: 102; Year: 2021
Facebook AI’s WMT21 News Translation Task Submission
C Tran, S Bhosale, J Cross, P Koehn, S Edunov, A Fan
Proceedings of the Sixth Conference on Machine Translation, 205-215, 2021
Cited by: 102; Year: 2021
No language left behind: Scaling human-centered machine translation
N Team, MR Costa-Jussà, J Cross, O Çelebi, M Elbayad, K Heafield, ...
arXiv preprint arXiv:2207.04672, 2022
Cited by: 84; Year: 2022
Few-shot learning with multilingual generative language models
XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ...
Proceedings of the 2022 conference on empirical methods in natural language …, 2022
Cited by: 81; Year: 2022
& Scialom, T.(2023). Llama 2: Open foundation and fine-tuned chat models
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 2023
Cited by: 69; Year: 2023
Llama 2: open foundation and fine-tuned chat models. CoRR abs/2307.09288 (2023)
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288 10, 2023
Cited by: 66; Year: 2023
Few-shot learning with multilingual language models
XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ...
EMNLP 2022, 2021
Cited by: 62; Year: 2021
Llama 2: Open foundation and fine-tuned chat models, 2023b
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
URL https://arxiv.org/abs/2307.09288, 2023
Cited by: 49; Year: 2023
Fairscale: A general purpose modular pytorch library for high performance and large scale training
M Baines, S Bhosale, V Caggiano, N Goyal, S Goyal, M Ott, B Lefaudeux, ...
Cited by: 48; Year: 2021
Llama 2: Open foundation and fine-tuned chat models. arXiv [Preprint](2023)
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 0
Cited by: 36
Revisiting machine translation for cross-lingual classification
M Artetxe, V Goswami, S Bhosale, A Fan, L Zettlemoyer
arXiv preprint arXiv:2305.14240, 2023
Cited by: 27; Year: 2023
Tricks for Training Sparse Translation Models
D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
Cited by: 27; Year: 2021