Follow
Daria Soboleva
Daria Soboleva
Cerebras Systems
Verified email at cerebras.net
Title
Cited by
Cited by
Year
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama
D Soboleva, F Al-Khateeb, R Myers, JR Steeves, J Hestness, N Dey
URL: https://www. cerebras. net/blog/slimpajama-a-627b-token-cleaned-and …, 2023
1372023
Slimpajama-dc: Understanding data combinations for llm training
Z Shen, T Tao, L Ma, W Neiswanger, Z Liu, H Wang, B Tan, J Hestness, ...
arXiv preprint arXiv:2309.10818, 2023
442023
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama, 2023
D Soboleva, F Al-Khateeb, R Myers, JR Steeves, J Hestness, N Dey
URL https://huggingface. co/datasets/cerebras/SlimPajama-627B, 0
23
Replacing human audio with synthetic audio for on-device unspoken punctuation prediction
D Soboleva, O Skopek, M Šajgalík, V Cărbune, F Weissenberger, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
132021
SlimPajama: A 627B token cleaned and deduplicated version of RedPajama, June 2023
D Soboleva, F Al-Khateeb, R Myers, JR Steeves, J Hestness, N Dey
URL https://huggingface. co/datasets/cerebras/SlimPajama-627B, 0
9
Btlm-3b-8k: 7b parameter performance in a 3b parameter model
N Dey, D Soboleva, F Al-Khateeb, B Yang, R Pathria, H Khachane, ...
arXiv preprint arXiv:2309.11568, 2023
62023
Three-stage question answering system with sentence ranking
D Soboleva, K Vorontsov
EPiC Series in Language and Linguistics 4, 18-25, 2019
42019
Position interpolation improves alibi extrapolation
F Al-Khateeb, N Dey, D Soboleva, J Hestness
arXiv preprint arXiv:2310.13017, 2023
22023
MULTI-PHASE TRAINING OF MACHINE LEARNING MODELS FOR SEARCH RANKING
A Boymel, S Daria
US Patent App. 18/074,432, 2023
12023
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
S Bergsma, N Dey, G Gosal, G Gray, D Soboleva, J Hestness
arXiv preprint arXiv:2502.15938, 2025
2025
BLIMEY: Towards Better Routing Methods in Sparse Mixture of Experts
D Soboleva, E Singh, NS Dey, J Hestness
2024
REPLACING HUMAN-RECORDED AUDIO WITH SYNTHETIC AUDIOFOR ON-DEVICE UNSPOKEN PUNCTUATION PREDICTION
D Valcarce, V Carbune, J Proskurnia, R Prabhavalkar, O Skopek, ...
2021
Multi-Task Transformer Networks for Search Relevance Prediction and Ranking
D Soboleva, A Boymel, A Gotmanov, M Ryabinin
The system can't perform the operation now. Try again later.
Articles 1–13