Follow
Yani Donchev
Yani Donchev
Google DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
DiLoCo: Distributed Low-Communication Training of Language Models
A Douillard, Q Feng, AA Rusu, R Chhaparia, Y Donchev, A Kuncoro, ...
arXiv preprint arXiv:2311.08105, 2023
312023
Scaling instructable agents across many simulated worlds
MA Raad, A Ahuja, C Barros, F Besse, A Bolt, A Bolton, B Brownfield, ...
arXiv preprint arXiv:2404.10179, 2024
252024
DiPaCo: Distributed Path Composition
A Douillard, Q Feng, AA Rusu, A Kuncoro, Y Donchev, R Chhaparia, ...
arXiv preprint arXiv:2403.10616, 2024
62024
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch
A Douillard, Y Donchev, K Rush, S Kale, Z Charles, Z Garrett, G Teston, ...
arXiv preprint arXiv:2501.18512, 2025
22025
The system can't perform the operation now. Try again later.
Articles 1–4