Pierre Richemond

Cited by

	All	Since 2019
Citations	6382	6378
h-index	10	10
i10-index	10	10

2700

1350

675

2025

2020202120222023202489 793 1890 2612 978

Pierre Richemond

Google DeepMind

Verified email at deepmind.com

Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bootstrap your own latent-a new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Advances in neural information processing systems 33, 21271-21284, 2020	5891	2020
Data distributional properties drive emergent in-context learning in transformers S Chan, A Santoro, A Lampinen, J Wang, A Singh, P Richemond, ... Advances in Neural Information Processing Systems 35, 18878-18891, 2022	178	2022
Byol works even without batch statistics PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ... arXiv preprint arXiv:2010.10241, 2020	85	2020
k. kavukcuoglu, R JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Munos, and M. Valko,“Bootstrap your own latent-a new approach to self …, 2020	64	2020
Continuous diffusion for categorical data S Dieleman, L Sartran, A Roshannai, N Savinov, Y Ganin, PH Richemond, ... arXiv preprint arXiv:2211.15089, 2022	56	2022
On Wasserstein reinforcement learning and the Fokker-Planck equation PH Richemond, B Maginnis arXiv preprint arXiv:1712.07185, 2017	24	2017
Understanding self-predictive learning for reinforcement learning Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ... International Conference on Machine Learning, 33632-33656, 2023	21	2023
Zipfian environments for reinforcement learning SCY Chan, AK Lampinen, PH Richemond, F Hill Conference on Lifelong Learning Agents, 406-429, 2022	14	2022
Categorical SDEs with simplex diffusion PH Richemond, S Dieleman, A Doucet arXiv preprint arXiv:2210.14784, 2022	13	2022
Semppl: Predicting pseudo-labels for better contrastive representations M Bošnjak, PH Richemond, N Tomasev, F Strub, JC Walker, F Hill, ... arXiv preprint arXiv:2301.05158, 2023	10	2023
Memory-efficient episodic control reinforcement learning with dynamic online k-means A Agostinelli, K Arulkumaran, M Sarrico, P Richemond, AA Bharath arXiv preprint arXiv:1911.09560, 2019	5	2019
Sample-efficient reinforcement learning with maximum entropy mellowmax episodic control M Sarrico, K Arulkumaran, A Agostinelli, P Richemond, AA Bharath arXiv preprint arXiv:1911.09615, 2019	4	2019
A short variational proof of equivalence between policy gradients and soft q learning PH Richemond, B Maginnis arXiv preprint arXiv:1712.08650, 2017	4	2017
Biologically inspired architectures for sample-efficient deep reinforcement learning PH Richemond, A Kolbeinsson, Y Guo arXiv preprint arXiv:1911.11285, 2019	3	2019
Combining learning rate decay and weight decay with complexity gradient descent-Part I PH Richemond, Y Guo arXiv preprint arXiv:1902.02881, 2019	3	2019
Efficiently applying attention to sequential data with the Recurrent Discounted Attention unit B Maginnis, PH Richemond arXiv preprint arXiv:1705.08480, 2017	3	2017
Generalized Preference Optimization: A Unified Approach to Offline Alignment Y Tang, ZD Guo, Z Zheng, D Calandriello, R Munos, M Rowland, ... arXiv preprint arXiv:2402.05749, 2024	2	2024
The edge of orthogonality: a simple view of what makes BYOL tick PH Richemond, A Tam, Y Tang, F Strub, B Piot, F Hill International Conference on Machine Learning, 29063-29081, 2023	2	2023
Human Alignment of Large Language Models through Online Preference Optimisation D Calandriello, D Guo, R Munos, M Rowland, Y Tang, BA Pires, ... arXiv preprint arXiv:2403.08635, 2024		2024
Data distributional properties drive emergent few-shot learning in transformers SCY Chan, A Santoro, AK Lampinen, JX Wang, A Singh, PH Richemond, ... arXiv preprint arXiv:2205.05055, 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by