Jérémy Scheurer - Google Scholar


Cited by

	All	Since 2019
Citations	377	376
h-index	7	7
i10-index	5	5

Citations per year
Year	2020	2021	2022	2023	2024
Citations	2	1	14	186	170

Public access (based on funding mandates)

1 article available, 0 articles not available

Jérémy Scheurer


New York University

Verified email at nyu.edu

Deep Learning, Reinforcement Learning, NLP


Title	Cited by	Year
Open problems and fundamental limitations of reinforcement learning from human feedback. S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ... arXiv preprint arXiv:2307.15217, 2023	174	2023
Training language models with language feedback at scale. J Scheurer, JA Campos, T Korbak, JS Chan, A Chen, K Cho, E Perez. arXiv preprint arXiv:2303.16755, 2023	64	2023
Training Language Models with Language Feedback. J Scheurer, JA Campos, JS Chan, A Chen, K Cho, E Perez. arXiv preprint arXiv:2204.14146, 2022	56*	2022
Improving code generation by training with natural language feedback. A Chen, J Scheurer, T Korbak, JA Campos, JS Chan, SR Bowman, K Cho, ... arXiv preprint arXiv:2303.16749, 2023	39	2023
Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema. J Scheurer, C Ferrari, LBT Bom, M Beer, W Kempf, L Haug. Annual Conference on Medical Image Understanding and Analysis, 26-42, 2020	11	2020
Technical report: Large language models can strategically deceive their users when put under pressure. J Scheurer, M Balesni, M Hobbhahn. arXiv preprint arXiv:2311.07590, 2023	9	2023
Black-Box Access is Insufficient for Rigorous AI Audits. S Casper, C Ezell, C Siegmann, N Kolt, TL Curtis, B Bucknall, A Haupt, ... arXiv preprint arXiv:2401.14446, 2024	8	2024
A Causal Framework for AI Regulation and Auditing. L Sharkey, CN Ghuidhir, D Braun, J Scheurer, M Balesni, L Bushnaq, ... Preprints, 2024	7*	2024
Instance-wise algorithm configuration with graph neural networks. R Valentin, C Ferrari, J Scheurer, A Amrollahi, C Wendler, MB Paulus. arXiv preprint arXiv:2202.04910, 2022	5	2022
Few-shot adaptation works with unpredictable data. JS Chan, M Pieler, J Jao, J Scheurer, E Perez. arXiv preprint arXiv:2208.01009, 2022	4	2022
Practical Pitfalls of Causal Scrubbing. J Scheurer, H Philipp, M Tony, T Jacques, L David. https://www.lesswrong.com/posts/DFarDnQjMnjsKvW8s/practical-pitfalls-of …, 2023		2023
Meta Reward Learning for Recommender Systems: Towards Value Alignment. J Scheurer		2021
Meta-Learning an Image Editing Style. J Scheurer		2019
Large Language Models can Strategically Deceive their Users when Put Under Pressure. J Scheurer, M Balesni, M Hobbhahn. ICLR 2024 Workshop on Large Language Model (LLM) Agents		

