Inferring the effectiveness of government interventions against COVID-19 J Brauner*, S Mindermann*, M Sharma*, D Johnston, J Salvatier, ... Science 371 (6531), 2021 | 1102 | 2021 |
Understanding the effectiveness of government interventions against the resurgence of COVID-19 in Europe M Sharma*, S Mindermann*, C Rogers-Smith, G Leech, B Snodin, J Ahuja, ... Nature Communications 12 (1), 1-13, 2021 | 235 | 2021 |
The alignment problem from a deep learning perspective R Ngo, L Chan, S Mindermann ICLR 2024, 2022 | 154 | 2022 |
Occam's razor is insufficient to infer the preferences of irrational agents S Armstrong*, S Mindermann* NeurIPS, 2018 | 130* | 2018 |
Changing composition of SARS-CoV-2 lineages and rise of Delta variant in England S Mishra*, S Mindermann*, M Sharma*, C Whittaker*, TA Mellan, T Wilton, ... EClinicalMedicine - The Lancet 39, 101064, 2021 | 128* | 2021 |
Prioritized training on points that are learnable, worth learning, and not yet learned S Mindermann*, M Razzak*, W Xu, A Kirsch, M Sharma, A Morisot, ... ICML, 2022 | 106 | 2022 |
Mask wearing in community settings reduces SARS-CoV-2 transmission G Leech, C Rogers-Smith, JT Monrad, JB Sandbrink, B Snodin, R Zinkov, ... Proceedings of the National Academy of Sciences 119 (23), e2119266119, 2022 | 106* | 2022 |
Is the cure really worse than the disease? The health impacts of lockdowns during COVID-19 G Meyerowitz-Katz, S Bhatt, O Ratmann, JM Brauner, S Flaxman, ... BMJ Global Health 6 (8), e006653, 2021 | 90 | 2021 |
Identifying Causal-Effect Inference Failure with Uncertainty-Aware Models A Jesson*, S Mindermann*, U Shalit, Y Gal NeurIPS, 2020 | 86 | 2020 |
Managing AI risks in an era of rapid progress Y Bengio, G Hinton, A Yao, D Song, P Abbeel, YN Harari, YQ Zhang, ... Science 384 (6698), 2023 | 71 | 2023 |
Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding A Jesson, S Mindermann, Y Gal, U Shalit ICML, 2021 | 61 | 2021 |
Managing extreme AI risks amid rapid progress Y Bengio, G Hinton, A Yao, D Song, P Abbeel, T Darrell, YN Harari, ... Science 384 (6698), 842-845, 2024 | 57 | 2024 |
Seasonal variation in SARS-CoV-2 transmission in temperate climates: A Bayesian modelling study in 143 European regions T Gavenčiak, JT Monrad, G Leech, M Sharma, S Mindermann, S Bhatt, ... PLOS Computational Biology 18 (8), e1010435, 2022 | 53 | 2022 |
Active Inverse Reward Design S Mindermann*, R Shah*, A Gleave, D Hadfield-Menell arXiv preprint arXiv:1809.03060, 2018 | 53* | 2018 |
Sleeper agents: Training deceptive LLMs that persist through safety training E Hubinger, C Denison, J Mu, M Lambert, M Tong, M MacDiarmid, ... arXiv preprint arXiv:2401.05566, 2024 | 36 | 2024 |
How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions L Pacchiardi, AJ Chan, S Mindermann, I Moscovitz, AY Pan, Y Gal, ... ICLR 2024, 2023 | 31 | 2023 |
How Robust are the Estimated Effects of Nonpharmaceutical Interventions against COVID-19? M Sharma*, S Mindermann*, J Brauner*, G Leech, A Stephenson, ... NeurIPS (Spotlight talk), 2020 | 31* | 2020 |
Effectiveness assessment of non-pharmaceutical interventions: lessons learned from the COVID-19 pandemic A Lison, N Banholzer, M Sharma, S Mindermann, HJT Unwin, S Mishra, ... The Lancet Public Health 8 (4), e311-e317, 2023 | 25 | 2023 |
Inferring the effectiveness of government interventions against COVID-19 JM Brauner, S Mindermann, M Sharma, D Johnston, J Salvatier, ... Science, eabd9338, 2020 | 22 | 2020 |
Specific versus general principles for constitutional AI S Kundu, Y Bai, S Kadavath, A Askell, A Callahan, A Chen, A Goldie, ... arXiv preprint arXiv:2310.13798, 2023 | 18 | 2023 |