Follow
Jacob Steinhardt
Jacob Steinhardt
Verified email at cs.stanford.edu - Homepage
Title
Cited by
Cited by
Year
Concrete problems in AI safety
D Amodei, C Olah, J Steinhardt, P Christiano, J Schulman, D Mané
arXiv preprint arXiv:1606.06565, 2016
18012016
Certified defenses against adversarial examples
A Raghunathan, J Steinhardt, P Liang
arXiv preprint arXiv:1801.09344, 2018
8302018
The malicious use of artificial intelligence: Forecasting, prevention, and mitigation
M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A Dafoe, ...
arXiv preprint arXiv:1802.07228, 2018
6282018
Certified defenses for data poisoning attacks
J Steinhardt, PWW Koh, PS Liang
Advances in neural information processing systems 30, 2017
5852017
Natural adversarial examples
D Hendrycks, K Zhao, S Basart, J Steinhardt, D Song
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
5732021
The many faces of robustness: A critical analysis of out-of-distribution generalization
D Hendrycks, S Basart, N Mu, S Kadavath, F Wang, E Dorundo, R Desai, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
5182021
Semidefinite relaxations for certifying robustness to adversarial examples
A Raghunathan, J Steinhardt, PS Liang
Advances in neural information processing systems 31, 2018
3562018
Learning from untrusted data
M Charikar, J Steinhardt, G Valiant
Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing …, 2017
2552017
Troubling trends in machine learning scholarship
ZC Lipton, J Steinhardt
arXiv preprint arXiv:1807.03341, 2018
251*2018
Sever: A robust meta-algorithm for stochastic optimization
I Diakonikolas, G Kamath, D Kane, J Li, J Steinhardt, A Stewart
International Conference on Machine Learning, 1596-1606, 2019
2312019
Stronger data poisoning attacks break data sanitization defenses
PW Koh, J Steinhardt, P Liang
Machine Learning, 1-47, 2022
1432022
Rethinking bias-variance trade-off for generalization of neural networks
Z Yang, Y Yu, C You, J Steinhardt, Y Ma
International Conference on Machine Learning, 10767-10777, 2020
1202020
Resilience: A criterion for learning in the presence of arbitrary outliers
J Steinhardt, M Charikar, G Valiant
arXiv preprint arXiv:1703.04940, 2017
1172017
Robust moment estimation and improved clustering via sum of squares
PK Kothari, J Steinhardt, D Steurer
Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing …, 2018
1112018
Testing robustness against unforeseen adversaries
D Kang, Y Sun, D Hendrycks, T Brown, J Steinhardt
arXiv preprint arXiv:1908.08016, 2019
972019
Scaling out-of-distribution detection for real-world settings
D Hendrycks, S Basart, M Mazeika, M Mostajabi, J Steinhardt, D Song
arXiv preprint arXiv:1911.11132, 2019
812019
Measuring massive multitask language understanding
D Hendrycks, C Burns, S Basart, A Zou, M Mazeika, D Song, J Steinhardt
arXiv preprint arXiv:2009.03300, 2020
802020
Finite-time regional verification of stochastic non-linear systems
J Steinhardt, R Tedrake
The International Journal of Robotics Research 31 (7), 901-923, 2012
772012
Aligning ai with shared human values
D Hendrycks, C Burns, S Basart, A Critch, J Li, D Song, J Steinhardt
arXiv preprint arXiv:2008.02275, 2020
762020
Memory, communication, and statistical queries
J Steinhardt, G Valiant, S Wager
Conference on Learning Theory, 1490-1516, 2016
762016
The system can't perform the operation now. Try again later.
Articles 1–20