Tor Lattimore

Cited by

	All	Since 2019
Citations	7341	6864
h-index	39	35
i10-index	67	62

Citations per year
Year	Citations
2013	24
2014	28
2015	53
2016	57
2017	95
2018	183
2019	346
2020	765
2021	1223
2022	1439
2023	1580
2024	1507

Public access (based on funding mandates)
Available	23 articles
Not available	0 articles

Co-authors

Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Marcus Hutter	Researcher@DeepMind & Professor at ANU	Verified email at anu.edu.au
Botao Hao	OpenAI	Verified email at openai.com
Andras Gyorgy	DeepMind	Verified email at google.com
Laurent Orseau	Research Scientist at Google DeepMind	Verified email at google.com
Branislav Kveton	Adobe Research	Verified email at adobe.com
Eren Sezener	DeepMind	Verified email at google.com
Ian Osband	OpenAI	Verified email at openai.com
Joel Veness	Google DeepMind	Verified email at google.com
Christoph Dann	Research Scientist, Google	Verified email at google.com
Emma Brunskill	Associate Professor of Computer Science, Stanford University	Verified email at cs.stanford.edu
Julian Zimmert	Google Research	Verified email at google.com
Mengdi Wang	Center for Statistics & Machine Learning, ECE, Princeton University	Verified email at princeton.edu
Avishkar Bhoopchand	Research Engineer, DeepMind	Verified email at google.com
Agnieszka Grabska Barwińska	DeepMind	Verified email at google.com
Peter Toth	AI Research	Verified email at techcombank.com.vn
Benjamin Van Roy	Stanford University	Verified email at stanford.edu
Satinder Singh	Google DeepMind / U. of Michigan	Verified email at umich.edu
Johannes Kirschner	Swiss Data Science Center, ETH Zurich	Verified email at sdsc.ethz.ch
Dale Schuurmans	University of Alberta, Google DeepMind	Verified email at cs.ualberta.ca

Tor Lattimore

DeepMind

Verified email at google.com

machine learning, learning theory, reinforcement learning


Title	Authors	Venue	Cited by	Year
Bandit algorithms	T Lattimore, C Szepesvári	Cambridge University Press, 2020	3066	2020
Unifying PAC and regret: Uniform PAC bounds for episodic reinforcement learning	C Dann, T Lattimore, E Brunskill	Advances in Neural Information Processing Systems 30, 2017	325	2017
Causal bandits: Learning good interventions via causal inference	F Lattimore, T Lattimore, MD Reid	Advances in Neural Information Processing Systems 29, 2016	283*	2016
Degenerate feedback loops in recommender systems	R Jiang, S Chiappa, T Lattimore, A György, P Kohli	Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 383-390, 2019	240	2019
Learning with good feature representations in bandits and in RL with a generative model	T Lattimore, C Szepesvari, G Weisz	International Conference on Machine Learning, 5662-5670, 2020	196	2020
Behaviour suite for reinforcement learning	I Osband, Y Doron, M Hessel, J Aslanides, E Sezener, A Saraiva, ...	arXiv preprint arXiv:1908.03568, 2019	188	2019
PAC bounds for discounted MDPs	T Lattimore, M Hutter	Algorithmic Learning Theory: 23rd International Conference, ALT 2012, Lyon …, 2012	149	2012
The end of optimism? An asymptotic analysis of finite-armed linear bandits	T Lattimore, C Szepesvari	Artificial Intelligence and Statistics, 728-737, 2017	139	2017
Conservative bandits	Y Wu, R Shariff, T Lattimore, C Szepesvári	International Conference on Machine Learning, 1254-1262, 2016	128	2016
On explore-then-commit strategies	A Garivier, T Lattimore, E Kaufmann	Advances in Neural Information Processing Systems 29, 2016	126	2016
A geometric perspective on optimal representations for reinforcement learning	M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ...	Advances in Neural Information Processing Systems 32, 2019	106	2019
Model selection in contextual stochastic bandit problems	A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ...	Advances in Neural Information Processing Systems 33, 10328-10337, 2020	105	2020
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits	B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh	International Conference on Machine Learning, 3601-3610, 2019	81	2019
Toprank: A practical algorithm for online stochastic ranking	T Lattimore, B Kveton, S Li, C Szepesvari	Advances in Neural Information Processing Systems 31, 2018	77	2018
Linear bandits with stochastic delayed feedback	C Vernade, A Carpentier, T Lattimore, G Zappella, B Ermis, M Brueckner	International Conference on Machine Learning, 9712-9721, 2020	76	2020
The sample-complexity of general reinforcement learning	T Lattimore, M Hutter, P Sunehag	International Conference on Machine Learning, 28-36, 2013	75	2013
Near-optimal PAC bounds for discounted MDPs	T Lattimore, M Hutter	Theoretical Computer Science 558, 125-143, 2014	70	2014
Bounded Regret for Finite-Armed Structured Bandits	T Lattimore, R Munos		70	2014
Adaptive exploration in linear contextual bandit	B Hao, T Lattimore, C Szepesvari	International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	68	2020
An information-theoretic approach to minimax regret in partial monitoring	T Lattimore, C Szepesvári	Conference on Learning Theory, 2111-2139, 2019	67	2019

Articles 1–20