Aviral Kumar

Citations

	All	Since 2019
Citations	8447	8439
h-index	30	30
i10-index	41	41

Citations per year: 2019 (54), 2020 (328), 2021 (1137), 2022 (2135), 2023 (3221), 2024 (1552)

Public access

20 articles available, 0 articles not available

Based on funding mandates

Co-authors

Sergey Levine - UC Berkeley, Physical Intelligence - Verified email at eecs.berkeley.edu
George Tucker - Google Brain - Verified email at google.com
Chelsea Finn - Stanford University, Google - Verified email at cs.stanford.edu
Anikait Singh - Stanford University - Verified email at stanford.edu
Tianhe Yu - Google DeepMind - Verified email at google.com
Yevgen Chebotar - Figure AI - Verified email at figure.ai
Aurick Zhou - Waymo - Verified email at berkeley.edu
Rishabh Agarwal - Senior Research Scientist, Google DeepMind - Verified email at google.com
Xue Bin Peng - Assistant Professor, Simon Fraser University, NVIDIA - Verified email at sfu.ca
Kevin Swersky - Google Brain - Verified email at cs.toronto.edu

Aviral Kumar

Google DeepMind

Verified email at berkeley.edu

Machine Learning, Reinforcement Learning


Title	Cited by	Year
Offline reinforcement learning: Tutorial, review, and perspectives on open problems. S Levine, A Kumar, G Tucker, J Fu. arXiv preprint arXiv:2005.01643, 2020	1602	2020
Conservative Q-learning for offline reinforcement learning. A Kumar, A Zhou, G Tucker, S Levine. Advances in Neural Information Processing Systems 33, 1179-1191, 2020	1439	2020
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction. A Kumar, J Fu, G Tucker, S Levine. NeurIPS 2019, arXiv:1906.00949, 2019	908	2019
D4RL: Datasets for deep data-driven reinforcement learning. J Fu, A Kumar, O Nachum, G Tucker, S Levine. arXiv preprint arXiv:2004.07219, 2020	887	2020
Gemini: A family of highly capable multimodal models. Gemini Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	407	2023
Advantage-weighted regression: Simple and scalable off-policy reinforcement learning. XB Peng, A Kumar, G Zhang, S Levine. arXiv preprint arXiv:1910.00177, 2019	406	2019
COMBO: Conservative offline model-based policy optimization. T Yu, A Kumar, R Rafailov, A Rajeswaran, S Levine, C Finn. Advances in Neural Information Processing Systems 34, 28954-28967, 2021	317	2021
Trainable calibration measures for neural networks from kernel mean embeddings. A Kumar, S Sarawagi, U Jain. International Conference on Machine Learning, 2805-2814, 2018	255	2018
Graph Normalizing Flows. J Liu, A Kumar, J Ba, J Kiros, K Swersky. NeurIPS 2019, arXiv:1905.13177, 2019	252*	2019
OPAL: Offline primitive discovery for accelerating offline reinforcement learning. A Ajay, A Kumar, P Agrawal, S Levine, O Nachum. arXiv preprint arXiv:2010.13611, 2020	151	2020
Diagnosing Bottlenecks in Deep Q-learning Algorithms. J Fu, A Kumar, M Soh, S Levine. International Conference on Machine Learning (ICML) 2019, https://arxiv.org …, 2019	144	2019
Conservative safety critics for exploration. H Bharadhwaj, A Kumar, N Rhinehart, S Levine, F Shkurti, A Garg. arXiv preprint arXiv:2010.14497, 2020	118	2020
When should we prefer offline reinforcement learning over behavioral cloning? A Kumar, J Hong, A Singh, S Levine. arXiv preprint arXiv:2204.05618, 2022	113*	2022
DisCor: Corrective feedback in reinforcement learning via distribution correction. A Kumar, A Gupta, S Levine. Advances in Neural Information Processing Systems 33, 18560-18572, 2020	103	2020
COG: Connecting new skills to past experience with offline reinforcement learning. A Singh, A Yu, J Yang, J Zhang, A Kumar, S Levine. arXiv preprint arXiv:2010.14500, 2020	95	2020
Why generalization in RL is difficult: Epistemic POMDPs and implicit partial observability. D Ghosh, J Rahme, A Kumar, A Zhang, RP Adams, S Levine. Advances in Neural Information Processing Systems 34, 25502-25515, 2021	91	2021
Calibration of Encoder Decoder Models for Neural Machine Translation. A Kumar, S Sarawagi. https://arxiv.org/abs/1903.00802, 2019	84	2019
Reward-conditioned policies. A Kumar, XB Peng, S Levine. arXiv preprint arXiv:1912.13465, 2019	81	2019
A workflow for offline model-free robotic reinforcement learning. A Kumar, A Singh, S Tian, C Finn, S Levine. arXiv preprint arXiv:2109.10813, 2021	79	2021
One solution is not all you need: Few-shot extrapolation via structured MaxEnt RL. S Kumar, A Kumar, S Levine, C Finn. Advances in Neural Information Processing Systems 33, 8198-8210, 2020	79	2020

Articles 1–20