Shixiang Shane Gu

Citace

	Všechny	Od 2019
Citace	26682	24992
h-index	44	44
i10-index	60	60

9000

4500

2250

6750

201620172018201920202021202220232024124 449 1029 1476 2115 2792 3305 6242 8976

Veřejný přístup

Zobrazit všechny

6 článků

0 článků

dostupné

nedostupné

Vychází ze zplnomocnění pro financování

Spoluautoři

Sergey LevineUC Berkeley, Physical IntelligenceE-mailová adresa ověřena na: eecs.berkeley.edu
Richard E TurnerProfessor, University of CambridgeE-mailová adresa ověřena na: cam.ac.uk
Zoubin GhahramaniProfessor, University of Cambridge, and Distinguished Researcher, GoogleE-mailová adresa ověřena na: eng.cam.ac.uk
Ilya SutskeverCo-Founder and Chief Scientist of OpenAIE-mailová adresa ověřena na: openai.com
Andriy MnihResearch Scientist at Google DeepMindE-mailová adresa ověřena na: cs.toronto.edu
Hong GeCambridge UniversityE-mailová adresa ověřena na: cam.ac.uk
Steve MannProfessor of Electrical and Computer Engineering, University of TorontoE-mailová adresa ověřena na: eecg.utoronto.ca

Sledovat

Shixiang Shane Gu

Další jménaShane Gu, Shixiang Gu

Google DeepMind

E-mailová adresa ověřena na: google.com - Domovská stránka

Deep Learning Artificial Intelligence Machine Learning Reinforcement Learning Robotics


Název Seřadit podle citací Seřadit podle roku Seřadit podle názvu	Citace Citace	Rok
Categorical reparameterization with gumbel-softmax E Jang, S Gu, B Poole arXiv preprint arXiv:1611.01144, 2016	5988	2016
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023	2524	2023
Large language models are zero-shot reasoners T Kojima, SS Gu, M Reid, Y Matsuo, Y Iwasawa Advances in neural information processing systems 35, 22199-22213, 2022	2432	2022
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024	2174	2024
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates S Gu, E Holly, T Lillicrap, S Levine 2017 IEEE international conference on robotics and automation (ICRA), 3389-3396, 2017	1869	2017
Continuous deep q-learning with model-based acceleration S Gu, T Lillicrap, I Sutskever, S Levine International conference on machine learning, 2829-2838, 2016	1249	2016
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Towards deep neural network architectures robust to adversarial examples S Gu, L Rigazio arXiv preprint arXiv:1412.5068, 2014	1015	2014
Data-efficient hierarchical reinforcement learning O Nachum, SS Gu, H Lee, S Levine Advances in neural information processing systems 31, 2018	955	2018
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022	879	2022
A minimalist approach to offline reinforcement learning S Fujimoto, SS Gu Advances in neural information processing systems 34, 20132-20145, 2021	660	2021
Dynamics-aware unsupervised discovery of skills A Sharma, S Gu, S Levine, V Kumar, K Hausman arXiv preprint arXiv:1907.01657, 2019	423	2019
Q-prop: Sample-efficient policy gradient with an off-policy critic S Gu, T Lillicrap, Z Ghahramani, RE Turner, S Levine arXiv preprint arXiv:1611.02247, 2016	398	2016
Human-centric dialog training via offline reinforcement learning N Jaques, JH Shen, A Ghandeharioun, C Ferguson, A Lapedriza, ... arXiv preprint arXiv:2010.05848, 2020	384*	2020
Large language models can self-improve J Huang, SS Gu, L Hou, Y Wu, X Wang, H Yu, J Han arXiv preprint arXiv:2210.11610, 2022	316	2022
Temporal difference models: Model-free deep rl for model-based control V Pong, S Gu, M Dalal, S Levine arXiv preprint arXiv:1802.09081, 2018	289	2018
A divergence minimization perspective on imitation learning methods SKS Ghasemipour, R Zemel, S Gu Conference on robot learning, 1259-1277, 2020	279	2020
Sequence tutor: Conservative fine-tuning of sequence generation models with kl-control N Jaques, S Gu, D Bahdanau, JM Hernández-Lobato, RE Turner, D Eck International Conference on Machine Learning, 1645-1654, 2017	260*	2017
Near-optimal representation learning for hierarchical reinforcement learning O Nachum, S Gu, H Lee, S Levine arXiv preprint arXiv:1810.01257, 2018	231	2018
Language as an abstraction for hierarchical deep reinforcement learning Y Jiang, SS Gu, KP Murphy, C Finn Advances in Neural Information Processing Systems 32, 2019	229	2019

Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.

Články 1–20

Citace za rok

Duplicitní citace

Sloučené citace

Přidat spoluautorySpoluautoři

Sledovat

Citace

Spoluautoři