Szymon Sidor
OpenAI
Verified email at openai.com
Title
Cited by
Year
Evolution strategies as a scalable alternative to reinforcement learning
T Salimans, J Ho, X Chen, S Sidor, I Sutskever
arXiv preprint arXiv:1703.03864, 2017
Cited by: 1166; Year: 2017
Learning dexterous in-hand manipulation
OpenAI, M Andrychowicz, B Baker, M Chociej, R Jozefowicz, B McGrew, ...
The International Journal of Robotics Research 39 (1), 3-20, 2020
Cited by: 930; Year: 2020
OpenAI baselines
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
Cited by: 811; Year: 2017
Dota 2 with large scale deep reinforcement learning
C Berner, G Brockman, B Chan, V Cheung, P Dębiak, C Dennison, ...
arXiv preprint arXiv:1912.06680, 2019
Cited by: 782; Year: 2019
Stable baselines
A Hill, A Raffin, M Ernestus, A Gleave, A Kanervisto, R Traore, P Dhariwal, ...
Cited by: 622; Year: 2018
Parameter space noise for exploration
M Plappert, R Houthooft, P Dhariwal, S Sidor, RY Chen, X Chen, T Asfour, ...
arXiv preprint arXiv:1706.01905, 2017
Cited by: 488; Year: 2017
Emergent complexity via multi-agent competition
T Bansal, J Pachocki, S Sidor, I Sutskever, I Mordatch
arXiv preprint arXiv:1710.03748, 2017
Cited by: 315; Year: 2017
Schema networks: Zero-shot transfer with a generative causal model of intuitive physics
K Kansky, T Silver, DA Mély, M Eldawy, M Lázaro-Gredilla, X Lou, ...
International conference on machine learning, 1809-1818, 2017
Cited by: 210; Year: 2017
UCB exploration via Q-ensembles
RY Chen, S Sidor, P Abbeel, J Schulman
arXiv preprint arXiv:1706.01502, 2017
Cited by: 65; Year: 2017
Dota 2 with large scale deep reinforcement learning
OpenAI, C Berner, G Brockman, B Chan, V Cheung, P Dębiak, C Dennison, ...
arXiv preprint arXiv:1912.06680, 2019
Cited by: 55; Year: 2019
OpenAI baselines (2017)
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
URL https://github.com/openai/baselines, 2016
Cited by: 44; Year: 2016
UCB and infogain exploration via Q-ensembles
RY Chen, J Schulman, P Abbeel, S Sidor
arXiv preprint arXiv:1706.01502, 2017
Cited by: 21; Year: 2017
OpenAI baselines
C Hesse, M Plappert, A Radford, J Schulman, S Sidor, Y Wu
Cited by: 19; Year: 2017
Tuning large neural networks via zero-shot hyperparameter transfer
G Yang, E Hu, I Babuschkin, S Sidor, X Liu, D Farhi, N Ryder, J Pachocki, ...
Advances in Neural Information Processing Systems 34, 17084-17097, 2021
Cited by: 7; Year: 2021
Reinforcement learning with natural language signals
S Sidor
Massachusetts Institute of Technology, 2016
Cited by: 5; Year: 2016
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
G Yang, EJ Hu, I Babuschkin, S Sidor, X Liu, D Farhi, N Ryder, J Pachocki, ...
arXiv preprint arXiv:2203.03466, 2022
Cited by: 4; Year: 2022
Time resource networks
S Sidor, P Yu, C Fang, B Williams
arXiv preprint arXiv:1602.03203, 2016
Cited by: 1; Year: 2016
Occam's gates
J Raiman, S Sidor
arXiv preprint arXiv:1506.08251, 2015
Cited by: 1; Year: 2015