Banghua Zhu

Citace

	Všechny	Od 2019
Citace	1304	1302
h-index	16	16
i10-index	24	24

580

290

145

435

20192020202120222023202422 55 100 197 351 564

Veřejný přístup

Zobrazit všechny

16 článků

1 článek

dostupné

nedostupné

Vychází ze zplnomocnění pro financování

Spoluautoři

Jiantao JiaoAssistant Professor of EECS and Statistics, University of California, BerkeleyE-mailová adresa ověřena na: berkeley.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyE-mailová adresa ověřena na: cs.berkeley.edu
Cong MaUniversity of ChicagoE-mailová adresa ověřena na: uchicago.edu
Stuart RussellProfessor of Computer Science, University of California, BerkeleyE-mailová adresa ověřena na: cs.berkeley.edu
Ying ShengPhD student of Stanford UniversityE-mailová adresa ověřena na: stanford.edu
Lianmin ZhengUC BerkeleyE-mailová adresa ověřena na: berkeley.edu
Ion StoicaProfessor of Computer Science, UC BerkeleyE-mailová adresa ověřena na: cs.berkeley.edu
Joseph E. GonzalezProfessor of Computer Science, UC BerkeleyE-mailová adresa ověřena na: berkeley.edu
Paria RashidinejadPostdoctoral Scholar, University of California, BerkeleyE-mailová adresa ověřena na: berkeley.edu
Tianle LiUndergraduate Researcher, UC BerkeleyE-mailová adresa ověřena na: berkeley.edu
Dacheng LiUC BerkeleyE-mailová adresa ověřena na: berkeley.edu
Jacob SteinhardtStanford UniversityE-mailová adresa ověřena na: cs.stanford.edu
Tianhao WuUniversity of California, BerkeleyE-mailová adresa ověřena na: berkeley.edu
Evan FrickUC BerkeleyE-mailová adresa ověřena na: berkeley.edu
Hanlin ZhuPh.D. student, University of California, BerkeleyE-mailová adresa ověřena na: berkeley.edu
Kurt KeutzerProfessor of the Graduate School, EECS, University of California, BerkeleyE-mailová adresa ověřena na: berkeley.edu
Song JianTsinghua UniversityE-mailová adresa ověřena na: tsinghua.edu.cn
Shiyi CaoUC BerkeleyE-mailová adresa ověřena na: berkeley.edu
Ikechukwu UchenduHarvard UniversityE-mailová adresa ověřena na: g.harvard.edu
Lele WangUniversity of British ColumbiaE-mailová adresa ověřena na: ece.ubc.ca

Sledovat

Banghua Zhu

University of California, Berkeley

E-mailová adresa ověřena na: berkeley.edu - Domovská stránka

foundation models human-AI interaction statistics information theory reinforcement learning


Název Seřadit podle citací Seřadit podle roku Seřadit podle názvu	Citace Citace	Rok
Bridging offline reinforcement learning and imitation learning: A tale of pessimism P Rashidinejad, B Zhu, C Ma, J Jiao, S Russell Advances in Neural Information Processing Systems 34, 11702-11716, 2021	288	2021
Deconstructing Generative Adversarial Networks B Zhu, J Jiao, D Tse arXiv preprint arXiv:1901.09465, 2019	141*	2019
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons B Zhu, M Jordan, J Jiao International Conference on Machine Learning, 43037-43067, 2023	117	2023
Joint transceiver optimization for wireless communication PHY using neural network B Zhu, J Wang, L He, J Song IEEE Journal on Selected Areas in Communications 37 (6), 1364-1373, 2019	105	2019
Jump-start reinforcement learning I Uchendu, T Xiao, Y Lu, B Zhu, M Yan, J Simon, M Bennice, C Fu, C Ma, ... International Conference on Machine Learning, 34556-34583, 2023	86	2023
Chatbot arena: An open platform for evaluating llms by human preference WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ... arXiv preprint arXiv:2403.04132, 2024	72	2024
Starling-7B: Improving LLM Helpfulness & Harmlessness with RLAIF B Zhu, E Frick, T Wu, H Zhu, J Jiao https://starling.cs.berkeley.edu/, 2023	48	2023
Generalized resilience and robust statistics B Zhu, J Jiao, J Steinhardt The Annals of Statistics 50 (4), 2256-2283, 2022	48	2022
Robust estimation via generalized quasi-gradients B Zhu, J Jiao, J Steinhardt Information and Inference: A Journal of the IMA 11 (2), 581-636, 2022	43	2022
The sample complexity of online contract design B Zhu, S Bates, Z Yang, Y Wang, J Jiao, MI Jordan arXiv preprint arXiv:2211.05732, 2022	40	2022
S-lora: Serving thousands of concurrent lora adapters Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ... arXiv preprint arXiv:2311.03285, 2023	39	2023
Byzantine-robust federated learning with optimal statistical rates B Zhu, L Wang, Q Pang, S Wang, J Jiao, D Song, MI Jordan International Conference on Artificial Intelligence and Statistics, 3151-3178, 2023	28*	2023
Sparse tensor decomposition for haplotype assembly of diploids and polyploids A Hashemi, B Zhu, H Vikalo BMC genomics 19, 1-15, 2018	27	2018
Fine-tuning language models with advantage-induced policy alignment B Zhu, H Sharma, FV Frujeri, S Dong, C Zhu, MI Jordan, J Jiao arXiv preprint arXiv:2306.02231, 2023	24	2023
Pairwise proximal policy optimization: Harnessing relative feedback for llm alignment T Wu, B Zhu, R Zhang, Z Wen, K Ramchandran, J Jiao arXiv preprint arXiv:2310.00212, 2023	18	2023
When does the Tukey median work? B Zhu, J Jiao, J Steinhardt 2020 IEEE International Symposium on Information Theory (ISIT), 1201-1206, 2020	18	2020
Online learning in stackelberg games with an omniscient follower G Zhao, B Zhu, J Jiao, M Jordan International Conference on Machine Learning, 42304-42316, 2023	16	2023
Minimax off-policy evaluation for multi-armed bandits C Ma, B Zhu, J Jiao, MJ Wainwright IEEE Transactions on Information Theory 68 (8), 5314-5339, 2022	13	2022
Fairness in serving large language models Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024	11	2024
Noisy Sorting Capacity Z Wang, N Ghaddar, B Zhu, L Wang arXiv preprint arXiv:2202.01446, 2023	11	2023

Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.

Články 1–20

Citace za rok

Duplicitní citace

Sloučené citace

Přidat spoluautorySpoluautoři

Sledovat

Citace

Spoluautoři