Zhehuai Chen

Cited by

	All	Since 2019
Citations	1119	994
h-index	18	16
i10-index	32	27

Citations per year
Year	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	13	13	86	109	83	100	163	398	139

Public access

10 articles available
2 articles not available

Based on funding mandates

Co-authors

Bhuvana Ramabhadran - Manager, Google - Verified email at google.com
Andrew Rosenberg - Google - Verified email at google.com
Kai Yu（俞凯） - Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Yu Zhang - OpenAI - Verified email at csail.mit.edu
Gary Wang - Google - Verified email at google.com
Pedro Moreno Mengibar - Senior Research Director, Google Inc - Verified email at google.com
Yonghui Wu - Google Brain - Verified email at google.com
Ankur Bapna - Software Engineer, Google Deepmind - Verified email at google.com
Yanmin Qian - Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Yongqiang Wang - Research Scientist, Google - Verified email at google.com
Heiga Zen - Principal Scientist (Director), Google DeepMind - Verified email at google.com
Mike Seltzer - Facebook - Verified email at fb.com
Christian Fuegen - Facebook Inc. - Verified email at fb.com
Mahaveer Jain - Facebook - Verified email at fb.com
Jasha Droppo - Amazon - Verified email at amazon.com
Yimeng Zhuang - Samsung Research China - Beijing (SRC-B) - Verified email at samsung.com
Daniel Povey - Chief Speech Scientist, Xiaomi Corp. - Verified email at xiaomi.com
Sanjeev Khudanpur - The Johns Hopkins University - Verified email at jhu.edu
Hainan Xu - NVIDIA - Verified email at nvidia.com
Qi Liu - Tencent - Verified email at tencent.com

Zhehuai Chen

NVIDIA

Verified email at nvidia.com

Speech Recognition, Speech Synthesis


Title	Cited by	Year
Google USM: Scaling automatic speech recognition beyond 100 languages. Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023	121	2023
MAESTRO: Matched speech text representations through modality matching. Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, A Bapna, ... arXiv preprint arXiv:2204.03409, 2022	78	2022
Progressive joint modeling in unsupervised single-channel overlapped speech recognition. Z Chen, J Droppo, J Li, W Xiong. IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (1), 184-196, 2017	77	2017
Knowledge Distillation for Sequence Model. M Huang, Y You, Z Chen, Y Qian, K Yu. Interspeech, 3703-3707, 2018	66	2018
Improving speech recognition using consistent predictions on synthesized speech. G Wang, A Rosenberg, Z Chen, Y Zhang, B Ramabhadran, Y Wu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	55	2020
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen. Interspeech, 3490-3494, 2019	51	2019
End-to-end contextual speech recognition using class language models and a token passing decoder. Z Chen, M Jain, Y Wang, ML Seltzer, C Fuegen. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	50	2019
Phone synchronous speech recognition with CTC lattices. Z Chen, Y Zhuang, Y Qian, K Yu. IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (1), 90-101, 2016	40	2016
On modular training of neural acoustics-to-word model for LVCSR. Z Chen, Q Liu, H Li, K Yu. 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	34	2018
Injecting text in self-supervised speech pretraining. Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, G Wang, P Moreno. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	32	2021
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. Z Chen, A Rosenberg, Y Zhang, G Wang, B Ramabhadran, PJ Moreno. Interspeech, 556-560, 2020	32	2020
Tacotron: Towards end-to-end speech synthesis. Y Wang, R Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017	27	2017
JOIST: A joint speech and text streaming model for ASR. TN Sainath, R Prabhavalkar, A Bapna, Y Zhang, Z Huo, Z Chen, B Li, ... 2022 IEEE Spoken Language Technology Workshop (SLT), 52-59, 2023	26	2023
Phone Synchronous Decoding with CTC Lattice. Z Chen, W Deng, T Xu, K Yu. Interspeech, 1923-1927, 2016	24	2016
Tts4pretrain 2.0: Advancing the use of text and speech in ASR pretraining with consistency and contrastive losses. Z Chen, Y Zhang, A Rosenberg, B Ramabhadran, P Moreno, G Wang. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	21	2022
Sequence discriminative training for deep learning based acoustic keyword spotting. Z Chen, Y Qian, K Yu. Speech Communication 102, 100-111, 2018	21	2018
Sequence modeling in unsupervised single-channel overlapped speech recognition. Z Chen, J Droppo. 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	20	2018
A GPU-based WFST decoder with exact lattice generation. Z Chen, J Luitjens, H Xu, Y Wang, D Povey, S Khudanpur. arXiv preprint arXiv:1804.03243, 2018	19	2018
Accelerating RNN-T training and inference using CTC guidance. Y Wang, Z Chen, C Zheng, Y Zhang, W Han, P Haghani. ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	17	2023
Accented speech recognition: Benchmarking, pre-training, and diverse data. A Aksënova, Z Chen, CC Chiu, D van Esch, P Golik, W Han, L King, ... arXiv preprint arXiv:2205.08014, 2022	16	2022
