Semi-orthogonal low-rank matrix factorization for deep neural networks. D Povey, G Cheng, Y Wang, K Li, H Xu, M Yarmohammadi, S Khudanpur Interspeech, 3743-3747, 2018 | 509 | 2018 |
An Exploration of Dropout with LSTMs. G Cheng, V Peddinti, D Povey, V Manohar, S Khudanpur, Y Yan Interspeech, 1586-1590, 2017 | 129 | 2017 |
Transformer-based online CTC/attention end-to-end speech recognition architecture H Miao, G Cheng, C Gao, P Zhang, Y Yan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 101 | 2020 |
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition H Miao, G Cheng, P Zhang, T Li, Y Yan Proc. Interspeech 2019, 2623-2627, 2019 | 50 | 2019 |
Online hybrid CTC/attention end-to-end automatic speech recognition architecture H Miao, G Cheng, P Zhang, Y Yan IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1452-1465, 2020 | 40 | 2020 |
Output-Gate Projected Gated Recurrent Unit for Speech Recognition G Cheng, D Povey, L Huang, J Xu, S Khudanpur, Y Yan Proc. Interspeech 2018, 1793-1797, 2018 | 22 | 2018 |
Open source magicdata-ramc: A rich annotated mandarin conversational (ramc) speech dataset Z Yang, Y Chen, L Luo, R Yang, L Ye, G Cheng, J Xu, Y Jin, Q Zhang, ... arXiv preprint arXiv:2203.16844, 2022 | 14 | 2022 |
Pre-training transformer decoder for end-to-end asr model with unpaired text data C Gao, G Cheng, R Yang, H Zhu, P Zhang, Y Yan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 14 | 2021 |
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models K Deng, Z Yang, S Watanabe, Y Higuchi, G Cheng, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 13 | 2022 |
Alleviating asr long-tailed problem by decoupling the learning of representation and classification K Deng, G Cheng, R Yang, Y Yan IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 340-354, 2021 | 12 | 2021 |
Eteh: Unified attention-based end-to-end asr and kws architecture G Cheng, H Miao, R Yang, K Deng, Y Yan IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1360-1373, 2022 | 9 | 2022 |
Improving CTC-based speech recognition via knowledge transferring from pre-trained language models K Deng, S Cao, Y Zhang, L Ma, G Cheng, J Xu, P Zhang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Data augmentation based consistency contrastive pre-training for automatic speech recognition C Gao, G Cheng, Y Guo, Q Zhao, P Zhang arXiv preprint arXiv:2112.12522, 2021 | 7 | 2021 |
Using Highway Connections to Enable Deep Small‐footprint LSTM‐RNNs for Speech Recognition G CHENG, X LI, Y YAN Chinese Journal of Electronics 28 (1), 107-112, 2019 | 7 | 2019 |
Keyword search using attention-based end-to-end asr and frame-synchronous phoneme alignments R Yang, G Cheng, H Miao, T Li, P Zhang, Y Yan IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3202-3215, 2021 | 6 | 2021 |
Wav2vec-s: Semi-supervised pre-training for speech recognition H Zhu, L Wang, Y Hou, J Wang, G Cheng, P Zhang, Y Yan arXiv preprint arXiv:2110.04484 11, 2021 | 6 | 2021 |
Automatic speech recognition system with output-gate projected gated recurrent unit G Cheng, P Zhang, J Xu IEICE TRANSACTIONS on Information and Systems 102 (2), 355-363, 2019 | 6 | 2019 |
The conversational short-phrase speaker diarization (cssd) task: Dataset, evaluation metric and baselines G Cheng, Y Chen, R Yang, Q Li, Z Yang, L Ye, P Zhang, Q Zhang, L Xie, ... arXiv preprint arXiv:2208.08042, 2022 | 5 | 2022 |
History utterance embedding transformer lm for speech recognition K Deng, G Cheng, H Miao, P Zhang, Y Yan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 5 | 2021 |
Decoupled Federated Learning for ASR with Non-IID Data H Zhu, J Wang, G Cheng, P Zhang, Y Yan arXiv preprint arXiv:2206.09102, 2022 | 4 | 2022 |