Overview of the design approach and prioritization of R&D activities towards an EU DEMO G Federici, C Bachmann, W Biel, L Boccaccini, F Cismondi, S Ciattaglia, ... Fusion Engineering and Design 109, 1464-1474, 2016 | 243 | 2016 |
Acoustic Modeling for Google Home. B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ... Interspeech, 399-403, 2017 | 169 | 2017 |
Recurrent neural aligner: An encoder-decoder neural network model for sequence to sequence mapping. H Sak, M Shannon, K Rao, F Beaufays Interspeech 8, 1298-1302, 2017 | 119 | 2017 |
Location-relative attention mechanisms for robust long-form speech synthesis E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 101 | 2020 |
Autoregressive models for statistical parametric speech synthesis M Shannon, H Zen, W Byrne IEEE transactions on audio, speech, and language processing 21 (3), 587-597, 2012 | 72 | 2012 |
Optimizing expected word error rate via sampling for speech recognition M Shannon arXiv preprint arXiv:1706.02776, 2017 | 54 | 2017 |
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech GE Henter, T Merritt, M Shannon, C Mayo, S King Fifteenth Annual Conference of the International Speech Communication …, 2014 | 44 | 2014 |
Effective use of variational embedding capacity in expressive end-to-end speech synthesis E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ... arXiv preprint arXiv:1906.03402, 2019 | 42 | 2019 |
Semi-supervised generative modeling for controllable speech synthesis R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ... arXiv preprint arXiv:1910.01709, 2019 | 41 | 2019 |
Improved End-of-Query Detection for Streaming Speech Recognition. M Shannon, G Simko, SY Chang, C Parada Interspeech, 1909-1913, 2017 | 38 | 2017 |
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project M Wester, J Dines, M Gibson, H Liang, YJ Wu, L Saheer, S King, K Oura, ... Proceedings of the 7th ISCA Speech Synthesis Workshop, 2010 | 30 | 2010 |
The effect of using normalized models in statistical speech synthesis M Shannon, H Zen, W Byrne ISCA (International Speech Communication Association), 2011 | 22 | 2011 |
Encoder-decoder models for sequence to sequence mapping H Sak, SM Shannon US Patent 10,706,840, 2020 | 19 | 2020 |
Personalising speech-to-speech translation in the EMIME project M Kurimo, W Byrne, J Dines, PN Garner, M Gibson, Y Guan, T Hirsimäki, ... Proceedings of the ACL 2010 System Demonstrations, 48-53, 2010 | 17 | 2010 |
Personalising speech-to-speech translation in the EMIME project M Kurimo, W Byrne, J Dines, PN Garner, M Gibson, Y Guan, T Hirsimäki, ... Proceedings of the ACL 2010 System Demonstrations, 48-53, 2010 | 17 | 2010 |
Speaker generation D Stanton, M Shannon, S Mariooryad, RJ Skerry-Ryan, E Battenberg, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 16 | 2022 |
On the EU approach for DEMO architecture exploration and dealing with uncertainties M Coleman, F Maviglia, C Bachmann, J Anthony, G Federici, M Shannon, ... Fusion Engineering and Design 109, 1158-1162, 2016 | 16 | 2016 |
Fast, low-artifact speech synthesis considering global variance M Shannon, W Byrne 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 15 | 2013 |
Autoregressive HMMs for speech synthesis SM Shannon, WJ Byrne ISCA (International Speech Communication Association), 2009 | 15 | 2009 |
End of query detection G Simko, MCP San Martin, SM Shannon US Patent 10,593,352, 2020 | 13 | 2020 |