Shruti Palaskar

Cited by

	All	Since 2019
Citations	989	967
h-index	15	15
i10-index	16	16

260

130

195

201720182019202020212022202320243 17 94 135 181 236 254 67

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Ramon SanabriaThe University of EdinburghVerified email at ed.ac.uk
Alan W BlackProfessor, Language Technologies Institute, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Ozan CaglayanImperial College LondonVerified email at imperial.ac.uk
Desmond ElliottAssistant Professor, University of CopenhagenVerified email at di.ku.dk
Lucia SpeciaProfessor, Imperial College London and Chief Scientist at Contex.aiVerified email at imperial.ac.uk
Loïc BarraultResearch Scientist, Meta AIVerified email at fb.com
Jindřich LibovickýCharles UniversityVerified email at ufal.mff.cuni.cz
Spandana GellaAmazon AIVerified email at amazon.com
Mark Hasegawa-JohnsonProfessor of Electrical and Computer Engineering, University of IllinoisVerified email at illinois.edu
Lucas Ondel YangLaboratoire Interdisciplinaire des Sciences du NumériqueVerified email at upsaclay.fr
Odette ScharenborgAssociate Professor, Delft University of Technology, The NetherlandsVerified email at tudelft.nl
Amanda DuarteBarcelona Supercomputing CenterVerified email at bsc.es
Deepti GhadiyaramStaff Research Scientist at RunwayVerified email at runwayml.com
Xavier Giró-i-NietoAmazon Science BarcelonaVerified email at amazon.es
Sandeep KonamAbridgeVerified email at abridge.com
Jordi TorresUPC Barcelona Tech - Barcelona Supercomputing CenterVerified email at ac.upc.edu
Sebastian StükerZoom Video Communications Inc.Verified email at kit.edu
Graham NeubigCarnegie Mellon UniversityVerified email at cs.cmu.edu
laurent besacierProfessor in Computer ScienceVerified email at imag.fr

Shruti Palaskar

Apple

Verified email at apple.com - Homepage

Multimodal Machine Learning Speech Recognition Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
How2: a large-scale dataset for multimodal language understanding R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ... arXiv preprint arXiv:1811.00347, 2018	257	2018
How2sign: a large-scale multimodal dataset for continuous american sign language A Duarte, S Palaskar, L Ventura, D Ghadiyaram, K DeHaan, F Metze, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	135	2021
Multimodal abstractive summarization for how2 videos S Palaskar, J Libovický, S Gella, F Metze arXiv preprint arXiv:1906.07901, 2019	95	2019
Asr error correction and domain adaptation using machine translation A Mani, S Palaskar, NV Meripo, S Konam, F Metze ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	82	2020
Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “speaking rosetta” JSALT 2017 workshop O Scharenborg, L Besacier, A Black, M Hasegawa-Johnson, F Metze, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	52*	2018
End-to-end multimodal speech recognition S Palaskar, R Sanabria, F Metze 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	47	2018
Cmu sinbad’s submission for the dstc7 avsd challenge R Sanabria, S Palaskar, F Metze DSTC7 at AAAI2019 workshop 6, 2019	43	2019
Combining LSTM and latent topic modeling for mortality prediction Y Jo, L Lee, S Palaskar arXiv preprint arXiv:1709.02842, 2017	42	2017
Building an ASR system for a low-research language through the adaptation of a high-resource language ASR system: preliminary results O Scharenborg, F Ciannella, S Palaskar, A Black, F Metze, L Ondel, ... Proc. Internat. Conference on Natural Language, Signal and Speech Processing …, 2017	36	2017
Multimodal grounding for sequence-to-sequence speech recognition O Caglayan, R Sanabria, S Palaskar, L Barraul, F Metze ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	32	2019
Towards understanding ASR error correction for medical conversations A Mani, S Palaskar, S Konam Proceedings of the first workshop on natural language processing for medical …, 2020	31	2020
Multimodal abstractive summarization for open-domain videos J Libovický, S Palaskar, S Gella, F Metze Visually Grounded Interaction and Language (ViGIL), 1-8, 2018	30	2018
Learned in speech recognition: Contextual acoustic word embeddings S Palaskar, V Raunak, F Metze ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	25	2019
Acoustic-to-word recognition with sequence-to-sequence models S Palaskar, F Metze 2018 IEEE Spoken Language Technology Workshop (SLT), 397-404, 2018	22	2018
End-to-end speech summarization using restricted self-attention R Sharma, S Palaskar, AW Black, F Metze ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	15	2022
Learning from multiview correlations in open-domain videos N Holzenberger, S Palaskar, P Madhyastha, F Metze, R Arora ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	15	2019
Multimodal Speech Summarization Through Semantic Concept Learning. S Palaskar, R Salakhutdinov, AW Black, F Metze Interspeech, 791-795, 2021	9	2021
Transfer learning for multimodal dialog S Palaskar, R Sanabria, F Metze Computer Speech & Language 64, 101093, 2020	8	2020
Speech summarization using restricted self-attention R Sharma, S Palaskar, AW Black, F Metze arXiv preprint arXiv:2110.06263, 2021	5	2021
Grounded sequence to sequence transduction L Specia, L Barrault, O Caglayan, A Duarte, D Elliott, S Gella, ... IEEE journal of selected topics in signal processing 14 (3), 577-591, 2020	5	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors