David Harwath
Assistant Professor
David Harwath's research interests are in the areas of automatic speech recognition, spoken language understanding, and multi-modal machine learning. His work aims to develop models of speech and language that are robust, flexible, and capable of learning on the fly from multiple input modalities. He holds a B.S. in Electrical Engineering from the University of Illinois at Urbana-Champaign, a S.M. in Computer Science from MIT, and a Ph.D. in Computer Science from MIT.
Research
Research Areas:
Research Interests:
Automatic speech recognition, spoken language understanding, multi-modal and embodied machine learning (speech, environmental sound, vision)
Select Publications
. 2020. Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech. ICLR 2020.
. 2019. Transfer Learning from Audio-Visual Grounding to Speech Recognition. Interspeech.
. 2019. Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio. Interspeech.
. 2019. Learning Words by Drawing Images. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
. 2019. Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input. International Journal of Computer Vision.
Awards & Honors
- 2018 - George M. Sprowls Award for best doctoral thesis in computer science, MIT



