My research interests are in the area of machine learning for speech, language, and sound processing. I am particularly interested in multimodality and unsupervised/self-supervised approaches that can benefit low-resource languages and domains. At UT, I lead the Speech, Audio, and Language Technologies (SALT) Lab.
Prior to joining UT, I worked as a research scientist at MIT CSAIL from 2018 to 2020. I recieved my PhD in 2018 from the Spoken Language Systems Group at MIT CSAIL, under the supervision of Jim Glass.
Research
Publications
Please see my Google Scholar page for an up-to-date list of my publications.
Datasets
Please visit this website to download all of the spoken caption corpora I have collected (as well as modeling code).
Miscellaneous
Media Coverage
Fun Stuff
My Adacemic Genealogy
- David Frank Harwath, Massachusetts Institute of Technology 2018
- James Robert Glass, Massachusetts Institute of Technology 1988
- Victor Waito Zue, Massachusetts Institute of Technology 1976
- Kenneth Noble Stevens, Massachusetts Institute of Technology 1952
- Leo Leroy Beranek, Harvard University 1940
- Frederick Vinton Hunt, Harvard University 1934
- George Washington Pierce, Harvard University 1900
- John Trowbridge, Harvard University 1873
- Joseph Lovering, Harvard University 1833
- Benjamin Peirce, Harvard University 1829
- Nathaniel Bowditch, Harvard University 1802
According to the CSAIL Genealogy Project, there may be an additional connection between Hunt and Saunders that traces back to Helmholtz: