UTCS Artificial Intelligence
Speech
Spoken Language Technology; language-audio processing
Publications
Temporally Streaming Audio-Visual Synchronization for Real-World Videos
Jordan Voas, Wei-Cheng Tseng, Layne Berry, Xixi Hu, Puyuan Peng, James Stuedemann, and David Harwath.
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025.
Measuring Sound Symbolism in Audio-visual Models
Wei-Cheng Tseng, Yi-Jen Shih, David Harwath, and Raymond Mooney.
IEEE Spoken Language Technology (SLT) Workshop, 2024.
Multimodal Contextualized Semantic Parsing from Speech
Jordan Voas, Raymond Mooney, and David Harwath.
Association for Computational Linguistics (ACL), 2024.
Labs
Machine Learning