Speech
Spoken Language Technology; language-audio processing
Temporally Streaming Audio-Visual Synchronization for Real-World Videos 2025
Jordan Voas, Wei-Cheng Tseng, Layne Berry, Xixi Hu, Puyuan Peng, James Stuedemann, David Harwath, IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2025).
Measuring Sound Symbolism in Audio-visual Models 2024
Wei-Cheng Tseng, Yi-Jen Shih, David Harwath, Raymond Mooney, IEEE Spoken Language Technology (SLT) Workshop (2024).
Multimodal Contextualized Semantic Parsing from Speech 2024
Jordan Voas, Raymond Mooney, David Harwath, Association for Computational Linguistics (ACL) (2024).