Publications: 2025
- Text-Guided Interactive Scene Synthesis with Scene Prior Guidance
[Details] [PDF]
Shaoheng Fang, Haitao Yang, Raymond Mooney, Qixing Huang
In European Association for Computer Graphics, May 2025.
- CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
[Details] [PDF]
Jierui Li, Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo
In Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), April 2025.
- MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models
[Details] [PDF]
Vanya Cohen, Raymond Mooney
Preprint, January 2025.
- Temporally Streaming Audio-Visual Synchronization for Real-World Videos
[Details] [PDF]
Jordan Voas, Wei-Cheng Tseng, Layne Berry, Xixi Hu, Puyuan Peng, James Stuedemann, and David Harwath
In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), February 2025.