F.A.I.

The Forum for Artificial Intelligence meets every other week (or so) to discuss scientific, philosophical, and cultural issues in artificial intelligence. Both technical research topics and broader interdisciplinary aspects of AI are covered, and all are welcome to attend! Recordings will be made available online by the end of the day on each Friday there is a talk.

If you would like to be added to the FAI mailing list, subscribe here. If you have any questions or comments, please send an email to Catherine Andersson.

Upcoming Talks

To Be Announced

Check back soon!

Past Talks 2023-2024

Friday, August 25, 2023, 11:00 AM
GDC 6.302 | Recording

Models of Human Preference for RLHF

Brad Knox [homepage]
Research Fellow, University of Texas at Austin

Abstract:
The utility of reinforcement learning is limited by the alignment of reward functions with the interests of human stakeholders. One promising method for alignment is to learn the reward function from human-generated preferences between pairs of trajectory segments, a type of reinforcement learning from human feedback (RLHF). These human preferences are typically assumed to be informed solely by partial return, the sum of rewards along each segment. I will discuss why this assumption is flawed and will propose modeling human preferences instead as informed by each segment's regret, a measure of a segment's deviation from optimal decision-making. After covering theoretical and empirical comparisons of the two models of human preference, I will end with a novel framing of the common RLHF algorithm used to fine-tune large language models (LLMs) based upon the regret preference model, explaining and removing the undesirable assumption that multi-turn interaction, a sequential decision-making task, should be tuned with reward in a bandit environment.
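For readers unfamiliar with the two preference models, the sketch below contrasts them on a toy example. It is a minimal illustration under assumed definitions (a Bradley-Terry-style choice model with stand-in reward and value arrays), not the speaker's implementation.

```python
import numpy as np

def preference_prob(score_a, score_b):
    """Bradley-Terry / logistic model: P(segment A preferred over segment B)."""
    return 1.0 / (1.0 + np.exp(-(score_a - score_b)))

def partial_return(rewards):
    """Sum of rewards along a segment."""
    return float(np.sum(rewards))

def negated_regret(rewards, values, gamma=1.0):
    """Illustrative regret-style score: how close a segment is to optimal
    decision-making, approximated here with value estimates at the segment's
    endpoints (values[0] at the start, values[-1] at the end)."""
    return partial_return(rewards) + gamma * values[-1] - values[0]

# Toy example: two segments with identical partial return but different
# value estimates, so the two models disagree about which is preferred.
rew_a, val_a = np.array([1.0, 0.0, 1.0]), np.array([5.0, 4.0, 4.0, 6.0])
rew_b, val_b = np.array([0.0, 1.0, 1.0]), np.array([2.0, 2.0, 3.0, 2.0])

print("partial-return model:",
      preference_prob(partial_return(rew_a), partial_return(rew_b)))
print("regret model:",
      preference_prob(negated_regret(rew_a, val_a), negated_regret(rew_b, val_b)))
```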

About the speaker:
Brad is a research scientist at the University of Texas at Austin. His research has largely focused on the human side of reinforcement learning. He is currently concerned with how humans can specify reward functions that are aligned with their interests. Brad’s dissertation, “Learning from Human-Generated Reward”, comprised early pioneering work on human-in-the-loop reinforcement learning and won the 2012 best dissertation award from the UT Austin Department of Computer Science. His postdoctoral research at the MIT Media Lab focused on creating interactive characters through machine learning on puppetry-style demonstrations of interaction. Stepping away from research during 2015–2018, Brad founded and sold his startup Bots Alive, working in the toy robotics sector. In recent years, Brad co-led the Bosch Learning Agents Lab at UT Austin and was a Senior Research Scientist at Google. He has won multiple best paper awards and was named to IEEE Intelligent Systems' AI's 10 to Watch in 2013.

Friday, September 1, 2023, 11:00 AM
GDC 6.302 | Recording

Deployable Reinforcement Learning: Dealing with Challenges of Sparse Rewards in Robotics via Heavy-Tailed Policy Gradient

Amrit Singh Bedi [homepage]
Research Scientist, University of Maryland

Abstract:
Recent advancements in Artificial Intelligence (AI), such as AlphaZero and ChatGPT, have significantly impacted various fields. Reinforcement learning (RL) plays a crucial role in these achievements. However, deploying RL in real-world applications, including robotics, finance, and healthcare, presents challenges such as efficient exploration, scalability, domain adaptation, and safety. One key aspect common to all these challenges in RL is the design of effective reward functions, which are often assumed to be known but remain elusive in practice. In this talk, we will discuss our recent results in addressing these challenges, specifically focusing on sparse rewards in robotic applications. While designing sparse rewards may seem easier, it introduces significant exploration challenges that make traditional algorithms inefficient. To tackle this, we propose heavy-tailed policy gradient algorithms, which provide a promising solution. We derive precise sample complexity bounds for the proposed algorithms and demonstrate their effectiveness in both simulators and real robots.
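To convey the intuition behind heavy-tailed exploration, the sketch below compares a Gaussian and a heavy-tailed Cauchy action distribution on a toy sparse-reward problem. It illustrates the general idea rather than the speaker's algorithm; all names and numbers are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_actions(mean, scale, n, heavy_tailed):
    """Gaussian policy vs. heavy-tailed Cauchy policy around the same mean."""
    if heavy_tailed:
        return mean + scale * rng.standard_cauchy(n)   # heavy tails: rare, large jumps
    return rng.normal(mean, scale, n)                  # light tails: stays near the mean

def sparse_reward(action, goal=8.0, tol=0.5):
    """Reward is 1 only in a narrow region far from the initial policy mean."""
    return float(abs(action - goal) < tol)

# With a light-tailed policy centered at 0, the goal at 8 is essentially never
# reached, so the policy-gradient signal is zero; the Cauchy policy still
# occasionally lands in the rewarding region and produces a learning signal.
for heavy in (False, True):
    actions = sample_actions(0.0, 1.0, 100_000, heavy)
    rewards = np.array([sparse_reward(a) for a in actions])
    print("heavy-tailed" if heavy else "gaussian",
          "fraction of rewarded actions:", rewards.mean())
```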

About the speaker:
Amrit Singh Bedi is a Research Scientist at the University of Maryland (UMD), College Park, MD, USA. Prior to his time at UMD, he worked with Dr. Alec Koppel and Dr. Brian Sadler at the US Army Research Laboratory, where he gained valuable experience and insight into the real-world applications of his research. He earned his Ph.D. in Electrical Engineering from the Indian Institute of Technology (IIT) Kanpur under the supervision of Prof. Ketan Rajawat, where his thesis focused on distributed and online learning with stochastic gradient methods.

Friday, September 8, 2023, 11:00 AM
GDC 6.302 | Recording

Learning Generalizable and Interpretable Embodied AI from Humans


Bolei Zhou [homepage]
Assistant Professor, University of California, Los Angeles

Abstract:
The recent progress of deep learning paves the way for embodied AI to go beyond visual recognition and become an active participant that interacts with its environment. However, it remains challenging to ensure the AI's generalizability to unseen situations and its alignment with human intents. I will talk about our work on incorporating humans in the learning of embodied agents for driving and locomotion. I will show that human interaction not only brings high sample efficiency and safety to the learning process but also facilitates the alignment of agent behaviors and human-AI shared control. Lastly, I will briefly introduce our ongoing effort to build an open-source driving simulator called MetaDriverse for generalizable embodied AI, which incorporates a massive number of real-world scenarios and learns to generate novel ones.

About the speaker:
Bolei Zhou is an Assistant Professor in the Computer Science Department at the University of California, Los Angeles (UCLA). He earned his Ph.D. from MIT in 2018. His recent research interest lies at the intersection of computer vision and machine autonomy, focusing on enabling interpretable human-AI interaction. He has developed many widely used neural network interpretation methods, such as CAM and Network Dissection, as well as the computer vision benchmarks Places and ADE20K. He has received MIT Tech Review's Innovators under 35 in Asia-Pacific Award and Intel's Rising Star Faculty Award. More details are available on his webpage: https://boleizhou.github.io/.

Friday, September 22, 2023, 11:00 AM
GDC 6.302 | Recording

Teach Large Neural Models to Self-Learn Symbolic Knowledge

Heng Ji [homepage]
Professor, University of Illinois at Urbana-Champaign

Abstract:
Recent large neural models have shown impressive performance on various data modalities, including natural language, vision, programming languages, and molecules. However, they still show surprising deficiencies (near-random performance) in acquiring certain types of knowledge, such as structured knowledge and action knowledge. In this talk I propose a two-way knowledge acquisition framework that lets symbolic and neural learning approaches enhance each other. In the first stage, we elicit and acquire explicit symbolic knowledge from large neural models. In the second stage, we leverage the acquired symbolic knowledge along with external knowledge to augment and enhance these large neural models.

I will present three recent case studies to demonstrate this framework:
(1) The first task is to induce event schemas (stereotypical structures of events and their connections) from large language models by incremental prompting and verification [Li et al., ACL2023], and apply the induced schemas to enhance event extraction and event prediction.
(2) In the second task, we noticed that current large video-language models rely on object recognition abilities as a shortcut for action understanding. We utilize a Knowledge Patcher network to elicit new action knowledge from the current models by designing specialized probing tasks and loss functions, and a Knowledge Fuser component to integrate the Patcher into frozen video-language models.
(3) In the third task, we use large-scale molecule-language models to discover molecule subgraph structures ("building blocks") that contribute to blood-brain barrier permeability in the kinase inhibitor family, and propose several candidate kinase inhibitor variants with improved ability to pass the blood-brain barrier, accelerating drug discovery. We then encode such graph-pattern knowledge using lightweight adapter modules, bottleneck feed-forward networks that are inserted into different locations of the backbone large molecule-language models (a generic adapter sketch follows below).
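For the adapter modules mentioned in (3), a generic bottleneck adapter in PyTorch looks roughly like the following sketch. This is the standard adapter pattern from the literature, not the specific module from the talk, and the hidden sizes and placement are illustrative assumptions.

```python
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Lightweight adapter: down-project, nonlinearity, up-project, residual add.
    Inserted after a frozen backbone sub-layer so that only the adapter is trained."""
    def __init__(self, hidden_dim: int, bottleneck_dim: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)
        self.act = nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(self.act(self.down(x)))  # residual keeps the backbone output intact

# Usage: wrap a frozen backbone layer's output (shapes are assumptions).
hidden = torch.randn(2, 16, 768)           # (batch, tokens, hidden_dim) from a frozen backbone
adapter = BottleneckAdapter(hidden_dim=768)
patched = adapter(hidden)                  # only adapter parameters would receive gradients
```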

About the speaker:
Heng Ji is a professor in the Computer Science Department, and an affiliated faculty member of the Electrical and Computer Engineering Department and the Coordinated Science Laboratory, at the University of Illinois Urbana-Champaign. She is an Amazon Scholar and the Founding Director of the Amazon-Illinois Center on AI for Interactive Conversational Experiences (AICE). She received her B.A. and M.A. in Computational Linguistics from Tsinghua University, and her M.S. and Ph.D. in Computer Science from New York University. Her research interests focus on Natural Language Processing, especially Multimedia Multilingual Information Extraction, Knowledge-enhanced Large Language Models, Knowledge-driven Generation, and Conversational AI.

She was selected as a Young Scientist to attend the 6th World Laureates Association Forum and to participate in DARPA AI Forward in 2023, and was selected as a "Young Scientist" and a member of the Global Future Council on the Future of Computing by the World Economic Forum in 2016 and 2017. She was named one of the Women Leaders of Conversational AI (Class of 2023) by Project Voice. Her awards include the "AI's 10 to Watch" Award from IEEE Intelligent Systems in 2013, an NSF CAREER award in 2009, the PACLIC2012 Best Paper runner-up, "Best of ICDM2013" and "Best of SDM2013" paper awards, an ACL2018 Best Demo paper nomination, the ACL2020 Best Demo Paper Award, the NAACL2021 Best Demo Paper Award, Google Research Awards in 2009 and 2014, IBM Watson Faculty Awards in 2012 and 2014, and a Bosch Research Award in 2014-2018.

She was invited by the Secretary of the U.S. Air Force and AFRL to join the Air Force Data Analytics Expert Panel to inform the Air Force Strategy 2030, and was invited to speak at the Federal Information Integrity R&D Interagency Working Group (IIRD IWG) briefing in 2023. She leads many multi-institution projects and tasks, including the U.S. ARL projects on information fusion and knowledge network construction, the DARPA ECOLE MIRACLE team, the DARPA KAIROS RESIN team, and the DARPA DEFT Tinker Bell team. She has coordinated the NIST TAC Knowledge Base Population task since 2010. She was an associate editor for IEEE/ACM Transactions on Audio, Speech, and Language Processing, and has served as Program Committee Co-Chair of many conferences, including NAACL-HLT2018 and AACL-IJCNLP2022. She was elected secretary of the North American Chapter of the Association for Computational Linguistics (NAACL) for 2020-2023. Her research has been widely supported by U.S. government agencies (DARPA, NSF, DoE, ARL, IARPA, AFRL, DHS) and industry (Amazon, Google, Facebook, Bosch, IBM, Disney).

Friday, October 6, 2023, 11:00 AM
GDC 6.302 | Recording

High-Speed Off-Road Autonomy

Byron Boots [homepage]
Amazon Professor of Machine Learning, University of Washington

Abstract:
State-of-the-art self-driving technology takes advantage of the fact that a vehicle's interactions with the road are engineered to be simple, repeatable, and therefore predictable. By contrast, natural terrain lacks man-made structure and contains features rarely encountered in on-road driving, including vegetation, uneven and low-friction surfaces, reduced visibility, obstacles, and rapidly changing terrain surface properties. When navigating at high speed in these conditions, existing approaches to perception, planning, and control fail. In this talk I'll present some of our recent work on high-speed off-road autonomous driving. I'll discuss advances in perception, planning, and control for complex natural and degraded terrain, and demonstrate the resulting capabilities as we race full-sized vehicles across multiple biomes, including deserts, hills, and forests.

About the speaker:
Byron Boots is the Amazon Professor of Machine Learning in the Allen School of Computer Science and Engineering at the University of Washington, where he directs the Robot Learning Laboratory. He is also co-founder and CEO of Overland AI, a Seattle-based startup building off-road ground vehicle autonomy. Byron's research is in machine learning, artificial intelligence, and robotics, with a focus on developing theory and systems that tightly integrate perception, learning, and control. He has published over 150 technical papers and has been honored with several awards for his work, including "Best Paper" awards at ICML, AISTATS, RSS, IJRR, and RAL, the DARPA Young Faculty Award, the NSF CAREER Award, and the Robotics: Science and Systems Early Career Award. Byron received his PhD from the Machine Learning Department at Carnegie Mellon University.

Friday, October 13, 2023, 11:00 AM
GDC 6.302 | Recording

Unlocking Lifelong Robot Learning with Modularity

Jorge Mendez-Mendez [homepage]
Postdoctoral Fellow, Massachusetts Institute of Technology

Abstract:
Embodied intelligence is the ultimate lifelong learning problem. If you had a robot in your home, you would likely ask it to do all sorts of varied chores, like setting the table for dinner, preparing lunch, and doing a load of laundry. The things you would ask it to do might also change over time, for example to use new appliances. You would want your robot to learn to do your chores and adapt to any changes quickly. In this talk, I will explain how we can leverage various forms of modularity that arise in robot systems to develop powerful lifelong learning mechanisms. My talk will then dive into two algorithms that exploit these notions. The first approach operates in a pure reinforcement learning setting using modular neural networks. In this context, I will also introduce a new benchmark domain designed to assess the compositional capabilities of reinforcement learning methods for robots. The second method operates in a novel, more structured framework for task and motion planning systems. I will close my talk by describing a vision for how we can construct the next generation of home assistant robots that leverage large-scale data to continually improve their own capabilities.
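As a rough picture of the modular idea described above (a generic sketch, not the speaker's method), the snippet below shows a shared library of neural modules from which each task composes its own policy, so new tasks can reuse previously learned modules; all sizes and names are assumptions.

```python
import torch
import torch.nn as nn

class ModularPolicy(nn.Module):
    """A shared library of modules; each task composes a fixed subset of them.
    A lifelong learner can add modules for new tasks while reusing old ones."""
    def __init__(self, obs_dim, act_dim, num_modules=4, hidden=64):
        super().__init__()
        self.encoder = nn.Linear(obs_dim, hidden)
        self.library = nn.ModuleList(
            [nn.Sequential(nn.Linear(hidden, hidden), nn.ReLU()) for _ in range(num_modules)]
        )
        self.head = nn.Linear(hidden, act_dim)

    def forward(self, obs, module_ids):
        h = torch.relu(self.encoder(obs))
        for i in module_ids:          # task-specific composition of shared modules
            h = self.library[i](h)
        return self.head(h)

# Two tasks that share module 0 but differ in their second module.
policy = ModularPolicy(obs_dim=10, act_dim=4)
obs = torch.randn(5, 10)
print(policy(obs, module_ids=[0, 1]).shape)  # task A
print(policy(obs, module_ids=[0, 2]).shape)  # task B
```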

About the speaker:
Jorge Mendez-Mendez is a postdoctoral fellow at MIT CSAIL. He received his Ph.D. (2022) and M.S.E. (2018) from the GRASP Lab at the University of Pennsylvania, and his Bachelor's degree (2016) in Electronics Engineering from Universidad Simon Bolivar in Venezuela. His research focuses on creating versatile, intelligent, embodied agents that accumulate knowledge over their lifetimes, leveraging techniques from transfer and multitask learning, modularity and compositionality, reinforcement learning, and task and motion planning. His work has been recognized with an MIT-IBM Distinguished Postdoctoral Fellowship, a third-place prize in the Two Sigma Ph.D. Diversity Fellowship, and a Best Paper Award at the Lifelong Machine Learning Workshop (ICML).

Friday, October 27, 2023, 11:00 AM
GDC 6.302 | Recording

Intelligence Augmentation: Effective Human-AI Interaction to Supercharge Scientific Research

Daniel Weld [homepage]
Chief Scientist & General Manager of Semantic Scholar, Allen Institute for AI | Professor Emeritus at the University of Washington

Abstract:
Recent advances in Artificial Intelligence are powering revolutionary interactive tools that will transform the capabilities of everyone, especially knowledge workers. But in order to create synergy, where humans’ augmented intelligence and creativity reaches its true potential, we need improved interaction methods. AI presents several challenges to existing UI paradigms, including nondeterminism, inexplicable behavior, and significant errors (such as LLM hallucinations). We discuss principles and pitfalls for effective human-AI interaction, grounding our discussion in Semantic Scholar - a free, open, AI-powered scientific discovery platform aimed at augmenting the intelligence of human researchers.

About the speaker:
Daniel S. Weld is Chief Scientist and General Manager of Semantic Scholar at the Allen Institute for Artificial Intelligence and Professor Emeritus at the University of Washington. After formative education at Phillips Academy, he received bachelor's degrees in both Computer Science and Biochemistry from Yale University in 1982. He received a Ph.D. from the MIT Artificial Intelligence Lab in 1988, a Presidential Young Investigator's award in 1989, and an Office of Naval Research Young Investigator's award in 1990. He is a Fellow of the Association for the Advancement of Artificial Intelligence (AAAI), the American Association for the Advancement of Science (AAAS), and the Association for Computing Machinery (ACM). Dan was a founding editor of the Journal of AI Research, an area editor for the Journal of the ACM, and a member of the editorial board of the Artificial Intelligence journal. Weld is a Venture Partner at the Madrona Venture Group and has co-founded three companies: Netbot (sold to Excite), Adrelevance (sold to Media Metrix), and Nimble Technology (sold to Actuate).

Friday, November 3, 2023, 11:00 AM
GDC 6.302 | Recording

Replicating and Auditing Black-box Language Models

Tatsu Hashimoto [homepage]
Assistant Professor, Stanford University

Abstract:
Instruction-following language models have driven remarkable progress in a range of NLP tasks and have been rapidly adopted across the world. However, academic research into these models has lagged behind due to the lack of open, reproducible, and low-cost environments with which to develop and test instruction-following models. In this talk, I will discuss how new, emerging approaches that study an LLM's ability to emulate human annotators and API endpoints hold promise in improving and critiquing LLMs.

To improve instruction-following methods, recent work from our group such as AlpacaFarm shows how an LLM-based simulator can help test scientific hypotheses (e.g., is reinforcement learning helpful?), develop better instruction-following methods, and red-team LLMs in a more open and reproducible way. At the same time, there are major limits to LLMs' ability to simulate annotators, such as in the opinions they reflect or the consistency of their responses, and we will discuss how these gaps raise important open problems in the trustworthiness of existing LLMs.

About the speaker:
Tatsunori Hashimoto is an Assistant Professor in the Computer Science Department at Stanford University. He is a member of the statistical machine learning and natural language processing groups at Stanford, and his research uses tools from statistics to make machine learning systems more robust and trustworthy — especially in complex systems such as large language models. He is a Kavli fellow, a Sony and Amazon research award winner, and his work has been recognized with best paper awards at ICML and CHI.

Friday, December 1, 2023, 11:00 AM
GDC 6.302 | Recording

From Sparse to Dense, and back to Sparse again?

Fuxin Li [homepage]
Associate Professor, Oregon State University

Abstract:
Computer vision architectures used to be built on a sparse sample of points in the 80s and 90s. In the 2000s, dense models became popular for visual recognition because heuristically defined sparse models do not cover all the important parts of an image. However, with deep learning and end-to-end training, this no longer has to be the case, and sparse models may still have significant advantages in saving unnecessary computation as well as being more flexible. In this talk, I will describe the deep point cloud convolutional backbones that we have developed in the past few years, including our most recent work, PointConvFormer, which outperforms grid-based convolutional approaches. As applications of these point-based networks, I will discuss two recent works, including AutoFocusFormer, which uses point cloud backbones and decoders for 2D image recognition, with a novel adaptive downsampling module that enables end-to-end learning of adaptive downsampling. This is very helpful for detecting tiny objects far away in the scene that would otherwise be decimated by conventional grid downsampling. Finally, I will illustrate the use of point convolution backbones in generative models with recent work on diverse point cloud completion.
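For readers new to point-based backbones, the sketch below shows a generic point convolution in the spirit of this line of work: a small MLP maps each neighbor's relative coordinates to aggregation weights, which are applied to that neighbor's features. It is a simplified illustration, not the PointConvFormer architecture; the shapes and the naive k-NN step are assumptions.

```python
import torch
import torch.nn as nn

class SimplePointConv(nn.Module):
    """Generic point convolution: an MLP maps the relative coordinates of each
    neighbor to aggregation weights, which are applied to neighbor features."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.weight_mlp = nn.Sequential(nn.Linear(3, 32), nn.ReLU(), nn.Linear(32, in_ch))
        self.project = nn.Linear(in_ch, out_ch)

    def forward(self, xyz, feats, neighbor_idx):
        # xyz: (N, 3) point coordinates, feats: (N, C), neighbor_idx: (N, K)
        nbr_xyz = xyz[neighbor_idx]                      # (N, K, 3)
        nbr_feats = feats[neighbor_idx]                  # (N, K, C)
        rel = nbr_xyz - xyz.unsqueeze(1)                 # relative neighbor coordinates
        weights = self.weight_mlp(rel)                   # (N, K, C) data-dependent weights
        aggregated = (weights * nbr_feats).sum(dim=1)    # weighted sum over neighbors
        return self.project(aggregated)                  # (N, out_ch)

# Toy usage with random points and a naive k-NN (for illustration only).
N, K = 256, 16
xyz = torch.randn(N, 3)
feats = torch.randn(N, 8)
neighbor_idx = torch.cdist(xyz, xyz).topk(K, largest=False).indices  # (N, K)
out = SimplePointConv(8, 32)(xyz, feats, neighbor_idx)
print(out.shape)  # torch.Size([256, 32])
```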

About the speaker:
Fuxin Li is currently an associate professor in the School of Electrical Engineering and Computer Science at Oregon State University. He has held research positions at Apple Inc., the University of Bonn, and the Georgia Institute of Technology. He obtained his Ph.D. from the Institute of Automation, Chinese Academy of Sciences, in 2009. He has won an NSF CAREER award and an Amazon Research Award, (co-)won the PASCAL VOC semantic segmentation challenges from 2009 to 2012, and led a team to a fourth-place finish in the DAVIS Video Segmentation challenge 2017. He has published more than 70 papers in computer vision, machine learning, and natural language processing. His main research interests are point cloud deep networks, human understanding of deep learning, video object segmentation, multi-target tracking, and uncertainty estimation in deep learning.

Friday, February 2, 2024, 11:00 AM
GDC 6.302 | Recording

Biosignal-based Digital Biomarkers for Aging

Najim Dehak [homepage]
Associate Professor, Johns Hopkins University

Abstract:
According to the US Census Bureau, there are currently more Americans aged 65 and older (over 49 million) than at any other time in history. A significant increase in individuals with severe chronic conditions will have profound social and economic effects on society. Three aspects describe the human aging process: functional (the motor system), cognitive, and behavioral (social and psychological stressors). In this talk, we will describe several tools to detect, assess, and monitor the functional and cognitive decline of elderly adults. These tools, called digital biomarkers, are based on multimodal biosignals such as speech, handwriting, and eye movement. In addition, we will describe our current work on emotion recognition from speech, which can be used to assess social and psychological stressors.

About the speaker:
An expert in machine learning and speech processing/speaker identification, Prof. Najim Dehak is internationally known as the lead developer of the i-vector, a factor analysis-based speaker recognition technique. His research focuses on speech processing and modeling, audio segmentation, and speaker, language, and emotion recognition. One of his interests has been building robust emotion detection systems that can be useful in several areas, including call centers, mental health, and social applications. He is also currently interested in topics related to human aging, for which he and his team are developing non-invasive, artificial intelligence-based tools to detect, assess, and monitor the functional and cognitive decline of elderly adults. Dr. Dehak is an associate professor of electrical and computer engineering at Johns Hopkins University. Prior to joining JHU, he was a research scientist in the Spoken Language Systems Group at the MIT Computer Science and Artificial Intelligence Laboratory.

Friday, March 22, 2024, 11:00 AM
GDC 4.304 | Recording

Co-hosted with UT Good Systems

Conceptualising Trust – and what it means for Artificial Intelligence

Joel Fischer [homepage]
University of Nottingham

Abstract:
The trustworthy development and use of AI is no longer optional; it is now mandated by governments around the world, including in the US and the UK. But what does it mean to ‘trust AI’, and how do we build systems that are worthy of our trust? In this talk I will go back to basics and unpack the sociological concept of trust. On the basis of an understanding of trust as a precondition for action, I then present case studies of AI adoption. Our studies of contact-tracing apps and robotic disinfection during the pandemic in the UK, and of the social media discourse surrounding the launch of ChatGPT, highlight the contextual role trust plays in the (non-)adoption of AI. I will close with some considerations on the design of trustworthy systems.

About the speaker:
Dr. Joel Fischer is Professor of Human-Computer Interaction (HCI) at the School of Computer Science, University of Nottingham, UK, and Research Director of the UKRI Trustworthy Autonomous Systems (TAS) Hub and of Responsible AI UK (RAI UK), a new national programme on Responsible and Trustworthy AI in the UK. He is a Visiting Professor at the University of Texas at Austin and co-leads the Strategic Partnership project between Good Systems at UT Austin and the TAS Hub in the UK. His research in Human-AI Interaction takes a human-centred view to understand the adoption and embedding of AI-infused technologies into everyday life and work. He has a particular interest in language- and speech-based interaction and is known for his work on the empirical study of conversational interfaces. The technologies he studies include collaborative robotics and mobile, IoT, and web-based applications, in a diverse range of applications and settings, from digital contact-tracing to robotic telepresence, home life, disaster response, and Large Language Models (LLMs) in legal advice. He has published more than 120 articles in journals and conferences across AI and HCI, including CHI, ToCHI, IJHCI, IJCAI, JAIR, AAMAS, IMWUT, CSCW, JMIR, AI and Ethics, CUI, RO-MAN and HRI.

Monday, April 8, 2024, 11:00 AM
GDC 4.304 | Recording

We are (still!) not giving Data enough credit!

Alyosha Efros [homepage]
University of California, Berkeley

Abstract:
For most of Computer Vision's existence, the focus has been solidly on algorithms and models, with data treated largely as an afterthought. Only recently did our discipline begin to appreciate the singularly crucial role played by data. In this talk, I will begin with some historical examples illustrating the importance of large visual data in both computer vision and human visual perception. I will then share some of our recent work demonstrating the power of very simple algorithms when used with the right data. Recent results in visual in-context learning and visual data attribution will be presented.

About the speaker:
Alexei (Alyosha) Efros joined UC Berkeley in 2013. Prior to that, he spent a decade on the faculty of Carnegie Mellon University, and has also been affiliated with École Normale Supérieure/INRIA and the University of Oxford. His research is in the area of computer vision and computer graphics, especially at the intersection of the two. He is particularly interested in using data-driven techniques to tackle problems where large quantities of unlabeled visual data are readily available. Efros received his PhD in 2003 from UC Berkeley. He is a recipient of the CVPR Best Paper Award (2006), a Sloan Fellowship (2008), a Guggenheim Fellowship (2008), an Okawa Grant (2008), the SIGGRAPH Significant New Researcher Award (2010), three PAMI Helmholtz Test-of-Time Prizes (1999, 2003, 2005), the ACM Prize in Computing (2016), the Diane McEntyre Award for Excellence in Teaching Computer Science (2019), the Jim and Donna Gray Award for Excellence in Undergraduate Teaching of Computer Science (2023), and the PAMI Thomas S. Huang Memorial Prize (2023).

Tuesday, April 9, 2024, 2:00 PM
GDC 6.302 | Recording

What Do Pre-Trained Speech Representation Models Know?

Karen Livescu [homepage]
Toyota Technological Institute at Chicago

Abstract:
Pre-trained speech representation models have become ubiquitous in speech processing over the past few years. They have both improved the state of the art and made it feasible to learn task-specific models with very little labeled data. However, it is not well understood what linguistic information is encoded in pre-trained models, where in the models it is encoded, and how best to apply this information to downstream tasks. In this talk I will describe recent work that begins to build an understanding of pre-trained speech models, through both layer-wise analysis and benchmarking on tasks. We consider a number of popular pre-trained models and investigate the extent to which they encode spectral, phonetic, and word-level information. The results of these analyses also suggest some ways to improve or simplify the application of pre-trained models for downstream tasks. Finally, I will describe our efforts to benchmark model performance on a variety of spoken language understanding tasks, in order to broaden our understanding of the semantic capabilities of speech models.
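One common form of the layer-wise analysis mentioned above is to train a small probe on frozen features from each layer and compare held-out accuracies. The sketch below is a generic illustration, not the speaker's protocol; `extract_layer_features` is a hypothetical placeholder for pooling a layer's hidden states per utterance.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def probe_layer(features: np.ndarray, labels: np.ndarray) -> float:
    """Fit a linear probe on frozen features and return held-out accuracy."""
    x_tr, x_te, y_tr, y_te = train_test_split(features, labels, test_size=0.2, random_state=0)
    probe = LogisticRegression(max_iter=1000).fit(x_tr, y_tr)
    return probe.score(x_te, y_te)

# Hypothetical loop over layers of a pre-trained speech model:
# `extract_layer_features(layer)` stands in for pooling that layer's hidden
# states over each utterance; `labels` are e.g. phone or word classes.
# for layer in range(num_layers):
#     feats, labels = extract_layer_features(layer)
#     print(f"layer {layer}: probe accuracy = {probe_layer(feats, labels):.3f}")
```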

About the speaker:
Karen Livescu is a Professor at TTI-Chicago. This year she is on sabbatical, splitting her time between the Stanford NLP group and the CMU Language Technologies Institute. She completed her PhD at MIT in 2005. She is an ISCA Fellow and a recent IEEE Distinguished Lecturer. She has served as a program chair/co-chair for ICLR, Interspeech, and ASRU, and is an Associate Editor for TACL and IEEE T-PAMI. Her group's work spans a variety of topics in spoken, written, and signed language processing, with a particular interest in representation learning, cross-modality learning, and low-resource settings.

Friday, April 26, 2024, 11:00 AM
GDC 6.302 | Recording

Towards Generalizable Motion-Level Intelligence with Foundation Models

Fei Xia [homepage]
Senior Research Scientist, Google DeepMind

Abstract:
This talk introduces a few novel approaches to motion-level embodied intelligence through integrating Vision Language Models with Robotics. We present a series of works that explore leveraging the capabilities of large vision-language models (VLMs) and language models (LLMs) for robotic control and spatial reasoning tasks. First, we investigate whether we can fundamentally improve VLMs' spatial reasoning capabilities by enriching the data. Second, we look into finding a more efficient interface between robotics control and VLM inference. We propose Prompting with Iterative Visual Optimization (PIVOT), a novel visual prompting approach for VLMs that enables zero-shot control of robotic systems, navigation in various environments, and other spatial reasoning capabilities without requiring task-specific fine-tuning. PIVOT casts tasks as iterative visual question answering, where the VLM selects the best proposals (e.g., robot actions, localizations, or trajectories) annotated on the image, which are then iteratively refined to converge on the optimal answer. Finally, when the VLMs don't work well out of the box, we look into improving their teachability. We investigate fine-tuning robot code-writing LLMs to enhance their teachability and adaptability to human inputs in long-term interactions. We introduce Language Model Predictive Control (LMPC), a framework that formulates human-robot interactions as a partially observable Markov decision process and trains the LLM to complete previous interactions, effectively learning a transition dynamics model. LMPC is combined with model predictive control to discover shorter paths to success, improving non-expert teaching success rates and reducing the average number of human corrections required on a wide range of robot tasks and embodiments. Together, these works highlight the potential and limitations of leveraging foundation models for robotic and spatial reasoning domains, demonstrating promising approaches for generalizable motion-level intelligence.
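The iterative loop behind PIVOT can be conveyed with a short sketch, reconstructed only from the description above rather than from the actual implementation; `annotate_image` and `vlm_select_best` are hypothetical placeholders for drawing candidate proposals on the image and querying the VLM.

```python
import numpy as np

def pivot_optimize(image, vlm_select_best, annotate_image,
                   action_dim=2, num_candidates=16, num_iters=3, top_k=4):
    """Iterative visual optimization: sample candidate actions, let a VLM choose
    the best ones, then refit the proposal distribution around the selections."""
    rng = np.random.default_rng(0)
    mean, std = np.zeros(action_dim), np.ones(action_dim)

    for _ in range(num_iters):
        candidates = rng.normal(mean, std, size=(num_candidates, action_dim))
        annotated = annotate_image(image, candidates)        # draw numbered candidates on the image
        chosen = vlm_select_best(annotated, top_k=top_k)     # indices of VLM-preferred candidates
        selected = candidates[chosen]
        mean, std = selected.mean(axis=0), selected.std(axis=0) + 1e-3  # refit and shrink
    return mean  # converged action (e.g., a 2-D image location or waypoint)
```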

About the speaker:
Fei Xia is a Senior Research Scientist at Google DeepMind, where he works on the Robotics team. He received his PhD degree from the Department of Electrical Engineering at Stanford University, co-advised by Silvio Savarese in SVL and Leonidas Guibas. His mission is to build intelligent embodied agents that can interact with complex and unstructured real-world environments. His research has been awarded a CoRL Special Innovation award and an ICRA Outstanding Robot Learning Paper Award, and has been featured in popular media outlets such as The New York Times, Reuters, and WIRED. Most recently, he has been exploring the use of foundation models for spatial reasoning and low-level control in robotics.