Peter Stone's Selected Publications

Classified by TopicClassified by Publication TypeSorted by DateSorted by First Author Last NameClassified by Funding Source


Multimodal Embodied Attribute Learning by Robots for Object-Centric Action Policies

Multimodal Embodied Attribute Learning by Robots for Object-Centric Action Policies.
Xiaohan Zhang, Saeid Amiri, Jivko Sinapov, Jesse Thomason, Peter Stone, and Shiqi Zhang.
Autonomous Robots, March 2023.
Official version on publisher's website

Download

[PDF]2.6MB  

Abstract

Robots frequently need to perceive object attributes, such as red, heavy, and empty, using multimodal exploratory behaviors, such as look, lift, and shake. One possible way for robots to do so is to learn a classifier for each perceivable attribute given an exploratory behavior. Once the attribute classifiers are learned, they can be used by robots to select actions and identify attributes of new objects, answering questions, such as "Is this object RED and EMPTY?" In this article, we introduce a robot interactive perception problem, called Multimodal Embodied Attribute Learning (meal), and explore solutions to this new problem. Under different assumptions, there are two classes of meal problems. offline- meal problems are defined in this article as learning attribute classifiers from pre-collected data, and sequencing actions towards attribute identification under the challenging trade-off between information gains and exploration action costs. For this purpose, we introduce Mixed Observability Robot Control (morc), an algorithm for offline- meal problems, that dynamically constructs both fully and partially observable components of the state for multimodal attribute identification of objects. We further investigate a more challenging class of meal problems, called online- meal, where the robot assumes no pre-collected data, and works on both attribute classification and attribute identification at the same time. Based on morc, we develop an algorithm called Information-Theoretic Reward Shaping (morc-itrs) that actively addresses the trade-off between exploration and exploitation in online- meal problems. morc and morc-itrs are evaluated in comparison with competitive meal baselines, and results demonstrate the superiority of our methods in learning efficiency and identification accuracy.

BibTeX Entry

@article{AURO23,
   author="Xiaohan Zhang and Saeid Amiri and Jivko Sinapov and Jesse Thomason and Peter Stone and Shiqi Zhang",	
   title="Multimodal Embodied Attribute Learning by Robots for Object-Centric Action Policies",
   journal="Autonomous Robots",
   month="March",
   year="2023",
   doi="https://doi.org/10.1007/s10514-023-10098-5",
   abstract={
             Robots frequently need to perceive object attributes,
             such as red, heavy, and empty, using multimodal
             exploratory behaviors, such as look, lift, and shake. One
             possible way for robots to do so is to learn a classifier
             for each perceivable attribute given an exploratory
             behavior. Once the attribute classifiers are learned,
             they can be used by robots to select actions and identify
             attributes of new objects, answering questions, such as
             "Is this object RED and EMPTY?" In this article, we
             introduce a robot interactive perception problem, called
             Multimodal Embodied Attribute Learning (meal), and
             explore solutions to this new problem. Under different
             assumptions, there are two classes of meal
             problems. offline- meal problems are defined in this
             article as learning attribute classifiers from
             pre-collected data, and sequencing actions towards
             attribute identification under the challenging trade-off
             between information gains and exploration action
             costs. For this purpose, we introduce Mixed Observability
             Robot Control (morc), an algorithm for offline- meal
             problems, that dynamically constructs both fully and
             partially observable components of the state for
             multimodal attribute identification of objects. We
             further investigate a more challenging class of meal
             problems, called online- meal, where the robot assumes no
             pre-collected data, and works on both attribute
             classification and attribute identification at the same
             time. Based on morc, we develop an algorithm called
             Information-Theoretic Reward Shaping (morc-itrs) that
             actively addresses the trade-off between exploration and
             exploitation in online- meal problems. morc and morc-itrs
             are evaluated in comparison with competitive meal
             baselines, and results demonstrate the superiority of our
             methods in learning efficiency and identification
             accuracy.
	     },
  wwwnote={<a href="https://link.springer.com/article/10.1007/s10514-023-10098-5">Official version</a> on publisher's website},
}

Generated by bib2html.pl (written by Patrick Riley ) on Fri Jul 12, 2024 23:00:31