Evaluating the Robustness of Natural Language Reward Shaping Models to Spatial Relations (2020)
As part of an effort to bridge the gap between using reinforcement learning in simulation and in the real world, we probe whether current reward shaping models are able to encode relational data between objects in the environment. We construct an augmented dataset for controlling a robotic arm in the Meta-World platform to test whether current models are able to discriminate between target objects based on their relations. We found that state of the art models are indeed expressive enough to achieve performance comparable to the gold standard, so this specific experiment did not uncover any obvious shortcomings.
View:
PDF
Citation:
Undergraduate Honors Thesis, Computer Science Department, University of Texas at Austin.
Bibtex:

Presentation:
Slides (PDF) Slides (PPT)
Antony Yun Undergraduate Alumni antony yun [at] utexas edu