Abstract Sim2Real through Approximate Information States

2026-04-16Robotics

Robotics
AI summary

The authors study how to teach robots using simplified simulators that don’t capture every detail of the real world. They frame this problem using ideas from reinforcement learning about how to represent states in a simpler way. To improve robot learning, they propose a method that uses real-world data to adjust the simulator’s behavior. Their approach helps policies trained in simplified simulators work well when transferred to the real world or more detailed simulations.

reinforcement learningsimulatorsim2realstate abstractionpolicy transferroboticsdynamics correctionsim2simreal-world dataabstract simulator
Authors
Yunfu Deng, Yuhao Li, Josiah P. Hanna
Abstract
In recent years, reinforcement learning (RL) has shown remarkable success in robotics when a fast and accurate simulator is available for a given task. When using RL and simulation, more simulator realism is generally beneficial but becomes harder to obtain as robots are deployed in increasingly complex and widescale domains. In such settings, simulators will likely fail to model all relevant details of a given target task and this observation motivates the study of sim2real with simulators that leave out key task details. In this paper, we formalize and study the abstract sim2real problem: given an abstract simulator that models a target task at a coarse level of abstraction, how can we train a policy with RL in the abstract simulator and successfully transfer it to the real-world? Our first contribution is to formalize this problem using the language of state abstraction from the RL literature. This framing shows that an abstract simulator can be grounded to match the target task if the grounded abstract dynamics take the history of states into account. Based on the formalism, we then introduce a method that uses real-world task data to correct the dynamics of the abstract simulator. We then show that this method enables successful policy transfer both in sim2sim and sim2real evaluation.