HATS: A Human-Agent Teleoperation System for Multi-Arm Data Collection

2026-06-15 • Robotics

Robotics

AI summaryⓘ

The authors created HATS, a system where one human controls two robot arms, and an AI agent helps by controlling two extra arms to work together on tasks. This way, the human does not get overwhelmed, and the AI assists by handling simpler parts of the job. The human can also use voice commands to fix or stop the AI if needed. Tests showed HATS works as well as having two expert humans working together and helps collect good data to teach robots.

multi-arm teleoperationrobotic manipulationmulti-agent systemshuman-robot interactionmachine learningmulti-task learningvoice commandsdata collectionrobot collaborationlarge workspace robotics

Authors

Zesen Lin, Jian-Jian Jiang, Haoming Cen, Xiao-Ming Wu, Dandan Zhang, Wei-Shi Zheng

Abstract

Many real-world manipulation scenarios, such as handling complex collaborative tasks and dealing with large workspaces, require coordination of more than two robotic arms. Consequently, an effective multi-arm teleoperation system is required to collect demonstrations for training coordinated multi-arm manipulation policies. However, existing teleoperation frameworks mainly focus on single-operator or multi-operator setups, facing a practical trade-off between the cognitive load placed on a single operator and the coordination cost incurred by multiple operators. To address this problem, we introduce HATS, a human-agent teleoperation system that enables a single human operator, assisted by an MLLM-based agent, to collect data for multi-arm manipulation tasks. Our system decouples the control space: two primary arms are directly teleoperated by the human, while two assistive arms are controlled by a training-free agent that handles sub-tasks. In addition, the human operator can use voice commands to prevent collisions and correct assistive arm behaviors during execution. Extensive evaluations demonstrate that HATS achieves data collection efficiency and success rates comparable to expert dual-human teams. Moreover, downstream policy evaluations demonstrate the efficacy and quality of the data collected through HATS.

View PDFOpen arXiv