AERIS: Aerial-Edge Role-Driven Intelligence at Runtime via Orchestrated Language-Model Swarm
2026-06-29 • Robotics
Robotics
AI summaryⓘ
The authors present AERIS, a system designed to run small language models and control modules directly on flying robots, like drones. This lets the drones handle instructions over time and react quickly, even with limited computing power. AERIS breaks down complex instructions step-by-step and keeps the drone's perception and control working smoothly. They tested their system in simulations and real-world flights, showing it can plan and respond efficiently in real time.
large language modelsaerial platformsedge computinginstruction decompositionUAV navigationperception-control loopheartbeat schedulingreal-time systemsattention mechanismrobotic autonomy
Authors
Jiabin Lou, Haopeng Wang, Xinyu Liu, Yu Zhang, Rongye Shi, Wenjun Wu
Abstract
Integrating large language models into robotic systems holds promise for enhancing autonomy, yet practical deployment remains constrained by strict heartbeat-constrained scheduling and limited computational power. We propose AERIS: an edge deployment framework for aerial platforms. It organizes dedicated small language models combined with lightweight perception and control modules into roles that can be instantiated at runtime, and dynamically rebinds them across different executors as resources change, thereby pushing intelligent capabilities to the edge. AERIS achieves long-horizon instruction decomposition through an attention-subgoal alignment mechanism, which involves annotating the currently active instruction step in messages, thereby progressively approaching long-term objectives. We evaluate AERIS on a high-fidelity UAV Vision-and-Language Navigation benchmark. Under a heartbeat-timed execution mechanism, AERIS maintains a stable perception-decision-control loop between a low-frequency planner and a high-frequency controller, supporting real-time closed-loop operation. We further validate its deployability through two real-world experiments focused on planning and fast response. A demonstration video is provided in the supplementary materials.