Efficient Minimal Solvers for Visual-Inertial Relative Pose Estimation in Multi-Camera Systems
2026-06-08 • Computer Vision and Pattern Recognition
Computer Vision and Pattern Recognition
AI summaryⓘ
The authors developed two new, faster ways to figure out the positions and orientations of multi-camera setups using fewer matching points than usual. They use extra information from motion sensors to simplify the math, turning a complicated problem into solving a simpler polynomial equation. Their methods work well with common algorithms that handle errors and showed good speed and accuracy in tests with simulated data and a real-world dataset called KITTI.
multi-camera systemsrelative pose estimationInertial Measurement Unit (IMU)point correspondences6th-degree polynomialRANSACvisual odometryKITTI benchmark
Authors
Tao Li, Zhenbao Yu, Banglei Guan, Jianli Han, Weimin Lv
Abstract
Estimating the relative poses of multi-camera systems is a fundamental problem in computer vision, with critical applications in autonomous vehicles, mobile devices, and unmanned aerial vehicles (UAVs). However, existing solutions often suffer from high computational complexity or rely on an excessive number of point correspondences, limiting their real-world applicability. To address these limitations, we propose two efficient minimal solvers for estimating the relative poses of multi-camera systems using a novel parameterization. The first solver leverages the vertical direction prior provided by Inertial Measurement Units (IMUs), while the second utilizes the rotation axis direction prior from IMUs. Our methods require only four point correspondences and reduce the problem of multi-camera relative pose estimation to solving a univariate 6th-degree polynomial, a significant improvement over existing approaches, which typically involve 8th-degree polynomials. This reduction in computational complexity and correspondence requirements makes our solvers particularly effective when integrated into RANSAC frameworks, demonstrating strong potential for visual odometry applications. Through rigorous evaluations on synthetic data and the KITTI benchmark, our methods achieved superior computational efficiency and competitive accuracy compared to state-of-the-art algorithms.