OWMDrive: Causality-Aware End-to-End Autonomous Driving via 4D Occupancy World Model
2026-06-29 • Computer Vision and Pattern Recognition
Computer Vision and Pattern Recognition
AI summaryⓘ
The authors propose OWMDrive, a new autonomous driving system that predicts how a traffic scene will change over time to plan safer and more reliable driving paths. Unlike previous methods that only look at the current situation, their system forecasts multiple future steps using a 3D map of space occupancy. This helps the car better handle tricky situations like hidden obstacles or sudden events. Their tests show that this forward-looking approach improves the safety and stability of driving decisions.
autonomous drivingend-to-end learningoccupancy grid3D occupancy forecastingdiffusion-based planningtrajectory planningspatiotemporal modelingcausal dynamicspartially observable environmentstraffic interactions
Authors
Junjie Cheng, Ruiqi Song, Ye Wu, Nanxing Zeng, Ximiao Li, Yunfeng Ai
Abstract
Autonomous driving systems are steadily moving toward end-to-end paradigms to mitigate the limited adaptability of rule-based pipelines in complex traffic environments. However, most existing learning-based methods still make decisions from static representations of the current scene, without explicit future rollouts or modeling of the temporal causal dynamics in traffic interactions. This limitation often results in unstable or overly conservative planning under high-uncertainty conditions, such as occlusions and unexpected events. To overcome these challenges, we introduce OWMDrive, a generative end-to-end driving framework built upon an Occupancy World Model for multi-step 3D occupancy forecasting, which serves as a conditional prior to guide diffusion-based planning. Conditioned on both current observations and predicted future states, the planner iteratively refines trajectory candidates to generate a reinforced driving trajectory. By explicitly modeling scene evolution over future horizons, OWMDrive captures key spatiotemporal causal dependencies, which leads to more foresighted and robust trajectory generation. Extensive experiments demonstrate that OWMDrive significantly improves planning reliability and safety, especially in challenging and partially observable driving scenarios.