Title: GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving

Mar 26, 2025

Lloyd Russell, Anthony Hu, Lorenzo Bertoni, George Fedoseev, Jamie Shotton, Elahe Arani, Gianluca Corrado

Abstract: Generative models offer a scalable and flexible paradigm for simulating complex environments, yet current approaches fall short in addressing the domain-specific requirements of autonomous driving, such as multi-agent interactions, fine-grained control, and multi-camera consistency. We introduce GAIA-2, Generative AI for Autonomy, a latent diffusion world model that unifies these capabilities within a single generative framework. GAIA-2 supports controllable video generation conditioned on a rich set of structured inputs: ego-vehicle dynamics, agent configurations, environmental factors, and road semantics. It generates high-resolution, spatiotemporally consistent multi-camera videos across geographically diverse driving environments (UK, US, Germany). The model integrates both structured conditioning and external latent embeddings (e.g., from a proprietary driving model) to facilitate flexible and semantically grounded scene synthesis. Through this integration, GAIA-2 enables scalable simulation of both common and rare driving scenarios, advancing the use of generative world models as a core tool in the development of autonomous systems. Videos are available at https://wayve.ai/thinking/gaia-2.

* Technical Report
