Genie 3 — A New Frontier for World Models (Overview)
05 Aug 2025, 00:00 Z
TL;DR Genie 3 is DeepMind’s latest world model: from a text prompt it generates explorable 720p environments at 24 fps, keeps scene memory for minutes, supports promptable world events, and already plugs into SIMA for longer-horizon embodied tasks—though it’s only available as a limited research preview.
What is Genie 3?
Genie 3 is Google DeepMind’s third-generation world model, announced on 5 August 2025. It builds on the Genie 1/2 sequence and DeepMind’s Veo video generators to deliver interactive environments rather than passive clips. From a textual description, Genie 3 renders a navigable scene that can be steered in real time for a few minutes at 24 frames per second and 720p resolution. The model maintains spatial and visual consistency over long horizons without relying on an explicit 3D representation, enabling a single prompt to become a responsive “world” that evolves based on user actions.
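To make the real-time figures concrete, the arithmetic behind "24 fps at 720p for a few minutes" can be sketched as below. This is a back-of-the-envelope calculation, not a description of how Genie 3 works internally. It assumes that 720p means 1280x720 (the announcement figures quoted here do not give an aspect ratio) and uses 1 and 3 minutes as illustrative session lengths, since "a few minutes" is not pinned down. The function names are made up for this example.

```python
# Back-of-the-envelope numbers for a 24 fps, 720p interactive session.
# Assumptions (not from the announcement): 720p == 1280x720, and the
# session lengths below are illustrative stand-ins for "a few minutes".

FPS = 24
WIDTH, HEIGHT = 1280, 720  # assumed 16:9 frame size for 720p


def frame_budget_ms(fps: int) -> float:
    """Average time available per frame to keep up with real time."""
    return 1000.0 / fps


def total_frames(fps: int, minutes: float) -> int:
    """Number of frames generated over a session of the given length."""
    return round(fps * minutes * 60)


def pixels_per_second(fps: int, width: int, height: int) -> int:
    """Raw pixel throughput implied by the frame rate and resolution."""
    return fps * width * height


if __name__ == "__main__":
    print(f"Per-frame budget: {frame_budget_ms(FPS):.1f} ms")
    for minutes in (1, 3):
        print(f"{minutes} min session: {total_frames(FPS, minutes)} frames")
    print(f"Pixel throughput: {pixels_per_second(FPS, WIDTH, HEIGHT):,} px/s")
```

Under these assumptions, each frame has to be produced in roughly 42 ms on average, and a three-minute session spans several thousand frames that all need to stay mutually consistent. That gives a sense of why multi-minute coherence is a notable claim.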
DeepMind positions Genie 3 as a step toward AGI-ready simulators: agents can explore counterfactual scenarios, humans can stage bespoke training runs, and researchers can study open-ended environments that go beyond static datasets.
Key ideas
- Real-time interactive generation: produces 720p, 24 fps environments that stay coherent for multiple minutes, improving on Genie 2, which was interactive but not real-time and stayed consistent over much shorter horizons (DeepMind blog).
- Emergent consistency without explicit 3D assets: per-frame generation keeps geometry and lighting stable over long trajectories, unlike NeRFs and Gaussian splatting, which rely on an explicit 3D scene representation (DeepMind blog).
- Promptable world events: alongside navigation inputs, text “events” can alter weather, spawn objects, or trigger dynamic changes to test counterfactuals (DeepMind blog).
- Agent-ready worlds: Genie 3 integrates with DeepMind’s SIMA agent, which can pursue multi-step goals within generated worlds thanks to preserved state and longer horizons (DeepMind blog).
- Responsible roll-out: only a limited cohort of academics and creators can access the research preview while DeepMind collects safety feedback and refines mitigations (DeepMind blog).
Model availability
- Access model: invite-only research preview hosted by Google DeepMind.
- Outputs: interactive browser experience with navigation controls, promptable events, and recording utilities showcased in the announcement.
- No public checkpoints or inference code are released yet; prior Genie weights remain closed.