Genie 3 World Model Produces Minutes of Video in Real Time

Google DeepMind has unveiled Genie 3, a world-building model that uses text and image prompts to generate 3D environments in real time. Still in research preview, Genie 3 can output “several minutes” of video that can be navigated in real time at 24fps and a resolution of 720p. Because it remembers the rules of the world it creates, Genie 3 allows agents to predict how the environment evolves and how actions affect it. Google says world models are “a key steppingstone” to artificial general intelligence, or AGI, since they can train AI agents in “an unlimited curriculum of rich simulation.” Continue reading Genie 3 World Model Produces Minutes of Video in Real Time

Odyssey’s AI World Modeling Engine Streams Interactive 3D

Artificial intelligence startup Odyssey, which turns two this year, has unveiled an interactive streaming AI video model. Available on the web in research preview, the model generates video streams every 40 milliseconds that viewers can navigate through — much like interacting with a 3D-rendered video game using either a keyboard, game controller or smartphone. Odyssey describes the current experience as similar to “exploring a glitchy dream” and says that while “utility is limited for now” its breakthrough is based on the fact that “improvements won’t be driven by hand-built game engines, but rather by models and data.” Continue reading Odyssey’s AI World Modeling Engine Streams Interactive 3D