Google DeepMind Archives

Google Adding AI Video Generator Veo 3 to YouTube Shorts

By Paula Parisi
June 24, 2025

YouTube Shorts is getting a free Veo 3 upgrade that will let creators generate high-quality AI video clips using text prompts. The news was announced by YouTube CEO Neal Mohan at the Cannes Lions International Festival of Creativity, where it was positioned as a means for brands to transform how advertisements are produced. Veo 3 functionality will be integrated “later this summer,” according to Mohan. The Google DeepMind video generation model has been made available for use in YouTube Shorts starting with Veo 2. With Veo 3, the platform gets audio capability and what Mohan describes as “vastly improved” video quality. Continue reading Google Adding AI Video Generator Veo 3 to YouTube Shorts

Odyssey’s AI World Modeling Engine Streams Interactive 3D

By Paula Parisi
May 30, 2025

Artificial intelligence startup Odyssey, which turns two this year, has unveiled an interactive streaming AI video model. Available on the web in research preview, the model generates video streams every 40 milliseconds that viewers can navigate through — much like interacting with a 3D-rendered video game using either a keyboard, game controller or smartphone. Odyssey describes the current experience as similar to “exploring a glitchy dream” and says that while “utility is limited for now” its breakthrough is based on the fact that “improvements won’t be driven by hand-built game engines, but rather by models and data.” Continue reading Odyssey’s AI World Modeling Engine Streams Interactive 3D

Google Upgrades GenAI Models, Debuts AI Storyteller ‘Flow’

By Paula Parisi
May 22, 2025

Google is in a filmmaking frame of mind. The search giant introduced Veo 3, the latest version of its generative video model, loading it with cinematic capabilities including a new AI storytelling tool called Flow. At the Google I/O conference the company also debuted an upgraded image generator, Imagen 4, and announced expanded access to the AI music tool Lyria 2. Veo 3 can generate videos with audio — a Google first, adding things like background traffic noises, birds singing, “even dialogue between characters.” It offers improved consistency of characters, scenes and objects, while gaining camera controls, outpainting and object add/remove. Continue reading Google Upgrades GenAI Models, Debuts AI Storyteller ‘Flow’

Google DeepMind AlphaEvolve: Model of Algorithm Efficiency

By Paula Parisi
May 20, 2025

Google DeepMind has introduced AlphaEvolve, a coding agent that takes an evolutionary approach to general-purpose algorithm discovery and model optimization. AlphaEvolve combines the creative problem-solving abilities of Google’s Gemini models with automated evaluators that verify answers, then applies an evolutionary framework that improves on the most promising results. Evolutionary AI refers to techniques inspired by biological evolution, including natural selection, to optimize and design machine learning models. Continue reading Google DeepMind AlphaEvolve: Model of Algorithm Efficiency

Google Launches AI Futures Fund to Finance, Foster Startups

By Paula Parisi
May 15, 2025

Google has formally announced the launch of its AI Futures Fund to identify and invest in artificial intelligence startups. Participating startups will get early access to Google DeepMind’s latest models — Gemini, Imagen and Veo, “hands-on support from Google researchers” and Google Cloud credits. The AI Futures Fund has been ramping up these past several months, working with startups including Fal, Replit and Synthesia, among others. In identifying up-and-coming AI startups, Google is competing against Amazon and Microsoft who have aggressively pursued nascent firms, most notably Anthropic and OpenAI, respectively. Continue reading Google Launches AI Futures Fund to Finance, Foster Startups

Vertex AI Movie Studio Can Create Videos from Start to Score

By Paula Parisi
April 14, 2025

Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score

Deep Cogito Is Out of Stealth with Hybrid Reasoning Models

By Paula Parisi
April 10, 2025

San Francisco-based AI startup Deep Cogito has released five AI models in preview, making them available under an open-source license agreement. The models come in sizes 3B, 8B, 14B, 32B and 70B, with plans to release 109B, 400B and 671B versions in the weeks and months ahead. As for the current models, “each outperforms the best available open models of the same size, including counterparts from Meta, DeepSeek and Alibaba, across most standard benchmarks,” Deep Cogito claims, noting that the 70B model in particular “outperforms the newly released Llama 4 109B MoE model.” Continue reading Deep Cogito Is Out of Stealth with Hybrid Reasoning Models

Google Debuts Next-Gen Reasoning Models with Gemini 2.5

By Paula Parisi
March 27, 2025

Google has released what it calls its most intelligent AI model yet, Gemini 2.5. The first 2.5 model release, an experimental version of Gemini 2.5 Pro, is a next-gen reasoning model that Google says outperformed OpenAI o3-mini and Claude 3.7 Sonnet from Anthropic on common benchmarks “by meaningful margins.” Gemini 2.5 models “are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy,” according to Google. The new model comes just three months after Google released Gemini 2.0 with reasoning and agentic capabilities. Continue reading Google Debuts Next-Gen Reasoning Models with Gemini 2.5

Google Launches Agentspace in the UK and Promotes Chirp 3

By Paula Parisi
March 19, 2025

Google is expanding its AI presence in the UK market, hosting a splashy launch event there for Agentspace. Google in December launched Agentspace, an AI agent hub that makes it easy for enterprises to build, manage and deploy custom agents using Gemini. The gathering was hosted by Google DeepMind CEO Demis Hassabis, and Google Cloud CEO Thomas Kurian and included participation by local customers BT Group and advertising powerhouse WPP. Google invited UK businesses to store cloud data locally using its $1 billion data center, opening there this year. The company also promoted its new Chirp 3 audio generator, which offers HD voice synthesis. Continue reading Google Launches Agentspace in the UK and Promotes Chirp 3

YouTube Shorts Updates Dream Screen with Google Veo 2 AI

By Paula Parisi
February 19, 2025

YouTube Shorts has upgraded its Dream Screen AI background generator to incorporate Google DeepMind’s latest video model, Veo 2, which will also generate standalone video clips that users can post to Shorts. “Need a specific scene but don’t have the right footage? Want to turn your imagination into reality and tell a unique story? Simply use a text prompt to generate a video clip that fits perfectly into your narrative, or create a whole new world,” coaxes YouTube, which seems to be trying out “Dream Screen” branding as an umbrella for its genAI efforts. Continue reading YouTube Shorts Updates Dream Screen with Google Veo 2 AI

OpenAI Operator Agent Available to ChatGPT Pro Subscribers

By Paula Parisi
January 27, 2025

OpenAI has launched Operator, a semi-autonomous AI agent that uses a proprietary web browser to execute tasks like planning a vacation using Tripadvisor or booking restaurant reservations through OpenTable. “It can look at a webpage and interact with it by typing, clicking and scrolling,” explains OpenAI. Operator is powered by a new model called Computer-Using Agent (CUA), and is available in research preview to ChatGPT Pro subscribers in the U.S. Combining GPT-4o’s computer vision capabilities with advanced reasoning, CUA is trained to interact with graphical user interfaces (GUIs) — parsing menus, clicking buttons and reading screen text. Continue reading OpenAI Operator Agent Available to ChatGPT Pro Subscribers

Galaxy Unpacked: More AI for S25 and a Peek at AR Glasses

By Paula Parisi
January 27, 2025

Samsung’s new Galaxy S25 line — the Galaxy S25 Ultra, Galaxy S25+ and Galaxy S25 — will more tightly integrate AI, including AI agents, becoming “true AI companions” at a level previously unknown to mobile devices. That leap is credited largely to a “first-of-its-kind” Snapdragon 8 Elite customization for the Galaxy chipset that “delivers greater on-device processing power for Galaxy AI and superior camera range and control with Galaxy’s next-gen ProVisual Engine,” according to Samsung. In addition, the top-of-the-line Galaxy S25 Ultra has been redesigned with a slightly larger 6.9-inch screen and rounded bevel. Continue reading Galaxy Unpacked: More AI for S25 and a Peek at AR Glasses

Veo 2 Is Unveiled Weeks After Google Debuted Veo in Preview

By Paula Parisi
December 18, 2024

Attempting to stay ahead of OpenAI in the generative video race, Google announced Veo 2, which it says can output 4K clips of two-minutes-plus at 4096 x 2160 pixels. Competitor Sora can generate video of up to 20 seconds at 1080p. However, TechCrunch says Veo 2’s supremacy is “theoretical” since it is currently available only through Google Labs’ experimental VideoFX platform, which is limited to videos of up to 8-seconds at 720p. VideoFX is also waitlisted, but Google says it will expand access this week (with no comment on expanding the cap). Continue reading Veo 2 Is Unveiled Weeks After Google Debuted Veo in Preview

DeepMind Genie 2 Creates Worlds That Emulate Video Games

By Paula Parisi
December 6, 2024

Google DeepMind’s new Genie 2 is a large foundation world model that generates interactive 3D worlds that are being likened to video games. “Games play a key role in the world of artificial intelligence research,” says Google DeepMind, noting “their engaging nature, challenges and measurable progress make them ideal environments to safely test and advance AI capabilities.” Based on a simple prompt image, Genie 2 is capable of producing “an endless variety of action-controllable, playable 3D environments” — suitable for training and evaluating embodied agents — that can be played by a human or AI agent using keyboard and mouse inputs. Continue reading DeepMind Genie 2 Creates Worlds That Emulate Video Games

Google DeepMind Touts AI-Powered Quantum Error Detection

By Paula Parisi
November 26, 2024

Google DeepMind has come up with an error correction technique it says will make quantum computers more reliable, particularly at scale. While quantum computing holds tremendous promise — potentially able to solve in just a few hours problems it would take a conventional computer “billions of years” to figure out, Google claims — the systems are notoriously unstable, due to the delicacy of the “quantum state.” AlphaQubit is an AI-based decoder that identifies quantum computing errors with accuracy. Combining DeepMind’s machine learning expertise with Google Quantum AI error correction, the technique advances efforts to create a reliable quantum computer. Continue reading Google DeepMind Touts AI-Powered Quantum Error Detection