By
Paula ParisiJuly 2, 2025
Chinese e-commerce giant Alibaba has released a new multimodal model called Qwen VLo that can understand and generate images. Available for free in preview through Qwen Chat, it can use image or text prompts to generate pictures, and accepts text in multiple languages, including Chinese and English. It can also edit, change backgrounds and switch styles, handling multiple image edits in sequence. An upgrade over January’s Qwen 2.5-VL release, Qwen VLo uses progressive generation, allowing users to see the image creation in progress, and Alibaba says it’s particularly good at making inline adjustments to fine-tune images. Continue reading Alibaba’s Qwen VLo Generative AI Shows Images in Progress
By
Paula ParisiJune 24, 2025
The redesigned Firefly AI app Adobe released in April with third-party model support is now available on iOS and Android. Text-to-video and background editing are among the features included in the new mobile package, which Adobe claims will help users capture inspiration as it strikes with “the freedom to generate images and videos wherever you are.” Adobe says those of all skill levels will be able to use the app, which was designed “to complement the ways we already interact with our phones.” The company is also rolling out its AI-powered online moodboard creator — Firefly Boards — in public beta, now with video functionality. Continue reading Adobe Unveils Firefly Generative AI App for iOS and Android
By
Paula ParisiApril 28, 2025
News from Adobe MAX London 2025 spanned new Firefly image models to a refreshed web app that includes third-party image generators, an AI agent that automates Photoshop, an updated Firefly mobile app coming soon to iOS and Android, and the Firefly Video model in general release. The latest release of Firefly “unifies AI-powered tools for image, video, audio, and vector generation into a single, cohesive platform and introduces many new capabilities,” according to Adobe, which says that since its debut nearly two years ago, creatives have used Firefly to generate more than 22 billion assets worldwide. Continue reading Adobe Unveils Two New Image Models and Array of Products
By
Paula ParisiApril 14, 2025
Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score
By
Paula ParisiDecember 18, 2024
Attempting to stay ahead of OpenAI in the generative video race, Google announced Veo 2, which it says can output 4K clips of two-minutes-plus at 4096 x 2160 pixels. Competitor Sora can generate video of up to 20 seconds at 1080p. However, TechCrunch says Veo 2’s supremacy is “theoretical” since it is currently available only through Google Labs’ experimental VideoFX platform, which is limited to videos of up to 8-seconds at 720p. VideoFX is also waitlisted, but Google says it will expand access this week (with no comment on expanding the cap). Continue reading Veo 2 Is Unveiled Weeks After Google Debuted Veo in Preview
By
Paula ParisiDecember 6, 2024
Google DeepMind’s new Genie 2 is a large foundation world model that generates interactive 3D worlds that are being likened to video games. “Games play a key role in the world of artificial intelligence research,” says Google DeepMind, noting “their engaging nature, challenges and measurable progress make them ideal environments to safely test and advance AI capabilities.” Based on a simple prompt image, Genie 2 is capable of producing “an endless variety of action-controllable, playable 3D environments” — suitable for training and evaluating embodied agents — that can be played by a human or AI agent using keyboard and mouse inputs. Continue reading DeepMind Genie 2 Creates Worlds That Emulate Video Games
By
Paula ParisiAugust 30, 2024
Google is giving Gemini Advanced, Enterprise and Business subscribers the ability to create personalized AI assistants, which the company calls “Gems.” “Create your own personal AI experts on any topic you want,” the Alphabet company says. The search giant is also reintroducing Gemini’s image generation capabilities with its latest Imagen 3 model, which will be available to everyone. Gemini, which is Google’s ChatGPT competitor, will again have the ability to generate images of people, something Google disabled in February after controversy over some of the images. The company announced it has implemented new guardrails. Continue reading Gemini Gets Custom Gems AI Assistants and Adds Imagen 3
By
Paula ParisiAugust 22, 2024
Google DeepMind has made its latest AI image generator, Imagen 3, free for use in the U.S. via the company’s ImageFX platform. Imagen 3 will be available in multiple versions, “each optimized for different types of tasks, from generating quick sketches to high-resolution images.” Google announced Imagen 3 at Google I/O in March, and in June made it available to enterprise users through Vertex. Using simplified natural language text input rather than “complex prompt engineering,” Google says Imagen 3 generates high-quality images in a range styles, from photorealistic, painterly and textured to whimsically cartoony. Continue reading Google DeepMind Releases Imagen 3 for Free to U.S. Users
By
Paula ParisiMay 16, 2024
Google is launching two new AI models: the video generator Veo and Imagen 3, billed as the company’s “highest quality text-to-image model yet.” The products were introduced at Google I/O this week, where new demo recordings created using the Music AI Sandbox were also showcased. The 1080p Veo videos can be generated in “a wide range of cinematic and visual styles” and run “over a minute” in length, Google says. Veo is available in private preview in VideoFX by joining a waitlist. At a future date, the company plans to bring some Veo capabilities to YouTube Shorts and other products. Continue reading Veo AI Image Generator and Imagen 3 Unveiled at Google I/O