Text-to-Video Archives

Runway Gen-4.5 Video Debuts at No. 1 on Video Arena Chart

By Paula Parisi
December 3, 2025

Runway Gen-4.5 is rolling out, and the text-to-video model grabbed the No. 1 spot on the Video Arena leaderboard for generative models that don’t simultaneously output sound, beating the non-audio versions of Google’s Veo 3 and OpenAI’s Sora 2 Pro. Offering what Runway AI calls “unprecedented visual fidelity” across cinematic and “highly realistic” outputs, it is also built for creative freedom, providing “precise control over every aspect of generation.” The new model is good at understanding physics, cause and effect through causal reasoning, camera movements and human emotion, claims the New York-based startup.

Google Adds New Features to Its Flow GenAI Storytelling Tool

By Paula Parisi
December 2, 2025

Google has added four new features in Flow, its AI tool for storytelling, that offer more precise control over images and videos. The upgrades include generative imaging with Nano Banana Pro, doodle prompts, an object insertion/removal tool and camera motion. Flow was introduced in May and offers the ability to edit and build scenes using natural language. The improvements aim to make Flow output more polished. “In Flow, you can use images to serve as the characters, subjects and starting points for your clips” with pictures you upload or create in Flow with the new “Images” tab, according to the company.

Genie 3 World Model Produces Minutes of Video in Real Time

By Paula Parisi
August 15, 2025

Google DeepMind has unveiled Genie 3, a world-building model that uses text and image prompts to generate 3D environments in real time. Still in research preview, Genie 3 can output “several minutes” of video that can be navigated in real time at 24fps and a resolution of 720p. Because it remembers the rules of the world it creates, Genie 3 allows agents to predict how the environment evolves and how actions affect it. Google says world models are “a key steppingstone” to artificial general intelligence, or AGI, since they can train AI agents in “an unlimited curriculum of rich simulation.”

Google Adding AI Video Generator Veo 3 to YouTube Shorts

By Paula Parisi
June 24, 2025

YouTube Shorts is getting a free Veo 3 upgrade that will let creators generate high-quality AI video clips using text prompts. The news was announced by YouTube CEO Neal Mohan at the Cannes Lions International Festival of Creativity, where it was positioned as a means for brands to transform how advertisements are produced. Veo 3 functionality will be integrated “later this summer,” according to Mohan. The Google DeepMind video generation model has been made available for use in YouTube Shorts starting with Veo 2. With Veo 3, the platform gets audio capability and what Mohan describes as “vastly improved” video quality.

Manus AI Takes an Agentic Approach with Its Video Generator

By Paula Parisi
June 9, 2025

China’s Manus AI has unveiled a text-to-video generator it says can transform “prompts into complete stories — structured, sequenced, and ready to watch. With a single prompt, Manus plans each scene, crafts the visuals, and animates your vision,” the company announced last week. Manus generated buzz in March for its agentic approach to AI, and now it is putting that autonomous technology to work on generative AI, promising story generation within minutes. Last month, the firm that developed Manus, Butterfly Effect, reportedly secured $75 million in funding led by U.S.-based Benchmark for a nearly $500 million valuation.

Microsoft Debuts Mobile Bing Video Creator Powered by Sora

By Paula Parisi
June 5, 2025

Microsoft’s Bing mobile search app for Android and iOS is adding a free Bing Video Creator feature powered by OpenAI’s Sora. Text-to-video model Sora was previously available only by subscription through ChatGPT Plus for $20 per month or the $200 monthly ChatGPT Pro. The integration will enable creation of five-second vertical video clips on mobile, with horizontal capability coming soon. Although a major investor in OpenAI, Microsoft has made headway with its own signature AI offerings, including Bing Image Creator and the Copilot AI assistant. The Bing Mobile app is available worldwide via the App Store for iOS and the Play Store for Android.

Character.AI Introduces New Video Generator in Closed Beta

By Paula Parisi
April 24, 2025

Character.AI, a platform offering AI chatbots for socializing and role play, has released a video generation model called AvatarFX in closed beta. Promising the ability to make photorealistic images “come to life — speak, sing and emote — all with the click of a button,” the technology combines audio and video to create a variety of visual styles and voices, from realistic 3D — including “non-human faces (like a favorite pet)” — to 2D animations, according to the company. AvatarFX also has the ability “to maintain strong temporal consistency with face, hand and body movement” and can “power videos with multiple speakers.”

Highly Realistic Alibaba GenVid Models Are Available for Free

By Paula Parisi
February 28, 2025

Alibaba has open-sourced its Wan 2.1 video- and image-generating AI models, heating up an already competitive space. The Wan 2.1 family, which has four models, is said to produce “highly realistic” images and videos from text and images. The company has since December been previewing a new reasoning model, QwQ-Max, indicating it will be open-sourced when fully released. The move comes after another Chinese AI company, DeepSeek, released its R1 reasoning model for free download and use, triggering demand for more open-source artificial intelligence.

ByteDance’s Goku Video Model Is Latest in Chinese AI Streak

By Paula Parisi
February 24, 2025

Barely two weeks after the launch of its OmniHuman-1 AI model, ByteDance has released Goku, a new artificial intelligence model designed to create photorealistic video featuring humanoid actors. Goku uses text prompts to create, among other things, realistic product videos without the need for human actors, a boon for ByteDance’s social media unit, TikTok. Goku is open source, trained on a large dataset of roughly 36 million video-text pairs and 160 million image-text pairs. Goku’s debut is being received as more bad news for OpenAI in the form of added competition, but a positive step for global enterprise.

YouTube Shorts Updates Dream Screen with Google Veo 2 AI

By Paula Parisi
February 19, 2025

YouTube Shorts has upgraded its Dream Screen AI background generator to incorporate Google DeepMind’s latest video model, Veo 2, which will also generate standalone video clips that users can post to Shorts. “Need a specific scene but don’t have the right footage? Want to turn your imagination into reality and tell a unique story? Simply use a text prompt to generate a video clip that fits perfectly into your narrative, or create a whole new world,” coaxes YouTube, which seems to be trying out “Dream Screen” branding as an umbrella for its genAI efforts.

Adobe Firefly Video Now in Public Beta Starting at $10 Month

By Paula Parisi
February 14, 2025

Adobe’s Firefly video is now in public beta as part of Firefly AI, which is now multimodal with video, image and vector generation. Available for $10 per month for Firefly Standard or $30 per month for Firefly Pro, the Firefly app offers additional tiers for premium video and audio features, allowing a degree of customization based on project needs. Adobe continues to position Firefly as “the only generative AI model that is IP-friendly and commercially safe,” offering the option of contractual IP indemnification to protect against infringement lawsuits “in the unlikely event of a claim involving a Firefly output.”

Luma AI Upgrades Its Video Generator and Adds Image Model

By Paula Parisi
December 2, 2024

Anticipating what one outlet calls “the likely imminent release of OpenAI’s Sora,” generative AI video competitors are compelled to step up their game. Luma AI has released a major upgrade to its Dream Machine, speeding its already quick video generation and enabling a chat function for natural language prompts, so you can talk to it as with OpenAI’s ChatGPT. In addition to the new interface, Dream Machine is going mobile and adding a new foundation image model, Luma AI Photon, which “has been purpose built to advance the power and capabilities of Dream Machine,” according to the company.

MiniMax’s Hailuo AI Rolls Out New Image-to-Video Capability

By Paula Parisi
October 11, 2024

Hailuo, the free text-to-video generator released last month by the Alibaba-backed company MiniMax, has delivered its promised image-to-video feature. Founded by AI researcher Yan Junjie, the Shanghai-based MiniMax also has backing from Tencent. The model earned high marks for what has been called “ultra realistic” video, and MiniMax says the new image-to-video feature will improve output across the board as a result of “text-and-image joint instruction following,” which means Hailuo now “seamlessly integrates both text and image command inputs, enhancing your visuals while precisely adhering to your prompts.”

Meta’s Movie Gen Model Is a Powerful Content Creation Tool

By Paula Parisi
October 8, 2024

Meta Platforms has unveiled Movie Gen, a new family of AI models that generates video and audio content. Coming to Instagram next year, Movie Gen also allows a high degree of editing and effects customization using text prompts. Meta CEO Mark Zuckerberg demonstrated its abilities last week in an example shared on his Instagram account, in which he transforms a leg press machine at the gym into a steampunk machine and then into one made of molten gold. The models have been trained on a combination of licensed and publicly available datasets.

Alibaba Cloud Ups Its AI Game with 100 Open-Source Models

By Paula Parisi
September 25, 2024

Alibaba Cloud last week released more than 100 new open-source variants of its large language foundation model, Qwen 2.5, to the global open-source community. The company has also revamped its proprietary offering as a full-stack AI-computing infrastructure across cloud products, networking and data center architecture, all aimed at supporting the growing demands of AI computing. Alibaba Cloud’s contribution was revealed at the Apsara Conference, the annual flagship event held by the cloud division of China’s e-retail giant, often referred to as the Chinese Amazon.