By
Paula ParisiJuly 21, 2025
Adobe’s Firefly Video model has introduced new updates including Generate Sound Effects, in beta, and a text-to-avatar feature that lets users turn scripts into avatar-led videos “in just a few clicks.” Firefly becomes the second video model to generate audio, joining Veo 3, although unlike Google’s AI video tool Firefly does not yet generate dialogue. What it can do is output foley-like sound and sound effects, while text-to-avatar can generate speech. As with Firefly’s generative visuals, Adobe says Generate Sound Effects is “commercially safe,” which means they are trained only on licensed or publicly available material. Continue reading Adobe Adds Generative Audio and Text-to-Avatar to Firefly AI
By
Paula ParisiJuly 15, 2025
Google has added photo-to-video capability to its Gemini AI app. Powered by Veo 3, Google’s latest generative video model, launched in May, Gemini AI can now turn images into 8-second videos complete with AI-generated sound including speech, environmental sounds and background noises. Available now via the Web to anyone with a $20 per month Google AI Pro subscription or those on the $125 per quarter Google AI Ultra plan, the new feature is also being released to mobile users this month for both iOS and Android devices. The videos are finished as 720p resolution MP4 files in 16:9 landscape format. Continue reading Google Offers Gemini AI Subscribers Photo-to-Video Function
By
Paula ParisiJuly 10, 2025
Apple announced a new 15,000-square-foot Apple Music Los Angeles studio in Culver City will open later this summer. The three-story complex that Apple says is “designed with artists in mind” includes two radio studios with support for immersive Apple Spatial Audio playback, a spatial audio mixing room, an art gallery, a “social media lab” and a 4,000-square-foot soundstage. Commemorating the 10th anniversary of Apple Music, the new structure is situated nearby to the future home of Apple TV+, a 550,000-square-foot building going up where Culver City borders the City of Los Angeles. Continue reading Apple Music Opening Three-Story Creative Hub in Culver City
By
Paula ParisiJune 26, 2025
ElevenLabs is bringing its powerful AI voice tools to mobile. Previously, the company’s apps and voice libraries were only available via the Web. Now iOS and Android users can tap ElevenLabs tech on the go with a “faster, intuitive, more powerful experience built natively for mobile” rather than awkwardly through a mobile browser. Combining mobility with creativity, the app lets users create realistic voiceovers for social media or narrate video using ElevenLabs’ text-to-speech models — including Eleven v3, now in alpha, which lets users fine-tune vocalizations using tags. The company has also introduced a new voice assistant, 11ai. Continue reading ElevenLabs Text-to-Voice AI Tools Now Available for Mobile
By
Paula ParisiJune 25, 2025
Google has launched Search Live with voice-input, a two-way conversational query function for exploring online resources. Presently available via the Google app for Android and iOS to U.S. users enrolled in Google Labs’ AI Mode experiment, Search Live is designed to handle complex, multi-part questions. Google suggests the new feature is “perfect for when you’re on the go or multitasking, like if you’re packing for a trip.” The discursive voice feature follows Google’s general rollout of AI Mode, recently launched to compete against products such as OpenAI’s ChatGPT Search and Perplexity AI. Continue reading Google Search Live Features Conversational Voice Capability
By
Paula ParisiJune 24, 2025
YouTube Shorts is getting a free Veo 3 upgrade that will let creators generate high-quality AI video clips using text prompts. The news was announced by YouTube CEO Neal Mohan at the Cannes Lions International Festival of Creativity, where it was positioned as a means for brands to transform how advertisements are produced. Veo 3 functionality will be integrated “later this summer,” according to Mohan. The Google DeepMind video generation model has been made available for use in YouTube Shorts starting with Veo 2. With Veo 3, the platform gets audio capability and what Mohan describes as “vastly improved” video quality. Continue reading Google Adding AI Video Generator Veo 3 to YouTube Shorts
By
Paula ParisiJune 17, 2025
Google is testing podcast-like audio search summaries generated by AI. Audio Overviews uses Google’s latest Gemini models to generate “quick, conversational audio overviews for certain search queries.” It can be enabled through Google Labs, the company’s public-facing portal to AI experiments. An Audio Overview “can help you get a lay of the land, offering a convenient, hands-free way to absorb information,” Google says, noting that the feature displays search results “right within the audio player” to make it easy to delve further. Google already had AI audio summaries in NotebookLM and Gemini. Like those, Search features AI discussion “hosts.” Continue reading Google Is Testing ‘Hosted’ GenAI Audio Summaries in Search
By
Paula ParisiJune 9, 2025
Chatbot platform Character.AI is rolling out its video generator, AvatarFX, in general release after a month in closed beta. It’s also adding a sharing feature called Scenes and Streams that will serve content to Character.AI’s community feed, coming soon to mobile. Users can now tap AvatarFX to create up to five videos per day, starting by uploading a photo, choosing a voice and writing dialogue for the character. Character.AI started as 1:1 text chat in the summer of 2023. Now the company is “expanding into a multi-modal world” with “more ways for creators to build immersive narratives and dynamic experiences.” Continue reading Character.AI Goes Wide with AvatarFX, Adds Mobile Features
By
Paula ParisiMay 27, 2025
Amazon is testing audio product summaries that make “AI shopping experts” available for interactive pre-purchase exploration, guiding customers through the retail experience by highlighting key product features and analyzing customer reviews. The feature — launching in the U.S. for select products — is designed to “make product research fun and convenient, like having helpful friends discuss potential purchases to make shopping easier,” the company says. The initial focus is on “products that typically require consideration before purchase,” saving time through focused discussion. Customers can tap the “Hear the Highlights” button on product detail pages in the Amazon Shopping app. Continue reading Amazon Tests Conversational AI ‘Hear the Highlights’ Feature
By
Paula ParisiMay 22, 2025
Google is in a filmmaking frame of mind. The search giant introduced Veo 3, the latest version of its generative video model, loading it with cinematic capabilities including a new AI storytelling tool called Flow. At the Google I/O conference the company also debuted an upgraded image generator, Imagen 4, and announced expanded access to the AI music tool Lyria 2. Veo 3 can generate videos with audio — a Google first, adding things like background traffic noises, birds singing, “even dialogue between characters.” It offers improved consistency of characters, scenes and objects, while gaining camera controls, outpainting and object add/remove. Continue reading Google Upgrades GenAI Models, Debuts AI Storyteller ‘Flow’
By
Paula ParisiMay 20, 2025
Stability AI has released an AI model that generates stereo audio that is quick and lightweight enough for mobile devices. Called Stable Audio Open Small, the open-source model is the result of a collaboration between the AI startup and chipmaker Arm. While there are several AI-powered apps that generate audio — Suno and Udio among them — most rely on cloud processing, thus can’t be used offline. Stability says Stable Audio Open Small is also IP safe due to being trained entirely on audio from the royalty-free libraries Free Music Archive and Freesound. Continue reading Stability AI Releases a Fast Stereo Audio-Generator for Mobile
By
Paula ParisiMay 15, 2025
Amazon’s Audible audiobook service is partnering with select publishers to bring more print and e-books into the spoken word realm and is leveraging AI narration and translation to help it happen at scale. This move aims to quickly boost Audible’s product offerings so it can compete more effectively against streamers like Apple and Spotify who have rapidly expanded their literary market share. “Audiobooks are the fastest-growing format in publishing,” yet of the millions of titles available today in print and as e-books, only 2-5 percent exist in audio form, according to the company. Continue reading Audible Using AI Narration and Translation to Expand Catalog
By
Paula ParisiApril 24, 2025
Character.AI, a platform offering AI chatbots for socializing and role play, has released a video generation model called AvatarFX in closed beta. Promising the ability to make photorealistic images “come to life — speak, sing and emote — all with the click of a button,” the technology combines audio and video to create a variety of visual style and voice, from realistic 3D — including “non-human faces (like a favorite pet)” — to 2D animations, according to the company. AvatarFX also has the ability “to maintain strong temporal consistency with face, hand and body movement” and can “power videos with multiple speakers.” Continue reading Character.AI Introduces New Video Generator in Closed Beta
By
Paula ParisiApril 24, 2025
Instagram has released a standalone video editing tool called Edits that is being described as a full-fledged suite that also has camera capabilities. The resulting content can be released on any social platform, not just those from Meta Platforms, though an Instagram account is required to access Edits. Available worldwide for iOS and Android, Edits is positioned as a way for social videographers to level-up their Instagram or Facebook Reels, but also as a tool for professionals who want a simple mobile solution for short-form videos. Edits also offers analytics so creators can see how their work is performing. Continue reading Instagram ‘Edits’ Video App Is Released for iOS and Android
By
Paula ParisiApril 14, 2025
Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score