Dialogue Archives - ETCentric

Adobe Firefly Adds Third-Party Models to Generative AI Suite

By Paula Parisi
October 30, 2025

To coincide with Adobe MAX 2025 in Los Angeles, Adobe has released a new version of its generative AI tool Firefly, calling it “your all-in-one creative AI studio.” Firefly now offers under one subscription a collection of models that include not only Firefly Image Model 5 (in public beta) but those from partners including Google, OpenAI, Luma AI, ElevenLabs, Topaz Labs, and more. From concept to final product, Firefly is attempting to support every phase of the workflow with AI to generate music, narration and video clips, while supporting areas such as ideation and editing, Adobe says. Continue reading Adobe Firefly Adds Third-Party Models to Generative AI Suite

Google Veo 3.1 Advances Generative Video in Flow and Vertex

By Paula Parisi
October 24, 2025

Google has released Veo 3.1 and Veo 3.1 Fast in paid preview, adding new capabilities to the generative video model that is already a leader in the field. Creative and technical upgrades include richer native audio from dialogue to sound effects, greater understanding of cinematic styles and better prompt adherence. The two new models are available via the Gemini API in Google AI Studio and Vertex AI, with Veo 3.1 also available in the Gemini app and the storytelling tool Flow, which now gets native audio. Flow has generated more than 275 million videos since its release at Google I/O in May, according to the company. Continue reading Google Veo 3.1 Advances Generative Video in Flow and Vertex

OpenAI Sora 2 Vid Generator Has Sound and Social Features

By Paula Parisi
October 2, 2025

Sora 2 is here, “marking a giant leap forward in realism,” claims OpenAI. And it includes sound and dialogue generation, catching up to Google’s Veo 3. Coming nearly two years after Sora was first introduced, the new model is being released in conjunction with a free iOS social app with a vertical feed and “swipe-and-scroll” functionality like TikTok, YouTube Shorts and Instagram Reels. Available in the U.S. and Canada, the fee version — which currently requires an invitation — is also available at sora.com. ChatGPT Pro subscribers can access an experimental, higher quality Sora 2 Pro model online only. Continue reading OpenAI Sora 2 Vid Generator Has Sound and Social Features

Grok Imagine from xAI Offers Video Generation, ‘Spicy’ Mode

By Paula Parisi
August 8, 2025

Grok Imagine is xAI’s new video and image generator, which is currently available via the X social platform, the Grok mobile app, and Grok web interface. Imagine replaces AI image generator Aurora, which was retired in May following a string of offensive posts that led to media criticism and user concerns. Despite the backlash, Elon Musk’s xAI seems determined to have Imagine push conventional limits, with a “spicy” mode that outputs imagery including adult content. Its text-to-image capabilities work with text or voice prompts, while the video tool relies on image prompts to make short clips using images from a user’s gallery or generated by Grok. Continue reading Grok Imagine from xAI Offers Video Generation, ‘Spicy’ Mode

Adobe Adds Generative Audio and Text-to-Avatar to Firefly AI

By Paula Parisi
July 21, 2025

Adobe’s Firefly Video model has introduced new updates including Generate Sound Effects, in beta, and a text-to-avatar feature that lets users turn scripts into avatar-led videos “in just a few clicks.” Firefly becomes the second video model to generate audio, joining Veo 3, although unlike Google’s AI video tool Firefly does not yet generate dialogue. What it can do is output foley-like sound and sound effects, while text-to-avatar can generate speech. As with Firefly’s generative visuals, Adobe says Generate Sound Effects is “commercially safe,” which means they are trained only on licensed or publicly available material. Continue reading Adobe Adds Generative Audio and Text-to-Avatar to Firefly AI

Google Offers Gemini AI Subscribers Photo-to-Video Function

By Paula Parisi
July 15, 2025

Google has added photo-to-video capability to its Gemini AI app. Powered by Veo 3, Google’s latest generative video model, launched in May, Gemini AI can now turn images into 8-second videos complete with AI-generated sound including speech, environmental sounds and background noises. Available now via the Web to anyone with a $20 per month Google AI Pro subscription or those on the $125 per quarter Google AI Ultra plan, the new feature is also being released to mobile users this month for both iOS and Android devices. The videos are finished as 720p resolution MP4 files in 16:9 landscape format. Continue reading Google Offers Gemini AI Subscribers Photo-to-Video Function

Copyright Office Says AI ‘Assisted’ Content Can Be Protected

By Paula Parisi
January 31, 2025

The U.S. Copyright Office has released Part 2 of its report on artificial intelligence, dealing with the legal and policy issues pertaining to copyright and generative AI. The two main takeaways are that legal questions concerning copyrightability and AI can be settled using existing federal law, requiring no legislative change. Also, “where AI ‘merely assists’ an author in the creative process, it does not change the copyrightability of the output.” Additionally, it reaffirms that any work created entirely by prompts (content “entirely generated by AI”) cannot be protected by copyright. Continue reading Copyright Office Says AI ‘Assisted’ Content Can Be Protected

YouTube AI Song Eraser Easily Removes Copyright Material

By Paula Parisi
July 10, 2024

YouTube has released an eraser tool update that makes it easy to remove copyrighted music from videos without disturbing the remaining audio, like dialogue and sound effects. The Erase Song update uses an AI algorithm to detect and remove the offending material, making it more accurate than what had previously been available, as well as easier. Creators whose material has Content ID claims can now excise the objectionable material without having to manually edit and upload a new video, thereby avoiding potential restrictions on where the video is viewable or whether it can be monetized. Continue reading YouTube AI Song Eraser Easily Removes Copyright Material

DeepMind’s V2A Generates Music, Sound Effects, Dialogue

By Paula Parisi
June 19, 2024

Google DeepMind has unveiled new research on AI tech it calls V2A (“video-to-audio”) that can generate soundtracks for videos. The initiative complements the wave of AI video generators from companies ranging from biggies like OpenAI and Alibaba to startups such as Luma and Runway, all of which require a separate app to add sound. V2A technology “makes synchronized audiovisual generation possible” by combining video pixels with natural language text prompts “to generate rich soundscapes for the on-screen action,” DeepMind writes, explaining that it can “create shots with a dramatic score, realistic sound effects or dialogue.” Continue reading DeepMind’s V2A Generates Music, Sound Effects, Dialogue

Alibaba’s EMO Can Generate Performance Video from Images

By ETCentric Staff
March 11, 2024

Alibaba is touting a new artificial intelligence system that can animate portraits, making people sing and talk in realistic fashion. Researchers at the Alibaba Group’s Institute for Intelligent Computing developed the generative video framework, calling it EMO, short for Emote Portrait Alive. Input a single reference image along with “vocal audio,” as in talking or singing, and “our method can generate vocal avatar videos with expressive facial expressions and various head poses,” the researchers say, adding that EMO can generate videos of any duration, “depending on the length of video input.” Continue reading Alibaba’s EMO Can Generate Performance Video from Images

Pika Taps ElevenLabs Audio App to Add Lip Sync to AI Video

By ETCentric Staff
March 1, 2024

On the heels of ElevenLabs’ demo of a text-to-sound app unveiled using clips generated by OpenAI’s text-to-video artificial intelligence platform Sora, Pika Labs is releasing a feature called Lip Sync that lets its paid subscribers use the ElevenLabs app to add AI-generated voices and dialogue to Pika-generated videos and have the characters’ lips moving in sync with the speech. Pika Lip Sync supports both uploaded audio files and text-to-audio AI, allowing users to type or record dialogue, or use pre-existing sound files, then apply AI to change the voicing style. Continue reading Pika Taps ElevenLabs Audio App to Add Lip Sync to AI Video

CES: Voiseed Upgrades Its Platform for Expressive AI Voices

By Paul Bennun
January 16, 2024

Milano-based Voiseed demonstrated its web-based Revoiceit platform at CES, pitched as the best way to manage synthetic voice actors, particularly ensuring that synthetic voices present realistic emotions. The company describes it as a cloud-based solution that uses “generative AI to infuse virtual voices with human emotions and prosody, creating highly expressive, lifelike audio experiences.” While Revoiceit’s most obvious feature is its Studio (imagine Adobe Audition devoted to second-by-second management of voices), it may well be the product’s forthcoming API that provides real value to developers of entertaining technology products. Continue reading CES: Voiseed Upgrades Its Platform for Expressive AI Voices

Adobe Reveals Its New AI Tool for Editing Problematic Audio

By Paula Parisi
November 22, 2023

Adobe has unveiled Project Sound Lift, an AI-powered technology that separates speech recordings into discrete tracks of voices, non-speech sounds and other background noise in video. The company describes Project Sound Lift as “a one-click solution” that leverages AI to help users easily manipulate audio recordings “across a range of scenarios” to “enhance, transform, and control speech and sound independently.” Adobe’s existing Enhance Speech technology, available in the company’s Premiere Pro editing program, has been integrated within Project Sound Lift to aid creators in producing studio-quality audio content. Continue reading Adobe Reveals Its New AI Tool for Editing Problematic Audio

Game Creators Are Now Testing the Benefits of Generative AI

By Paula Parisi
March 2, 2023

Game developers are harnessing the power of generative AI to improve the state of play. With hundreds of computer-controlled characters, many of whom have incidental roles, the goal of giving these bit players the ability to spout some meaningful dialogue, should a player cross their path, is one potential use for chatbot text. Sony’s Haven Studios is using GenAI to quickly mock-up characters, while Roblox is developing an AI system it plans to let users leverage to create digital objects and build-out virtual worlds based on text prompts. Continue reading Game Creators Are Now Testing the Benefits of Generative AI