OpenAI Voice Cloning Tool Needs Only a 15-Second Sample

OpenAI has debuted a new text-to-voice generation platform called Voice Engine, available in limited access. Voice Engine can generate a synthetic voice from a 15-second clip of someone’s voice. The synthetic voice can then read a provided text, even translating to other languages. For now, only a handful of companies are using the tech under a strict usage policy as OpenAI grapples with the potential for misuse. “These small scale deployments are helping to inform our approach, safeguards, and thinking about how Voice Engine could be used for good across various industries,” OpenAI explained. Continue reading OpenAI Voice Cloning Tool Needs Only a 15-Second Sample

YouTube Adds GenAI Labeling Requirement for Realistic Video

YouTube has added new rules requiring those uploading realistic-looking videos that are “made with altered or synthetic media, including generative AI” to label them using a new tool in Creator Studio. The new labeling “is meant to strengthen transparency with viewers and build trust between creators and their audience,” YouTube says, listing examples of content that require disclosure as “likeness of a realistic person” including voice as well as image, “altering footage of real events or places” and “generating realistic scenes” of fictional major events, “like a tornado moving toward a real town.” Continue reading YouTube Adds GenAI Labeling Requirement for Realistic Video

Midjourney Creates a Feature to Advance Image Consistency

Artificial intelligence imaging service Midjourney has been embraced by storytellers who have also been clamoring for a feature that enables characters to regenerate consistently across new requests. Now Midjourney is delivering that functionality with the addition of the new “–cref” tag (short for Character Reference), available for those who are using Midjourney v6 on the Discord server. Users can achieve the effect by adding the tag to the end of text prompts, followed by a URL that contains the master image subsequent generations should match. Midjourney will then attempt to repeat the particulars of a character’s face, body and clothing characteristics. Continue reading Midjourney Creates a Feature to Advance Image Consistency

Alibaba’s EMO Can Generate Performance Video from Images

Alibaba is touting a new artificial intelligence system that can animate portraits, making people sing and talk in realistic fashion. Researchers at the Alibaba Group’s Institute for Intelligent Computing developed the generative video framework, calling it EMO, short for Emote Portrait Alive. Input a single reference image along with “vocal audio,” as in talking or singing, and “our method can generate vocal avatar videos with expressive facial expressions and various head poses,” the researchers say, adding that EMO can generate videos of any duration, “depending on the length of video input.” Continue reading Alibaba’s EMO Can Generate Performance Video from Images

ElevenLabs Promotes Its Latest Advances in AI Audio Effects

“What if you could describe a sound and generate it with AI?,” asks startup ElevenLabs, which set out to do just that, and says it has succeeded. The two-year-old company explains it “used text prompts like ‘waves crashing,’ ‘metal clanging,’ ‘birds chirping,’ and ‘racing car engine’ to generate audio.” Best known for using machine learning to clone voices, the AI firm founded by Google and Palantir alums has yet to make publicly available its new text-to-sound model but began teasing it by releasing online demos this week. Some see the technology as a natural complement to the latest wave of image generators. Continue reading ElevenLabs Promotes Its Latest Advances in AI Audio Effects

EU Makes Provisional Agreement on Artificial Intelligence Act

The EU has reached a provisional agreement on the Artificial Intelligence Act, making it the first Western democracy to establish comprehensive AI regulations. The sweeping new law predominantly focuses on so-called “high-risk AI,” establishing parameters — largely in the form of reporting and third-party monitoring — “based on its potential risks and level of impact.” Parliament and the 27-country European Council must still hold final votes before the AI Act is finalized and goes into effect, but the agreement, reached Friday in Brussels after three days of negotiations, means the main points are set. Continue reading EU Makes Provisional Agreement on Artificial Intelligence Act

CBS News Confirmed: New Fact-Checking Unit Examining AI

CBS is launching a unit charged with identifying misinformation and avoiding deepfakes. Called CBS News Confirmed, it will operate out of the news-and-stations division, ferreting out false information generated by artificial intelligence. Claudia Milne, senior VP of CBS News and Stations and its standards and practices chief will run the new group with Ross Dagan, EVP and head of news operations and transformation, CBS News and Stations. CBS plans to hire forensic journalists and will expand training and invest in technologies to assist them in their role. In addition to flagging deepfakes, CBS News Confirmed will also report on them. Continue reading CBS News Confirmed: New Fact-Checking Unit Examining AI

President Biden Signs Executive Order to Contain Risks of AI

President Biden has signed a far-ranging executive order establishing guardrails for artificial intelligence. Companies are now required to report to the federal government on risks related to their AI systems should they fall into the hands of terrorists or be used for weapons of mass destruction. The order also attempts to mitigate the dangers of deepfakes that could be used to manipulate elections or defraud consumers. “Deepfakes use AI-generated audio and video to smear reputations, spread fake news and commit fraud,” Biden said as he signed the order at the White House. Continue reading President Biden Signs Executive Order to Contain Risks of AI

Spotify Uses AI to Copy Host Voices for Podcast Translations

Spotify is using AI to drive podcast language translation in what sounds like the podcaster’s own voice, which has obvious implications for film and television dubbing. Working with podcast notables including Dax Shepard, Monica Padman and Bill Simmons, Spotify used AI to mimic their voices in Spanish, French and German for several episodes. The proprietary Spotify technology uses OpenAI’s new text-to-speech voice-generation technology as well as its open-source Whisper speech recognition system, which transcribes spoken words into text. The result, Spotify says, is “more authentic” and “more personal and natural” than traditional dubbing. Continue reading Spotify Uses AI to Copy Host Voices for Podcast Translations

OpenAI’s Latest Version of DALL-E Integrates with ChatGPT

OpenAI has released the DALL-E 3 generative AI imaging platform in research preview. The latest iteration features more safety options and integrates with OpenAI’s ChatGPT, currently driven by the now seasoned large language model GPT-4. That is the ChatGPT version to which Plus subscribers and enterprise customers have access — the same who will be able to preview DALL-E 3. The free chatbot is built around GPT-3.5. OpenAI says GPT-4 makes for better contextual understanding by DALL-E, which even in version 2 evidenced some glaring comprehension glitches. Continue reading OpenAI’s Latest Version of DALL-E Integrates with ChatGPT

Google Introduces an AI Watermark That Cannot Be Removed

Google DeepMind and Google Cloud have teamed to launch what they claim is an indelible AI watermark tool, which if it works would mark an industry first. Called SynthID, the technique for identifying AI-generated images is being launched in beta. The technology embeds its digital watermark “directly into the pixels of an image, making it imperceptible to the human eye, but detectable for identification,” according to DeepMind. SynthID is being released to a limited number of Google’s Vertex AI customers using Imagen, a Google AI language model that generates photorealistic images. Continue reading Google Introduces an AI Watermark That Cannot Be Removed

EU Urges Tech Companies to Label All AI-Generated Content

The European Union wants deepfakes and other AI-generated content labeled, and is pressing signatories to its Code of Practice on Online Disinformation to adopt technology that will clearly identify output that is generated or manipulated by machines. “The new AI technologies can be a force for good” that offers “new avenues for increased efficiency and creative expression. But, as always, we have to mention the dark side,” EU values and transparency commissioner Vera Jourova said, citing “new risks and the potential for negative consequences for society.” Continue reading EU Urges Tech Companies to Label All AI-Generated Content

Deezer Says Its Tech Can Flag and Delete Deepfake AI Tunes

Deezer, the global music streaming platform based in France, claims to have developed a technique for flagging — and potentially deleting — songs that use artificial intelligence to simulate the performance of popular singers. “We need to take a stand now,” Deezer CEO Jeronimo Folgueira said in an interview. “We are at a pivotal moment in music.” His company plans to “weed out illegal and fraudulent content” in an effort to protect artists. Deezer’s detection technology is still under development. It relies on AI, which Folgueira said he is not against if it is used ethically. Continue reading Deezer Says Its Tech Can Flag and Delete Deepfake AI Tunes

Twitter Community Notes Aim to Curb Impact of Fake Images

Twitter is emphasizing crowdsourced moderation. The launch of Community Notes for images in posts seeks to address instances where morphed or AI-generated images are posted. The idea is to expose altered content before it goes viral, as did the image of Pope Francis wearing a Balenciaga puffy coat in March and the fake image of an explosion at the Pentagon in May. Twitter says Community Notes about an image will appear with “recent and future” posts containing the graphic in question. Currently in the test phase, the feature works with tweets featuring a single image. Continue reading Twitter Community Notes Aim to Curb Impact of Fake Images

Music Industry Contends with Artificial Intelligence Disruption

There’s been a lot of noise recently about music generated by artificial intelligence tools. The clamor is on multiple fronts: generative mimicry of specific artists’ vocal styles, the potential to put Muzak-style background tunesmiths out of business with potentially cheaper alternatives, and the particulars of takedown orders. The matter came to a head this month after generative AI vocals prompted to sound like Drake and The Weeknd performed a song called “Heart on My Sleeve,” written and produced by a TikTok user. The tune quickly went viral, raising numerous concerns. Continue reading Music Industry Contends with Artificial Intelligence Disruption