Meta’s AudioCraft Turns Words into Music with Generative AI

Meta Platforms is releasing AudioCraft, a generative AI framework that creates “high-quality,” “realistic” audio and music from text prompts. AudioCraft consists of three models: MusicGen, AudioGen and EnCodec, all of which Meta announced it is open-sourcing. Released in June, MusicGen was trained on Meta-owned and licensed music, and generates music from text prompts, while AudioGen, which was trained on public domain samples, generates sound effects (like honking horns and barking dogs) from text prompts. The EnCodec decoder allows “higher quality music generation with fewer artifacts,” according to Meta.

Cryptographic C2PA Protocol Pursues Labeling of AI Content

Launched two years ago, C2PA is an open-source Internet protocol that cryptographically encodes origin metadata into content. The protocol, a more secure form of watermarking, is being put forth as a way of disclosing when material has been created wholly or in part using artificial intelligence, something the White House has said it wants companies to do. Impending European Union regulations will also mandate that some tech platforms label images, audio, and video generated by artificial intelligence using “prominent markings.” More than 1,500 companies are involved with C2PA through the Content Authenticity Initiative, making it a viable solution.

Study: Smart TVs Are Now in 74 Percent of American Homes

About three in four U.S. homes now have a smart TV, and smart TVs account for three in five TV sets, according to the fifth annual Hub Entertainment Research “Evolution of the TV Set” survey, which found that streaming is growing in step with penetration of the intelligent displays. About 64 percent of viewers use their smart TVs to stream video, while roughly half use the connected devices to stream music or other audio content, the study found. The 74 percent of households that own at least one smart TV is up from 61 percent in 2020. Additionally, Horowitz Research found that consumers are increasingly turning to curated collections and hubs for content discovery.

ByteDance Bows Ripple AI for Music Creation, Audio Editing

China’s ByteDance is testing an AI tool called Ripple. The free app for creating music and editing audio is available in closed beta in the U.S. to a small group of invited testers. Aimed at creators who want to up their sound game, Ripple is designed as a portable, smart digital audio workstation (DAW). It incorporates what ByteDance, TikTok’s parent company, calls a “virtual recording studio” that allows users to record and edit audio files on a mobile device, and the company plans to release additional mobile-friendly audio tools.

SiriusXM to Close Its Stitcher Podcast App and Site in August

SiriusXM is shuttering its Stitcher podcasting app and merging podcast delivery into its flagship SiriusXM subscription offerings. As of August 29, “the Stitcher app and web listening experience will be disabled,” the company told users this week. Stitcher offered listeners the choice of free-to-listen ad-supported programs or a la carte show subscriptions. It also had the $4.99 per month ($34.99 per year) Stitcher Premium, providing a wide variety of ad-free podcasts. “Subscribers can listen to podcasts within the SiriusXM app and will see an all-new listening experience later this year,” the company said.

RIAA Alleges Popular ‘AI Hub’ on Discord Violates Copyright

The AI Hub server on Discord has drawn attention from the Recording Industry Association of America, which sent a DMCA takedown notice and is alleging copyright infringement. The server’s users are said to share a wide range of AI voice models, including some based on recognizable performers. Those that may sound familiar are in the style of Stevie Wonder, Frank Sinatra, Rihanna and Bruno Mars. AI Hub reportedly has more than 142,000 members who share topical information, such as guides. One point drawing particular attention is the RIAA’s demand that Discord identify the accused infringers.

Meta Creates Voicebox Generative AI Model for Audio Synth

Meta Platforms has unveiled Voicebox, an AI model that can produce high-quality audio clips and edit pre-recorded audio. For speech generation, it uses what Meta calls “in-context learning” to accomplish tasks it was not specifically trained for. The company says Voicebox is first in class with this type of generalized learning for audio. Untrained tasks include sampling, stylizing and editing. As an editor, it can isolate and remove sounds like car horns and background animal noise while preserving the content and style of the source audio. The multilingual model generates speech in six languages.

Meta’s MusicGen AI Works with Language and Song Prompts

Meta Platforms has debuted what’s being called “ChatGPT for audio.” MusicGen is an AI music generator that can create tunes from natural language or song snippets. The company says MusicGen was trained on 20,000 hours of music, including 10,000 hours of “high-quality” licensed songs and 390,000 instrumental tracks. Meta released MusicGen on GitHub this past weekend, and is currently demoing the app on Facebook’s Hugging Face page. Visitors can generate tunes by describing the sound they want. Among Meta’s prompts: “80s driving pop song with heavy drums and synth pads in the background.”

Meta Develops Computer Vision AI That Learns Like Humans

Meta Platforms continues to make progress on a mission to develop artificial intelligence that can teach itself how the world works. Chief AI Scientist Yann LeCun has taken a special interest in developing the new model, called Image Joint Embedding Predictive Architecture, or I-JEPA, which learns by building an internal representation of the outside world and analyzing image abstracts instead of comparing pixels. The approach allows AI tech to learn more like humans do, with their ability to figure out complex tasks and adapt to new situations.

Deezer Says Its Tech Can Flag and Delete Deepfake AI Tunes

Deezer, the global music streaming platform based in France, claims to have developed a technique for flagging, and potentially deleting, songs that use artificial intelligence to simulate the performance of popular singers. “We need to take a stand now,” Deezer CEO Jeronimo Folgueira said in an interview. “We are at a pivotal moment in music.” His company plans to “weed out illegal and fraudulent content” in an effort to protect artists. Deezer’s detection technology is still under development. It relies on AI, which Folgueira said he is not against if it is used ethically.

Meta’s Open-Source ImageBind Works Across Six Modalities

Meta Platforms has built and is open-sourcing ImageBind, an artificial intelligence model that combines six modalities: audio, visual, text, thermal, movement and depth data. Currently a research project, it suggests a future in which AI models generate multisensory content. “ImageBind equips machines with a holistic understanding that connects objects in a photo with how they will sound, their 3D shape, how warm or cold they are, and how they move,” Meta says. In other words, ImageBind’s approach more closely approximates human thinking by training on the relationship between things rather than ingesting massive datasets so as to absorb every possibility.

Google’s PaLM API, MakerSuite Coming to Select Developers

Google is readying an API and other enterprise tools for its Pathways Language Model (PaLM), a large language model similar to GPT, to encourage developers to create chatbots and other apps using the platform. PaLM is one of Google’s most advanced systems, with the capability to generate text, images, code, video and audio from natural language prompts. Much like OpenAI’s GPT series and the LLaMA family from Meta Platforms, it is suitable for a wide variety of general tasks. To facilitate PaLM’s use for specific tasks, Google is launching the MakerSuite along with the PaLM API.

Discord Integrates OpenAI Tech, Updates AI-Driven Features

Chat app Discord is expanding the use of artificial intelligence on its platform, including the addition of OpenAI technology to its chatbot and moderation features. Discord says it has 150 million users across 19 million interest groups, called “servers,” who communicate using text, audio and video chat. Discord’s Midjourney text-to-image generation group is its largest community, with more than 13 million members. “Harnessed properly, AI can fundamentally enhance and empower genuine human connection,” Discord CEO Jason Citron said at a press event last week, heralding “the most exciting moments in technology emerging.”

ETC Releases Next Section of Virtual Production White Paper

The Entertainment Technology Center@USC has released the second installment of its case study, “Fathead: Virtual Production & Beyond.” Section 2 of the four-part white paper is “Sound Mitigation: Performance Matters,” which features compelling interviews with “Fathead” co-producer Brandyn Johnson and former Sony Pictures executive Eric Rigney. The section also addresses “the challenges of recording clean dialogue on LED volumetric stages and in-camera visual effects (ICVFX) during production.” Click here to access Section 2 and the previously released Section 1, “Cloud Computing: Growth Without Bounds.” We’ll post announcements when the remaining two sections become available.

Spotify Launches New Video Feed to Keep Listeners Listening

Spotify is adding new features that will allow for more social expression and help users discover new music, among other things. The audio streaming giant is adding a video feed designed to recommend songs, podcasts and audiobooks via short clips, like those found on TikTok, YouTube Shorts and Instagram. “Previews,” as they’re called, allow users to swipe through content recommendations. Generated by algorithm or configured by an artist or podcaster, the short videos are meant to encourage users to dive deeper into something new or save it for later.