ElevenLabs Promotes Its Latest Advances in AI Audio Effects

“What if you could describe a sound and generate it with AI?,” asks startup ElevenLabs, which set out to do just that, and says it has succeeded. The two-year-old company explains it “used text prompts like ‘waves crashing,’ ‘metal clanging,’ ‘birds chirping,’ and ‘racing car engine’ to generate audio.” Best known for using machine learning to clone voices, the AI firm founded by Google and Palantir alums has yet to make publicly available its new text-to-sound model but began teasing it by releasing online demos this week. Some see the technology as a natural complement to the latest wave of image generators. Continue reading ElevenLabs Promotes Its Latest Advances in AI Audio Effects

Adobe Reveals Its New AI Tool for Editing Problematic Audio

Adobe has unveiled Project Sound Lift, an AI-powered technology that separates speech recordings into discrete tracks of voices, non-speech sounds and other background noise in video. The company describes Project Sound Lift as “a one-click solution” that leverages AI to help users easily manipulate audio recordings “across a range of scenarios” to “enhance, transform, and control speech and sound independently.” Adobe’s existing Enhance Speech technology, available in the company’s Premiere Pro editing program, has been integrated within Project Sound Lift to aid creators in producing studio-quality audio content. Continue reading Adobe Reveals Its New AI Tool for Editing Problematic Audio

Sony Xperia 5 V Phone Is Built for Creators and Entertainment

Unveiled at IFA 2023, Sony’s Xperia 5 V is a smartphone aimed at creators that also happens to be an entertainment powerhouse. Features include a 6.1-inch OLED HD+ screen, Bravia X1 mobile image engine, and a front stage sound system with “powerful deep bass” that Sony says eliminates the need for external speakers. Building on the preceding model, upgrades include the Video Creator app, which edits select video content into a finished clip in about a minute. This new iteration shoots 4K video at up to 120fps in HDR and features optical image stabilization. A dedicated microphone can isolate and capture the main vocal amidst environmental noise and other voices. Continue reading Sony Xperia 5 V Phone Is Built for Creators and Entertainment

Films Shot ‘In-World’ at Mona Get Their Own Shorts Festival

Films shot using the 3D world-building tool Mona will compete in what is being billed as “the world’s first metaverse short film festival.” Running September 26-29, the Mona Shorts Fest will take place in the immersive environment app that CEO Justin Melillo has coined “the Monaverse.” “With an entire film studio at your fingertips through Mona’s SDK and in-world experience, imagine what story you could tell,” the festival’s website suggests. Last summer, the company announced more than $14 million in Series A funds after “thousands of creators” used the platform to build experiences for Web3. Continue reading Films Shot ‘In-World’ at Mona Get Their Own Shorts Festival

ByteDance Bows Ripple AI for Music Creation, Audio Editing

China’s ByteDance is testing an AI tool called Ripple. The free app for creating music and editing audio is being made available in closed beta in the U.S. with a small group of invited testers. Aimed at creators who want to up their sound game, Ripple is designed in the manner of a portable smart digital audio workstation (DAW). Ripple incorporates what TikTok’s parent company ByteDance calls a “virtual recording studio” that allows users to record and edit audio files on a mobile device, and the company plans to release additional mobile-friendly audio tools. Continue reading ByteDance Bows Ripple AI for Music Creation, Audio Editing

Meta Develops Computer Vision AI That Learns Like Humans

Meta Platforms continues to make progress on a mission to develop artificial intelligence that can teach itself to learn how the world works. Chief AI Scientist Yann LeCun has taken a special interest in developing the new model, called Image Joint Embedding Predictive Architecture, or I-JEPA, which learns by building an internal representation of the outside world and analyzing image abstracts instead of comparing pixels. The approach allows AI techto learn more like humans do, with their ability to figure out complex tasks and adapt to new situations. Continue reading Meta Develops Computer Vision AI That Learns Like Humans

AMPAS Diversifies with a Production and Technology Branch

The Academy of Motion Picture Arts and Sciences has formed a Production and Technology Branch to include about 400 individuals previously classified as members-at-large. The branch “represents members working in key technical and production positions in all phases of filmmaking, from pre- to post-production,” AMPAS says, listing chief technology officers, senior department heads in tech and creative services, and preservation and restoration specialists as among the inductees. In addition, credited production roles — including stunt coordinators, script supervisors, choreographers, music supervisors, colorists, line producers and associate producers — will be part of the branch. Continue reading AMPAS Diversifies with a Production and Technology Branch

ETC Releases Next Section of Virtual Production White Paper

The Entertainment Technology Center@USC has released the second installment of its case study, “Fathead: Virtual Production & Beyond.” Section 2 of the four-part white paper is “Sound Mitigation: Performance Matters,” which features compelling interviews with “Fathead” co-producer Brandyn Johnson and former Sony Pictures executive Eric Rigney. The section also addresses “the challenges of recording clean dialogue on LED volumetric stages and in-camera visual effects (ICVFX) during production.” Click here to access Section 2 and the previously released Section 1, “Cloud Computing: Growth Without Bounds.” We’ll post announcements when the remaining two sections become available. Continue reading ETC Releases Next Section of Virtual Production White Paper

New LG Miraclass LEDs Offer Cinemas Projection Alternative

LG is bringing to market a line of LED screens designed for small movie theaters, the LG Miraclass line. The four initial models range in size from 16- to 46-feet wide, offering 24-bit color and resolution from 2K to 4K. The screens are comprised of smaller panels featuring self-emissive LED pixels that run edge-to-edge (eliminating visible seams). Brightness adjusts across five stages — from 48 to 300 nits  — making the displays suitable for a range of use-cases, from the brightness required for 2D/3D movies to less light-intensive corporate presentations. Continue reading New LG Miraclass LEDs Offer Cinemas Projection Alternative

Netflix Premium Offers 700+ Titles That Feature Spatial Audio

Netflix is hoping that offering cinema-like spatial audio on a catalog of more than 700 titles for those who subscribe to its $20 per month premium tier will encourage more customers to upgrade. Spatial audio is engineered to deliver a 360-degree audio experience, including sounds that appear to come from behind the viewer. Customers who subscribe to Netflix’s Standard or Basic plans will have access to some spatial audio titles “but only on a small, select number,” the company said, describing the audio tech as “a game changer for the Premium plan viewing experience.” Netflix Premium already supports 4K, HDR, Dolby Atmos and Netflix Calibrated Mode. Continue reading Netflix Premium Offers 700+ Titles That Feature Spatial Audio

Google’s MusicLM AI Can Generate Tunes from Text Prompts

Google is introducing a new artificial intelligence app called MusicLM that creates music in any style or genre based on text prompts and can translate a whistled melody or casually hummed snipped into instrument sounds. TechCrunch calls the technology “impressive” but says the Alphabet company “fearing the risks, has no immediate plans to release it,” in recognition of the controversy surrounding AI models trained using copyrighted material. MusicLM was created using a dataset of 280,000 musical hours, resulting in the ability to generate minutes-long songs of “significant complexity.” Continue reading Google’s MusicLM AI Can Generate Tunes from Text Prompts

CES: Kardome Audio Solution Could Serve Virtual Production

The rise of virtual production stages has elevated the importance of finding a solution to echoing and sound distortion on the set. A company called Kardome that has created a sound isolation technology and “Spatial Hearing” solution for automobile cabins may have a solution applicable to the virtual stage. At CES, the company was demonstrating targeted speech and voice tech that brings clarity to speech recognition devices. According to Kardome, its “AI-driven Spatial Hearing and noise reduction technology facilitate a seamless voice recognition experience in any acoustic environment, from the quiet to the chaotic.” Continue reading CES: Kardome Audio Solution Could Serve Virtual Production

CES: Startup Leverages AI to Address Problematic Acoustics

There are a growing number of companies working on technologies that strive to make a person’s voice more intelligible to the listener over speakers, headphones, hearing aids and other consumer audio devices. Augmented Hearing, a Danish startup launched two years ago, is one of the more interesting companies at CES 2023 focusing on this space. The firm’s software-based solution runs on iOS, Windows and other CE operating systems. Their solution could mitigate the current trend of people across all age groups turning on closed captioning because they often find video dialogue difficult to understand. Continue reading CES: Startup Leverages AI to Address Problematic Acoustics

CES: Targeted Sensory Immersion at the Japanese Exhibition

The sluggish consumer adoption of virtual reality has pointed to a broader hesitation with immersive technologies that separate the user from their environment. In response, a niche market has evolved for technologies that unobtrusively live on the body while contributing an augmented sensory experience when needed. (See earrings that also offer directional audio from a company called Nova, for instance.) The Japanese exhibit section of the CES Eureka Park startup arena, however, showcased a more environmentally integrated — although still individualized — vision of immersion. Continue reading CES: Targeted Sensory Immersion at the Japanese Exhibition

Subtitles, Closed Captioning Popular Among Young Viewers

More people than ever are using subtitles — often in their native language, to help follow-along with indiscernible audio, according to a study by language-teaching app Preply. Netflix released figures indicating more than 80 percent of its subscribers used subtitles (or closed captions) once a month or more. And the trend is not limited to seniors; younger viewers are about four times more likely to turn on subtitles. The prevalence of rear-facing, or downward-directed speakers in today’s ultra-thin TVs has compounded the problem, often resulting in worse audio than the old-fashioned TV sets, which had front-facing speakers. But there are other issues affecting TV audio. Continue reading Subtitles, Closed Captioning Popular Among Young Viewers