Apple’s Keyframer AI Tool Uses LLMs to Prototype Animation

Apple has taken a novel approach to animation with Keyframer, using large language models to add motion to static images through natural language prompts. “The application of LLMs to animation is underexplored,” Apple researchers say in a paper that describes Keyframer as an “animation prototyping tool.” Based on input from animators and engineers, Keyframer lets users refine their work through “a combination of prompting and direct editing,” the paper explains. The LLM can generate CSS animation code. Users can also use natural language to request design variations.

Apple Launches Open-Source Language-Based Image Editor

Apple has released MGIE, an open-source AI model that edits images using natural language instructions. MGIE, short for MLLM-Guided Image Editing, can also modify and optimize images. Developed in conjunction with the University of California, Santa Barbara, MGIE is Apple’s first AI model. The multimodal MGIE, which understands text and image input, also crops, resizes, flips, and adds filters based on text instructions, using what Apple says is an easier instruction set than other AI editing programs. It is also simpler and faster than learning a traditional program like Apple’s own Final Cut Pro.

VideoPoet: Google Launches a Multimodal AI Video Generator

Google has unveiled a new large language model designed to advance video generation. VideoPoet is capable of text-to-video, image-to-video, video stylization, video inpainting and outpainting, and video-to-audio. “The leading video generation models are almost exclusively diffusion-based,” Google says, citing Imagen Video as an example. Google finds this counterintuitive, since “LLMs are widely recognized as the de facto standard due to their exceptional learning capabilities across various modalities.” VideoPoet eschews the diffusion approach of relying on separately trained tasks in favor of integrating many video generation capabilities in a single LLM.

Stability AI Is Offering Paid Membership for Commercial Users

As the pressure ratchets up for AI companies to go beyond the wow factor and make money, Stability AI has formalized three subscription tiers as it seeks to expand commercial use of its open-source, multimodal core models. The Stability AI Membership offerings include a free tier for personal and research (i.e., non-commercial) use, a professional tier that costs $20 a month, and a custom-priced enterprise tier for large outfits. The company says that with the three tiers it is “striking a balance between fostering competitiveness and maintaining openness in AI technologies.”

GenAI Lets Snapchat+ Subscribers Create and Share Images

Snapchat+ is rolling out new artificial intelligence features that let subscribers use text prompts to create generative AI images to share with friends. In addition, the Dreams feature, which creates generative AI selfies, is now able to add your friends to those photos. Snapchat+ subscribers get one pack of 8 Dreams per month as part of their $3.99 monthly fee. An onscreen button labeled “AI” lets subscribers access the AI image generator to choose from a menu of prompts (including “sunny day at the beach” and “planet made of cheese”) or they can enter their own descriptions.

Adobe Reveals Its New AI Tool for Editing Problematic Audio

Adobe has unveiled Project Sound Lift, an AI-powered technology that separates speech recordings into discrete tracks of voices, non-speech sounds and other background noise in video. The company describes Project Sound Lift as “a one-click solution” that leverages AI to help users easily manipulate audio recordings “across a range of scenarios” to “enhance, transform, and control speech and sound independently.” Adobe’s existing Enhance Speech technology, available in the company’s Premiere Pro editing program, has been integrated within Project Sound Lift to aid creators in producing studio-quality audio content.

Meta Touts Its Emu Foundational Model for Video and Editing

Having made the leap from image generation to video generation over the course of a few months in 2022, Meta Platforms introduces Emu, its first visual foundational model, along with Emu Video and Emu Edit, positioned as milestones in the trek to AI moviemaking. Emu uses just two diffusion models to generate 512×512 four-second-long videos at 16 frames per second, Meta said, comparing that to 2022’s Make-A-Video, which requires a “cascade” of five models. Internal research found Emu video generations were “strongly preferred” over the Make-A-Video model based on quality (96 percent) and prompt fidelity (85 percent).

Shutterstock Offers AI Image Editor for Massive Stock Library

Creative image platform Shutterstock has added AI-powered editing features that provide “the potential for infinite options to refine and perfect images” in the company’s library of more than 700 million stock selections. A go-to source for brand marketers and digital media companies, Shutterstock is offering six signature AI capabilities as well as secondary features such as a virtual AI design assistant and advanced filters under the umbrella Creative AI. What’s more, Shutterstock says it will compensate its licensed artists when their images are edited with AI.

OpenAI Developing ‘Provenance Classifier’ for GenAI Images

OpenAI is developing an AI tool that can identify images created by artificial intelligence — specifically those made in whole or part by its Dall-E 3 image generator. Calling it a “provenance classifier,” company CTO Mira Murati began publicly discussing the detection app last week but said not to expect it in general release anytime soon. This, despite Murati’s claim it is “almost 99 percent reliable.” That is still not good enough for OpenAI, which knows there is much at stake when the public perception of artists’ work can be impacted by a filter applied by AI, which is notoriously capricious.

NBC Streamer SportsEngine Play Targets $37B Youth Market

NBC Sports Next has launched a subscription amateur sports streaming service geared toward the youth market. SportsEngine Play will also offer a free tier for live and on-demand content centered on its target audience. The service leverages the technology acquired with Rapid Replay, a streaming startup purchased by NBC in September 2022. The new service is among a dozen related brands NBC has purchased over the years, including a specialty software company called Sports Ngin that the company bought in 2016 to make apps for youth sports organizations and leagues.

Captions Debuts AI Lipdub with Translation and Gen Z Slang

Captions, which leverages AI to help its customers produce “studio quality videos directly from their mobile devices,” has launched a new app called Lipdub that automatically translates and dubs content into 28 languages. The free download lets users dub anyone “and experience familiar voices and faces in a suite of new languages.” Lipdub’s translations not only duplicate what the company says is “the subject’s exact voice,” but also sync lip movements to match. The app also incorporates dialects and idioms, with options like Gen Z and Texas slang.

Adobe Launches Web Version of Photoshop with AI Features

Adobe has officially added Photoshop on the web as one of its Photoshop plans. The web version is geared to Photoshop newbies and comes complete with Adobe Firefly generative AI features including Generative Fill and Generative Expand. Adobe called it “a major milestone” since introducing Photoshop on the web in beta two years ago, starting with “an early preview of image editing capabilities.” Features now available for commercial use on the web include the ability to easily add or remove elements from any image, change a background, expand the frame, and create visuals using text-based prompts.

AI Startup Capsule Creates Video Editor for Enterprise Teams

AI tech startup Capsule is debuting a video editor it says can help enterprise teams achieve results “10x faster.” “Today, if you work at a large company — in marketing or comms, or maybe even sales or HR — creating even the simplest video can be daunting,” Capsule suggests. After querying more than 300 such enterprise teams about their pain points, Capsule focused on three areas of improvement: simplifying motion graphics, adhering to strict brand guidelines, and making the editing process more collaborative among teams across desktop and mobile, where apps are typically “siloed.”

YouTube Debuts AI Creator Tools for Shorts and Mobile Video

YouTube announced a new AI-powered creator tool called Dream Screen and hinted at things to come at its annual Made on YouTube event last week. Rolling out on a test basis to select YouTube Shorts creators later this year, Dream Screen will generate “fantastic settings” for video backgrounds. YouTube says it will “eventually” allow creators to “instantly reimagine” their videos by typing an idea prompt that results in new content created with AI tools. The company also announced it will have AI-generated recommendations for video ideas and machine learning assistance with music track search.

GoPro’s Hero 12 Black Offers Something for Pros and Creators

While mainly used by creators for extreme sports photography, the tiny GoPro Hero had its close-up moment in 2015’s “The Martian,” where it was used to capture the effect of driving a rover across the surface of Mars. It was also used for some action shots in 2021’s “The King’s Man.” Changes to the new Hero 12 Black bring the action cam closer to something storytellers might want to try. Upgrades include the GoPro Enduro battery, with up to 2x longer runtime than its predecessor, and the Max Lens Mod 2.0 accessory, which enables a vast 177-degree field-of-view at 4K/60 fps.