VideoPoet: Google Launches a Multimodal AI Video Generator

Google has unveiled a new large language model designed to advance video generation. VideoPoet is capable of text-to-video, image-to-video, video stylization, video inpainting and outpainting, and video-to-audio generation. “The leading video generation models are almost exclusively diffusion-based,” Google says, citing Imagen Video as an example. Google finds this counterintuitive, since “LLMs are widely recognized as the de facto standard due to their exceptional learning capabilities across various modalities.” VideoPoet eschews the diffusion approach of relying on separately trained components for each task in favor of integrating many video generation capabilities in a single LLM.

MAGE AI Unifies Generative and Recognition Image Training

Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have introduced a computer vision system that combines image recognition and image generation technology into one training model instead of two. The result, MAGE (short for MAsked Generative Encoder), holds promise for a wide variety of use cases and is expected to reduce costs through unified training, according to the team. “To the best of our knowledge, this is the first model that achieves close to state-of-the-art results for both tasks using the same data and training paradigm,” the researchers said.

Nvidia’s Neuralangelo AI Turns 2D Video Clips into 3D Worlds

Nvidia Research is releasing a new AI model called Neuralangelo that can turn 2D iPhone video clips into 3D structures, virtually replicating sculptures, buildings and other real-world objects in great detail. Named for Michelangelo’s lifelike creations from blocks of marble, Neuralangelo is able to accurately capture repetitive texture patterns, homogeneous colors, and strong color variations, tasks that were problematic for earlier AI models. Neuralangelo accomplishes the feat using instant neural graphics primitives, the technology behind Nvidia Instant NeRF.

Report Points to Increase in Internet-Connected TVs in U.S.

Connected TV penetration has exceeded the 60 percent mark for broadband households in the U.S., according to new data from The Diffusion Group, indicating that an increasing number of consumers are interested in receiving entertainment services such as Netflix and Pandora. TDG’s January 2014 study found that 63 percent of broadband households have at least one Internet-connected TV, up from 53 percent at the same time last year. The numbers reflect smart TVs in addition to devices like game consoles and Internet sticks.