By Paula Parisi
December 17, 2024
Meta’s FAIR (Fundamental AI Research) team has unveiled recent work in areas ranging from transparency and safety to agents and machine learning architectures. The projects include Meta Motivo, a foundation model for controlling the behavior of virtual embodied agents, and Video Seal, an open-source model for video watermarking. All were developed in the unit’s pursuit of advanced machine intelligence, helping “models to learn new information more effectively and scale beyond current limits.” Meta announced it is sharing the new FAIR research, code, models and datasets so the research community can build upon its work. Continue reading Meta Rolls Out Watermarking, Behavioral and Concept Models
By Douglas Chan
December 17, 2024
Santa Monica-based Snapchat announced a new Monetization Program for content creators this week that will feature expanded revenue opportunities and evolving rewards. Beginning February 1, creators who have at least 50,000 followers and post at least 25 times each month to Saved Stories or Spotlight videos will have the option to place ads in videos that are longer than one minute. Eligible creators will also need to meet one of the following criteria in the most recent month: 10 million Snap views, one million Spotlight views, or 12,000 hours of total view time. According to Snap, Spotlight video viewership is up 25 percent year-over-year. Continue reading Snapchat to Empower Creators with Video Monetization Plan
CES 2025, taking place the week of January 5 in Las Vegas, is expected to focus on artificial intelligence, with a wave of innovative offerings on display, whether practical, visionary or experimental. As we stand on the brink of transformative change, it’s worth recalling that early AI models often fell short when they attempted to mimic human methods. As we approach CES in service to the entertainment industry, we’ll be most interested in products that use AI’s constant advance to assist and amplify human potential, particularly media applications that impact the next generation of compelling stories, production techniques, and consumer experiences. Continue reading CES Preview: Standing on the Brink of Transformative Change
By Hank Gerba
December 16, 2024
Google has introduced Gemini 2.0, the latest version of its multimodal AI model, signaling a shift toward what the company is calling “the agentic era.” The upgraded model not only promises to outperform previous iterations on standard benchmarks but also introduces more proactive, or agentic, functions. The company announced that “Project Astra,” its experimental assistant, would receive updates that allow it to use Google Search, Lens, and Maps, and that “Project Mariner,” a Chrome extension, would enable Gemini 2.0 to navigate a user’s web browser to complete tasks autonomously. Continue reading Google Releases Gemini 2.0 in Shift Toward Agentic Era of AI
By Paula Parisi
December 16, 2024
Google has unveiled Android XR, an operating system for headsets and smart glasses powered by Google’s Gemini AI large language model. Samsung confirmed that it will release an extended reality headset that runs on Android XR sometime in 2025. Samsung worked closely with Google and Gemini throughout 2023, leading up to the Galaxy S24 series of smartphones that debuted in January 2024. Google announced the release of the Android XR SDK Developer Preview so new apps can be built and existing ones ported over to the new platform to support Samsung’s new headset and other devices. Continue reading Android XR Powered by Gemini OS for Samsung’s 2025 Headset
By Paula Parisi
December 13, 2024
With AI powering a range of new world-building apps, 2025 could be the year the metaverse finally makes an impact. Midjourney joins the world-building club with Patchwork, a collaborative canvas for creating “infinite” fictional worlds. Now in research preview, the tool is being developed as a standalone app, though preview access requires a Midjourney Discord account linked to a Google account. Users are able to connect characters and worlds, and “share” their developing world (which evolves as a “board”) with up to 100 collaborative partners on Midjourney, though the company recommends fewer participants for a more focused experience. Continue reading Midjourney Touts Collaborative World-Building App Patchwork
By Paula Parisi
December 13, 2024
Ayar Labs, which develops optical interconnect chips for large-scale AI workloads, has secured $155 million in financing, including from competing processor companies Nvidia, Intel and AMD. Founded in 2015, the Silicon Valley-based company is pursuing a different processing path, combining photonic elements with electronic circuits on each chip in what it says is a faster, more efficient approach to processing for artificial intelligence and high-performance computing. “This brings the company’s total funding to $370 million and raises the company’s valuation to above $1 billion,” Ayar notes, adding that the new funding allows the company to scale its optical I/O tech. Continue reading Nvidia, Intel and AMD Invest in AI Chiplet Developer Ayar Labs
By Paula Parisi
December 13, 2024
YouTube’s Playables, a no-download app for light games, is testing a multiplayer feature for select titles. The feature lets users play games in real time with others on the platform. The test kicks off with two games available on both desktop and mobile, “Ludo Club” and “Magic Tiles 3.” YouTube launched Playables to all users in May with more than 75 titles and announced this week that it plans to introduce more features and content in the future. Gaming is a “sizable” viewing market for YouTube, according to Statista, which says its most-subscribed game channels each average about 47 million monthly subscribers. Continue reading YouTube Playables Experiments with Live Multiplayer Gaming
By Paula Parisi
December 12, 2024
Ten months after its preview, OpenAI has officially released a Sora video model called Sora Turbo. Described as “hyperrealistic,” Sora Turbo generates clips of 10 to 20 seconds from text or image inputs. It outputs video in widescreen, vertical or square aspect ratios at resolutions from 480p to 1080p. The new product is being made available to ChatGPT Plus and Pro subscribers ($20 and $200 per month, respectively) but is not yet included with ChatGPT Team, Enterprise, or Edu plans, or available to minors. The company explains that Sora videos contain C2PA metadata indicating that they were generated by AI. Continue reading OpenAI Releases Sora, Adding It to ChatGPT Plus, Pro Plans
By Paula Parisi
December 12, 2024
World Labs, the AI startup co-founded by Stanford AI pioneer Fei-Fei Li, has debuted a “spatial intelligence” system that can generate 3D worlds from a single image. Although the output is not photorealistic, the tech could be a breakthrough for animation companies and video game developers. Deploying what it calls Large World Models (LWMs), World Labs is focused on transforming 2D images into turnkey 3D environments with which users can interact. Observers say that interactivity is what sets World Labs’ technology apart from offerings by other AI companies that transform 2D to 3D. Continue reading World Labs AI Lets Users Create 3D Worlds from Single Photo
By Paula Parisi
December 12, 2024
Hundreds of thousands more YouTube channels are gaining access to the platform’s AI-powered auto-dubbing feature, which generates audio translation tracks for YouTube videos, helping to make content more accessible to viewers around the world. The expanded rollout targets informational channels in the Partner Program, such as tutorials on cooking, sewing, tourism and home improvement. Availability “will expand to other types of content soon,” according to the video streamer, which began testing the feature with select creators last year. YouTube’s auto-dubbing is based on technology developed by Aloud, which emerged from Google’s Area 120 internal incubator program. Continue reading YouTube Expands Access to Improved AI-Powered Dubbing
By Paula Parisi
December 11, 2024
Reddit has launched a new AI-powered search tool called Reddit Answers. Reddit already appears regularly in Google Search results. The new interface lets users query a conversational model to get answers directly from the social platform. “Once a question is asked, curated summaries of relevant conversations and details across Reddit will appear, including links to related communities and posts,” according to Reddit. Whether users will want to skip their usual go-to search engines in favor of querying Reddit alone could have long-term ramifications for the 19-year-old social platform, which went public in 2024. Continue reading ‘Reddit Answers’ Wants to Gain More Users Searching In-App
By Paula Parisi
December 11, 2024
Nielsen is now offering a cross-media U.S. ad performance view that takes into account advertising on controversial social platform TikTok. As a result of the integration, advertisers and agencies will, for the first time, be able to compare ad performance on TikTok with performance across all other screens, including digital, CTV, and linear. The analytics will be parsed via Nielsen ONE, a cross-media platform that debuted in alpha in May 2023, at which time it was scheduled for broad release in late 2024. Nielsen says the TikTok integration will provide “independent and verified reporting of demographic data” for campaign measurement via Nielsen ONE. Continue reading TikTok Pacts with Nielsen to Measure Cross-Media Advertising
By Paula Parisi
December 11, 2024
In a deal said to be reshaping the global advertising industry, Omnicom has reached a definitive agreement to acquire a major rival, the Interpublic Group (IPG), in a stock-for-stock transaction. If the deal receives regulatory approval, the New York-based ad giants will combine to form an agency that will be the largest in the world, bringing together ad legends TBWA Worldwide and McCann Worldgroup for what CNBC estimates will be more than $26 billion in annual revenue. The merger joins “world-class, highly complementary data and technology platforms” at a propitious time, thanks to seismic, AI-driven advances in marketing and adtech. Continue reading Omnicom Will Acquire Interpublic in Major Ad Industry Merger
By Paula Parisi
December 10, 2024
Meta Platforms has packed more artificial intelligence into a smaller package with Llama 3.3, which the company released last week. The open-source large language model (LLM) “improves core performance at a significantly lower cost, making it even more accessible to the entire open-source community,” Meta VP of Generative AI Ahmad Al-Dahle wrote on X. The 70-billion-parameter, text-only Llama 3.3 is said to perform on par with the 405-billion-parameter model that was part of Meta’s Llama 3.1 release in July while requiring less computing power, which significantly lowers its operational costs. Continue reading Meta’s Llama 3.3 Delivers More Processing for Less Compute