YouTube Adds AI Search Results Carousel for Premium Subs

YouTube is adding an AI-powered search results carousel that serves up video suggestions and topic descriptions. A search for “best beaches in Hawaii,” for example, could generate a carousel listing video clips and information on an assortment of snorkel spots and volcanic beaches. YouTube Premium subscribers in the U.S. can try the feature now on searches related to shopping, travel or location-based activities. The Google-owned platform is also expanding its test with conversational AI to some non-Premium users in the U.S. Premium members have been using it for search, recommendations and as a study aid. Continue reading YouTube Adds AI Search Results Carousel for Premium Subs

Google Gemini Robotics On-Device Controls Robots Locally

Google DeepMind has released a new vision-language-action (VLA) model, Gemini Robotics On-Device, that can operate robots locally, controlling their movements without requiring an Internet connection or the cloud. Google says the software provides “general-purpose dexterity and fast task adaptation,” building on the March release of the first Gemini Robotics VLA model, which brought “Gemini 2.0’s multimodal reasoning and real-world understanding into the physical world.” Since the model operates independent of a data network, it’s useful for latency sensitive applications as well as low or no connectivity environments. Google is also releasing a Gemini Robotics SDK for developers. Continue reading Google Gemini Robotics On-Device Controls Robots Locally

Goldman Sachs Launches an AI Assistant for Its Employees

Investment firm Goldman Sachs just implemented an AI assistant framework for employees across its entire workforce of about 46,000, marking what the bank says is a major technological leap forward for its global operations. The conversational GS AI Assistant includes a developer copilot for coding, translation tools and a nascent “Banker Copilot” aimed at streamlining investment banker workflows. The conversational GS AI Assistant also lets workers interact with large language models including ChatGPT and Gemini within the firewalled parameters of Goldman’s secure compliance software. The rollout comes after a year of pilot testing involving more than 10,000 employees. Continue reading Goldman Sachs Launches an AI Assistant for Its Employees

CineSearch Is a New AI Discovery Tool for Streaming Content

A little over a year since the beta release of its conversational AI search and discovery tool, Cineverse is making cineSearch commercially available to business customers. The Los Angeles startup says its AI-powered framework “solves” the content-hunt quandary for digital networks and streaming services, finding programming across all streaming platforms. Cineverse is making cineSearch available for commercial licensing to OEMs and streaming platforms via the company’s own sales team and through Google Cloud Marketplace. CineSearch was developed using Google’s AI ecosystem — specifically Vertex AI platform and the Gemini 2.0 Pro model. Continue reading CineSearch Is a New AI Discovery Tool for Streaming Content

Google Is Testing ‘Hosted’ GenAI Audio Summaries in Search

Google is testing podcast-like audio search summaries generated by AI. Audio Overviews uses Google’s latest Gemini models to generate “quick, conversational audio overviews for certain search queries.” It can be enabled through Google Labs, the company’s public-facing portal to AI experiments. An Audio Overview “can help you get a lay of the land, offering a convenient, hands-free way to absorb information,” Google says, noting that the feature displays search results “right within the audio player” to make it easy to delve further. Google already had AI audio summaries in NotebookLM and Gemini. Like those, Search features AI discussion “hosts.” Continue reading Google Is Testing ‘Hosted’ GenAI Audio Summaries in Search

Snap to Launch Specs Consumer Smart Glasses Line in 2026

Snap Inc. announced it will launch a “lightweight, immersible” consumer line of AR smart glasses called “Specs” in 2006 (breaking from its “Spectacles” branding). Announcing the sixth-generation of its glasses at the Augmented World Expo conference this week, Snap CEO Evan Spiegel said the eyewear offers “an ultra-powerful wearable computer integrated into a lightweight pair of glasses with see-thru lenses.” Spiegel explained the glasses will be untethered, which suggests they may be powered by the new Qualcomm Snapdragon AR1+ Gen 1 chip, also announced at AWE. Coming a decade after Snap’s first attempt at consumer AR glasses, Specs leverage a $3 billion investment in 11 years of R&D. Continue reading Snap to Launch Specs Consumer Smart Glasses Line in 2026

Character.AI Goes Wide with AvatarFX, Adds Mobile Features

Chatbot platform Character.AI is rolling out its video generator, AvatarFX, in general release after a month in closed beta. It’s also adding a sharing feature called Scenes and Streams that will serve content to Character.AI’s community feed, coming soon to mobile. Users can now tap AvatarFX to create up to five videos per day, starting by uploading a photo, choosing a voice and writing dialogue for the character. Character.AI started as 1:1 text chat in the summer of 2023. Now the company is “expanding into a multi-modal world” with “more ways for creators to build immersive narratives and dynamic experiences.” Continue reading Character.AI Goes Wide with AvatarFX, Adds Mobile Features

OpenAI Seeks to Make ChatGPT New Gateway to the Internet

An internal OpenAI document revealed as part of a discovery demand in the U.S. Justice Department’s antitrust case against Google reveals OpenAI’s plan to turn ChatGPT into the public’s primary Internet interface. Although heavily redacted, the document effectively reveals OpenAI’s goal is to have ChatGPT replace Google Chrome and Search with an “AI super assistant that deeply understands you and is your interface to the Internet.” Aside from bolstering Google’s position that it has competition and is not an impregnable monopoly, the information contained therein provides an in-depth look at OpenAI’s roadmap. Continue reading OpenAI Seeks to Make ChatGPT New Gateway to the Internet

DeepSeek’s New Update Heightens Rivalry with U.S. AI Firms

DeepSeek-R1-0528 is here, and this latest iteration is generating almost as much stir as the initial open-source R1 reasoning model did in January. The Chinese startup, owned by quantitative analysis firm High-Flyer Capital, is touted by one media outlet as “near parity in reasoning capabilities with proprietary paid models such as OpenAI’s o3 and Google Gemini 2.5 Pro.” Promised are stronger capabilities in complex reasoning centered on math, science, business and coding, along with improved features for developers and researchers. As with the earlier release, the DeepSeek-R1-0528 is available under the MIT License, which supports commercial use and allows customization. Continue reading DeepSeek’s New Update Heightens Rivalry with U.S. AI Firms

Google Photos Rolling Out Redesign and New AI Editing Tools

Google is celebrating 10 years of Google Photos by introducing a redesign of the Photos editor, including helpful new tools. The Photos editor gets some AI editing features previously available only on Pixel phones as part of its generative AI Magic Editor. The Photos platform is also expanding access to its AI-powered text-to-image Reimagine and automatic framing and related features first introduced with the Pixel 9. The company explains there are currently more than 1.5 billion monthly Photos users that have stored 9+ trillion photos and videos. The updates reflect Google’s AI push as it continues to integrate Gemini across its growing family of products and services. Continue reading Google Photos Rolling Out Redesign and New AI Editing Tools

YouTube Shorts Powers New Visual Search with Google Lens

YouTube is integrating Google Lens, allowing viewers to search elements of what they see while watching YouTube Shorts. The visual search enhancement aims to provide more ways to unearth information and discover content in an interactive, intuitive way. YouTube provides an example involving a Short filmed on location that features landmarks a viewer may be interested in visiting. In this example, users could ask Google Lens for related information to learn the name of the destination and helpful details regarding its culture and history, results that would appear over the video content as a visual overlay. YouTube began rolling out the Lens feature in beta to all viewers last week. Continue reading YouTube Shorts Powers New Visual Search with Google Lens

Anthropic Touts Mobile Voice Mode, Free Search for Claude

Anthropic’s new mobile conversation voice mode for its large language model Claude lets it search Google Docs, Drive, Calendar and more on smartphones. Just a week after debuting two new LLMs — Claude Opus 4 and Sonnet 4 — Anthropic announced the mobile updates for its Claude AI chatbot for iOS and Android and said it is extending web search for all users on free Claude plans. While Claude’s conversational voice interface is currently available only in English and only via mobile, an API for desktop use and browser-based support are part of future plans. Amazon and Google both have investment stakes in San Francisco-based Anthropic. Continue reading Anthropic Touts Mobile Voice Mode, Free Search for Claude

Google Tees-Up Android XR to Take On Meta, Apple Eyewear

Google and Xreal publicly displayed the first eyewear to run Android XR. Developed as Project Aura, the extended reality smart glasses are expected to be available for purchase later this year or in early 2026. Google is also working with Samsung on a headset as part of Project Moohan. Pricing on the Aura glasses wasn’t announced at Google I/O, but they’ll reportedly cost less than the thousand dollar-plus price tags being floated for the Samsung device and Meta’s next-gen smart glasses, code-named Hypernova. At its annual developer conference, Google also showcased glasses being made with Warby Parker and Gentle Monster that won’t feature AR. Continue reading Google Tees-Up Android XR to Take On Meta, Apple Eyewear

Google Upgrades GenAI Models, Debuts AI Storyteller ‘Flow’

Google is in a filmmaking frame of mind. The search giant introduced Veo 3, the latest version of its generative video model, loading it with cinematic capabilities including a new AI storytelling tool called Flow. At the Google I/O conference the company also debuted an upgraded image generator, Imagen 4, and announced expanded access to the AI music tool Lyria 2. Veo 3 can generate videos with audio — a Google first, adding things like background traffic noises, birds singing, “even dialogue between characters.” It offers improved consistency of characters, scenes and objects, while gaining camera controls, outpainting and object add/remove. Continue reading Google Upgrades GenAI Models, Debuts AI Storyteller ‘Flow’

Google DeepMind AlphaEvolve: Model of Algorithm Efficiency

Google DeepMind has introduced AlphaEvolve, a coding agent that takes an evolutionary approach to general-purpose algorithm discovery and model optimization. AlphaEvolve combines the creative problem-solving abilities of Google’s Gemini models with automated evaluators that verify answers, then applies an evolutionary framework that improves on the most promising results. Evolutionary AI refers to techniques inspired by biological evolution, including natural selection, to optimize and design machine learning models. Continue reading Google DeepMind AlphaEvolve: Model of Algorithm Efficiency