OpenAI Rolls Out New Parental Controls to Help Protect Kids

OpenAI has added parental controls for ChatGPT’s Web interface, with mobile controls coming soon. The controls give parents the ability to reduce or remove certain content and dial down personalization by turning off ChatGPT’s transcript memories. At the same time, OpenAI has added the ability to restrict image generation with the launch of Sora parental controls for ChatGPT-connected teen accounts. There are also controls for sending and receiving direct messages through the app. OpenAI says the changes aim “to give families tools to support their teens’ use of AI.” To activate control access, parents must have their own accounts and teens will need to opt in. Continue reading OpenAI Rolls Out New Parental Controls to Help Protect Kids

Apple Creates an AI Chatbot to Help Train the Next-Gen Siri

Apple has reportedly developed a ChatGPT-like iPhone app for internal use in testing its AI overhaul of Siri. Apple’s AI unit is using the app to assess new features for its famous conversational assistant, putting it through its paces for tasks like sifting through personal data, including documents, audio and emails, and performing actions including photo and video editing, according to reports. Called Veritas, Latin for “truth,” the developmental software is garnering attention as a pivotal tool in Siri’s highly anticipated AI makeover, expected to make its public debut next year. Continue reading Apple Creates an AI Chatbot to Help Train the Next-Gen Siri

Google TV Adding Conversational Intelligence with Gemini AI

Google TV is introducing Gemini as a conversational AI assistant to help find content and get more information about a favorite TV show or movie. Gemini on Google TV goes beyond the simple queries and commands of Google Assistant, which has been around since 2017, and allows “free-flowing conversations with your big screen,” the company explains. “Just say ‘Hey Google’ or press the microphone button on your TV remote” to tap into Gemini for TV to activate the new feature. Gemini is now available on the TCL QM9K series, with more TCL models coming onboard later this year. Google says additional functionality for Gemini on TV is coming soon. Continue reading Google TV Adding Conversational Intelligence with Gemini AI

Microsoft’s Copilot Update Includes Vision AI Screen Sharing

The Microsoft Store has an update to Microsoft Copilot that extends the capabilities of Copilot Vision. Rolling out initially to members of the Windows Insider Program, Desktop Share allows Copilot Vision to see a user’s desktop, enabling real-time conversation with the AI app, which will be able to answer questions about what it sees using text or natural language. Copilot Vision “can help analyze content, provide insights, and answer your questions, coaching you through it aloud,” according to Microsoft, offering things like “tips on making improvements to your creative project, help with improving your resume, or guidance while navigating a new game.” Continue reading Microsoft’s Copilot Update Includes Vision AI Screen Sharing

YouTube Adds AI Search Results Carousel for Premium Subs

YouTube is adding an AI-powered search results carousel that serves up video suggestions and topic descriptions. A search for “best beaches in Hawaii,” for example, could generate a carousel listing video clips and information on an assortment of snorkel spots and volcanic beaches. YouTube Premium subscribers in the U.S. can try the feature now on searches related to shopping, travel or location-based activities. The Google-owned platform is also expanding its test with conversational AI to some non-Premium users in the U.S. Premium members have been using it for search, recommendations and as a study aid. Continue reading YouTube Adds AI Search Results Carousel for Premium Subs

ElevenLabs Text-to-Voice AI Tools Now Available for Mobile

ElevenLabs is bringing its powerful AI voice tools to mobile. Previously, the company’s apps and voice libraries were only available via the Web. Now iOS and Android users can tap ElevenLabs tech on the go with a “faster, intuitive, more powerful experience built natively for mobile” rather than awkwardly through a mobile browser. Combining mobility with creativity, the app lets users create realistic voiceovers for social media or narrate video using ElevenLabs’ text-to-speech models — including Eleven v3, now in alpha, which lets users fine-tune vocalizations using tags. The company has also introduced a new voice assistant, 11ai. Continue reading ElevenLabs Text-to-Voice AI Tools Now Available for Mobile

Google Search Live Features Conversational Voice Capability

Google has launched Search Live with voice-input, a two-way conversational query function for exploring online resources. Presently available via the Google app for Android and iOS to U.S. users enrolled in Google Labs’ AI Mode experiment, Search Live is designed to handle complex, multi-part questions. Google suggests the new feature is “perfect for when you’re on the go or multitasking, like if you’re packing for a trip.” The discursive voice feature follows Google’s general rollout of AI Mode, recently launched to compete against products such as OpenAI’s ChatGPT Search and Perplexity AI. Continue reading Google Search Live Features Conversational Voice Capability

CineSearch Is a New AI Discovery Tool for Streaming Content

A little over a year since the beta release of its conversational AI search and discovery tool, Cineverse is making cineSearch commercially available to business customers. The Los Angeles startup says its AI-powered framework “solves” the content-hunt quandary for digital networks and streaming services, finding programming across all streaming platforms. Cineverse is making cineSearch available for commercial licensing to OEMs and streaming platforms via the company’s own sales team and through Google Cloud Marketplace. CineSearch was developed using Google’s AI ecosystem — specifically Vertex AI platform and the Gemini 2.0 Pro model. Continue reading CineSearch Is a New AI Discovery Tool for Streaming Content

Anthropic Touts Mobile Voice Mode, Free Search for Claude

Anthropic’s new mobile conversation voice mode for its large language model Claude lets it search Google Docs, Drive, Calendar and more on smartphones. Just a week after debuting two new LLMs — Claude Opus 4 and Sonnet 4 — Anthropic announced the mobile updates for its Claude AI chatbot for iOS and Android and said it is extending web search for all users on free Claude plans. While Claude’s conversational voice interface is currently available only in English and only via mobile, an API for desktop use and browser-based support are part of future plans. Amazon and Google both have investment stakes in San Francisco-based Anthropic. Continue reading Anthropic Touts Mobile Voice Mode, Free Search for Claude

AWS Updates Nova Reels and Adds Nova Sonic Voice Model

Amazon has updated its Nova model series, with Nova Reel 1.1 now able to generate AI videos of up to two minutes as well as gaining a new ‘multi-shot’ feature. Announced in December, Nova Reel marked Amazon’s initial foray into generative video. AWS developer advocate Elizabeth Fuentes says that Nova Reel accommodates user prompts of up to 4,000 characters that can generate a series of six-second shots for a sequence totaling two minutes. The company also introduced the Nova Sonic real-time voice model that supports third-party enterprise development. Continue reading AWS Updates Nova Reels and Adds Nova Sonic Voice Model

Midjourney Launches V7 Image Generator with Voice Prompts

Generative AI program Midjourney has issued V7 in alpha, marking its first new model in almost a year. Notable updates include personalization turned on by default, which users must first set up — a process Midjourney says takes 5 minutes — and can then toggle on or off at any time. Another new flagship feature, Draft Mode, lets users render lower resolution images at “half the cost and 10 times the speed,” according to Midjourney, emphasizing “it’s so fast that we change the prompt bar to a ‘conversational mode’ when you’re using it on Web.” Draft Mode also supports voice prompts. Continue reading Midjourney Launches V7 Image Generator with Voice Prompts

Alibaba’s Powerful Multimodal Qwen Model Is Built for Mobile

Alibaba Cloud has released Qwen2.5-Omni-7B, a new AI model the company claims is efficient enough to run on edge devices like mobile phones and laptops. Boasting a relatively light 7-billion parameter footprint, Qwen2.5-Omni-7B understands text, images, audio and video and generates real-time responses in text and natural speech. Alibaba says its combination of compact size and multimodal capabilities is “unique,” offering “the perfect foundation for developing agile, cost-effective AI agents that deliver tangible value, especially intelligent voice applications.” One example would be using a phone’s camera to help a vision impaired-person navigate their environment. Continue reading Alibaba’s Powerful Multimodal Qwen Model Is Built for Mobile

OpenAI Pushes Conversational Agents with Three New Models

OpenAI has debuted three new models for transcription and voice generation — gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts. The text-to-speech and speech-to-text AI models are designed to help developers create AI agents with highly customizable voices. OpenAI claims these models will power natural and responsive voice agents, moving AI out of the text-based communications stage and into intuitive spoken conversations. The suite outperforms existing solutions in accuracy and reliability, OpenAI says, especially with “accents, noisy environments, and varying speech speeds,” making them well-suited for customer call centers and meeting notes. Continue reading OpenAI Pushes Conversational Agents with Three New Models

AI Startup Sesame Develops Next Stage of Voice Generation

Sesame, an AI startup from Oculus co-founder Brendan Iribe, has created a conversational voice model that many feel has achieved uncanny levels of authenticity. Drawing comparisons to the charismatic vocal centerpiece of the 2013 Warner Bros. film “Her,” Sesame seems to have achieved a new level of engagement among AI voice assistants. While some are describing the tech as “amazing.” others have expressed concern over its capabilities. “Our goal is to achieve ‘voice presence’ — the magical quality that makes spoken interactions feel real, understood and valued,” explains a blog post by Iribe and others. Continue reading AI Startup Sesame Develops Next Stage of Voice Generation

New Samsung XR Headset Could Use Sony 4K Micro OLEDs

Samsung shook things up at Mobile World Congress 2025 with a display of its Project Moohan XR headset, which CNBC confirms will be released this year. While the MWC display was just a teaser, and Samsung remained tight-lipped about its specs, the mirrored goggles generated a lot of coverage, including speculation that Samsung may deploy Sony’s 4K Micro OLEDs in the new device, increasing Mohan’s resolution over the Apple Vision Pro by nearly 2 million pixels per eye and offering superior color, too. Samsung worked with Qualcomm and Google to develop Moohan, which will use the new Android XR operating system. Continue reading New Samsung XR Headset Could Use Sony 4K Micro OLEDs