Top Stories

Google Bows Gemini Command Line Interface for Developers

In a move to attract more developers to Gemini, Google is releasing an open-source command line interface (CLI) that will be free for most developers. CLIs offer a means to communicate with operating systems, and can be used as alternatives or complements to an integrated development environment (IDE). Gemini CLI has agentic capabilities and can code and “so much more,” according to Google, which lists content generation, problem solving, deep research and task management among its uses. Gemini CLI provides “lightweight access to Gemini, giving you the most direct path from your prompt to our model.” Read more

ElevenLabs Text-to-Voice AI Tools Now Available for Mobile

ElevenLabs is bringing its powerful AI voice tools to mobile. Previously, the company’s apps and voice libraries were only available via the Web. Now iOS and Android users can tap ElevenLabs tech on the go with a “faster, intuitive, more powerful experience built natively for mobile” rather than awkwardly through a mobile browser. Combining mobility with creativity, the app lets users create realistic voiceovers for social media or narrate video using ElevenLabs’ text-to-speech models — including Eleven v3, now in alpha, which lets users fine-tune vocalizations using tags. The company has also introduced a new voice assistant, 11ai. Read more

Meta Unveils New AI Advertising Tools as Part of Advantage+

Meta Platforms has announced new generative AI features that produce video advertisements for marketers. As part of Meta’s Advantage+ ad suite, marketers can now use up to 20 product stills to create multi-scene video ads with music and text overlay. The company has also added generative AI voices and virtual try-ons to its Advantage+ marketer toolkit. The upgrades were announced at the Cannes Lions International Festival of Creativity, where Meta’s use of AI not only to create ads but to algorithmically serve them to target audiences was a topic of conversation. Read more

Google Gemini Robotics On-Device Controls Robots Locally

Google DeepMind has released a new vision-language-action (VLA) model, Gemini Robotics On-Device, that can operate robots locally, controlling their movements without requiring an Internet connection or the cloud. Google says the software provides “general-purpose dexterity and fast task adaptation,” building on the March release of the first Gemini Robotics VLA model, which brought “Gemini 2.0’s multimodal reasoning and real-world understanding into the physical world.” Since the model operates independently of a data network, it’s useful for latency-sensitive applications as well as low- or no-connectivity environments. Google is also releasing a Gemini Robotics SDK for developers. Read more

Google Search Live Features Conversational Voice Capability

Google has launched Search Live with voice input, a two-way conversational query function for exploring online resources. Presently available via the Google app for Android and iOS to U.S. users enrolled in Google Labs’ AI Mode experiment, Search Live is designed to handle complex, multi-part questions. Google suggests the new feature is “perfect for when you’re on the go or multitasking, like if you’re packing for a trip.” The conversational voice feature follows Google’s general rollout of AI Mode, recently launched to compete against products such as OpenAI’s ChatGPT Search and Perplexity AI. Read more

Also Noted