By
Paula ParisiJuly 7, 2025
China’s Baidu has added AI and voice features to its Ernie search engine and as of June 30 officially made its generative Ernie large language model open source in a direct challenge to OpenAI and Anthropic as well as local Chinese rival DeepSeek. “This isn’t just a China story. Every time a major lab open-sources a powerful model, it raises the bar for the entire industry,” University of Southern California Associate Professor of Computer Science Sean Ren told CNBC. Baidu is also giving the Ernie mobile app more chatbot-like functionality, enabling it to help with drawing, writing and travel tasks. Continue reading Baidu Delivers AI Updates to Search, Open-Sources Ernie 4.5
By
Paula ParisiJuly 2, 2025
Chinese e-commerce giant Alibaba has released a new multimodal model called Qwen VLo that can understand and generate images. Available for free in preview through Qwen Chat, it can use image or text prompts to generate pictures, and accepts text in multiple languages, including Chinese and English. It can also edit, change backgrounds and switch styles, handling multiple image edits in sequence. An upgrade over January’s Qwen 2.5-VL release, Qwen VLo uses progressive generation, allowing users to see the image creation in progress, and Alibaba says it’s particularly good at making inline adjustments to fine-tune images. Continue reading Alibaba’s Qwen VLo Generative AI Shows Images in Progress
By
Paula ParisiJune 27, 2025
Anthropic has updated its Claude AI chatbot with the ability to build, host and share AI-powered apps directly in Claude. Launching in beta, the new function builds upon the Artifacts feature Anthropic introduced last year, allowing users to see and interact with what they asked Claude to create. “Now developers can iterate faster on their AI apps without worrying about the complexity and cost of scaling for a growing audience,” according to Anthropic. The San Francisco-based AI startup adds that millions people have already used Claude to create more than 500 million artifacts — from productivity tools to educational games. Continue reading Anthropic’s Claude Chatbot Is Now a No-Code App Developer
By
Paula ParisiJune 27, 2025
In a move to attract more developers to Gemini, Google is releasing an open-source command line interface (CLI) that will be free for most developers. CLIs offer a means to communicate with operating systems, and can be used as alternatives or complementary to an integrated developer environment (IDE). Gemini CLI has agentic capabilities and can code and “so much more,” according to Google, which lists content generation, problem solving, deep research and task management among its uses. Gemini CLI provides “lightweight access to Gemini, giving you the most direct path from your prompt to our model.” Continue reading Google Bows Gemini Command Line Interface for Developers
By
Paula ParisiJune 25, 2025
Google has launched Search Live with voice-input, a two-way conversational query function for exploring online resources. Presently available via the Google app for Android and iOS to U.S. users enrolled in Google Labs’ AI Mode experiment, Search Live is designed to handle complex, multi-part questions. Google suggests the new feature is “perfect for when you’re on the go or multitasking, like if you’re packing for a trip.” The discursive voice feature follows Google’s general rollout of AI Mode, recently launched to compete against products such as OpenAI’s ChatGPT Search and Perplexity AI. Continue reading Google Search Live Features Conversational Voice Capability
By
Paula ParisiJune 24, 2025
The redesigned Firefly AI app Adobe released in April with third-party model support is now available on iOS and Android. Text-to-video and background editing are among the features included in the new mobile package, which Adobe claims will help users capture inspiration as it strikes with “the freedom to generate images and videos wherever you are.” Adobe says those of all skill levels will be able to use the app, which was designed “to complement the ways we already interact with our phones.” The company is also rolling out its AI-powered online moodboard creator — Firefly Boards — in public beta, now with video functionality. Continue reading Adobe Unveils Firefly Generative AI App for iOS and Android
By
Paula ParisiJune 17, 2025
Startup Zencoder (formerly For Good AI) has launched a cloud-based AI-powered E2E testing agent that simplifies the pipeline from initial code to production-ready applications. Now in public beta, Zentester tackles “verification,” which Zencoder founder and CEO Andrew Filev calls “the missing link” in scaling AI-created code from concept to market-ready app. That complicated process is often delayed by a bottleneck in final testing. Zentester is designed to take that late-stage verification process “from days to hours,” Filev says. Zentester has the typical agent superpowers — seeing and interacting as users do by clicking buttons, filling in forms and navigating workflows. Continue reading Zencoder Testing Agent Shaves Weeks Off App Development
By
Paula ParisiJune 13, 2025
Snap Inc. announced it will launch a “lightweight, immersible” consumer line of AR smart glasses called “Specs” in 2006 (breaking from its “Spectacles” branding). Announcing the sixth-generation of its glasses at the Augmented World Expo conference this week, Snap CEO Evan Spiegel said the eyewear offers “an ultra-powerful wearable computer integrated into a lightweight pair of glasses with see-thru lenses.” Spiegel explained the glasses will be untethered, which suggests they may be powered by the new Qualcomm Snapdragon AR1+ Gen 1 chip, also announced at AWE. Coming a decade after Snap’s first attempt at consumer AR glasses, Specs leverage a $3 billion investment in 11 years of R&D. Continue reading Snap to Launch Specs Consumer Smart Glasses Line in 2026
By
Paula ParisiJune 11, 2025
Determined to rebound from the Llama 4 Behemoth delay, Meta Platforms is swinging for the fences in its artificial intelligence efforts, with CEO Mark Zuckerberg personally supervising creation of a new AI lab focused on superintelligence. Meta has reportedly reached a deal to pay up to $15 billion for a stake in a startup called Scale AI that would bring its 28-year-old co-founder and CEO Alexandr Wang in-house to lead the effort. Other leading Scale AI staff would also come aboard, and Meta is said to be recruiting researchers from OpenAI and Google by dangling “seven- to nine-figure compensation packages,” reports indicate. Continue reading Meta Said to Be Planning Major Push into AI Superintelligence
By
Paula ParisiJune 9, 2025
China’s Manus AI has unveiled a text-to-video generator it says can transform “prompts into complete stories — structured, sequenced, and ready to watch. With a single prompt, Manus plans each scene, crafts the visuals, and animates your vision,” the company announced last week. Manus generated buzz in March for its agentic approach to AI, and now it is putting that autonomous technology to work on generative AI, promising story generation within minutes. Last month, the firm that developed Manus, Butterfly Effect, reportedly secured $75 million in funding led by U.S.-based Benchmark for a nearly $500 million valuation. Continue reading Manus AI Takes an Agentic Approach with Its Video Generator
By
Paula ParisiJune 5, 2025
Microsoft’s Bing mobile search app for Android and iOS is adding a free Bing Video Creator feature powered by OpenAI’s Sora. Text-to-video model Sora was previously available only by subscription through ChatGPT Plus for $20 per month or the $200 monthly ChatGPT Pro. The integration will enable creation of five-second vertical video clips on mobile, with horizontal capability coming soon. Although a major investor in OpenAI, Microsoft has made headway with its own signature AI offerings, including Bing Image Creator and the Copilot AI assistant. The Bing Mobile app is available worldwide via the App Store for iOS and the Play Store for Android. Continue reading Microsoft Debuts Mobile Bing Video Creator Powered by Sora
By
Paula ParisiJune 3, 2025
An internal OpenAI document revealed as part of a discovery demand in the U.S. Justice Department’s antitrust case against Google reveals OpenAI’s plan to turn ChatGPT into the public’s primary Internet interface. Although heavily redacted, the document effectively reveals OpenAI’s goal is to have ChatGPT replace Google Chrome and Search with an “AI super assistant that deeply understands you and is your interface to the Internet.” Aside from bolstering Google’s position that it has competition and is not an impregnable monopoly, the information contained therein provides an in-depth look at OpenAI’s roadmap. Continue reading OpenAI Seeks to Make ChatGPT New Gateway to the Internet
By
Paula ParisiJune 3, 2025
DeepSeek-R1-0528 is here, and this latest iteration is generating almost as much stir as the initial open-source R1 reasoning model did in January. The Chinese startup, owned by quantitative analysis firm High-Flyer Capital, is touted by one media outlet as “near parity in reasoning capabilities with proprietary paid models such as OpenAI’s o3 and Google Gemini 2.5 Pro.” Promised are stronger capabilities in complex reasoning centered on math, science, business and coding, along with improved features for developers and researchers. As with the earlier release, the DeepSeek-R1-0528 is available under the MIT License, which supports commercial use and allows customization. Continue reading DeepSeek’s New Update Heightens Rivalry with U.S. AI Firms
By
Paula ParisiMay 29, 2025
Meta Platforms has restructured its artificial intelligence operations, creating two separate units — an AI Products team headed by Connor Hayes and the AGI Foundations unit jointly run by Ahmad Al-Dahle and Amir Frenkel. The move comes as Meta is delayed getting its latest foundation model Llama 4 Behemoth to market and has also seen several key AI engineers depart. The idea is that dividing a single large organization into two smaller divisions will speed product development and add flexibility as Meta faces tough competition from top companies such as Google and OpenAI, as well as social companies X and TikTok. Continue reading Meta Restructures AI Ops in Wake of Llama 4 Behemoth Delay
By
Paula ParisiMay 28, 2025
OpenAI has upgraded its autonomous web browsing agent Operator to the new reasoning model OpenAI o3 from the prior GPT-4o multimodal LLM engine. The update is being released globally in research preview this month for those who subscribe to OpenAI’s ChatGPT Pro for $200 per month. Operator serves OpenAI’s “computer-using agent” (CUA), a model trained to interact with graphical interfaces that uses the Web to perform tasks for people. “Using its own browser, it can look at a webpage, and interact with it much like a human would by typing, clicking, scrolling and more,” OpenAI explains. Continue reading New Reasoning Model Improves Smarts of OpenAI Operator