By
Paula ParisiAugust 11, 2025
OpenAI is rolling a new foundation model, GPT-5, via API for developers and enterprise users in three branded sizes — gpt-5, gpt-5-mini and gpt-5-nano — “to give developers more flexibility to trade off performance, cost, and latency.” The company said Thursday that it is also making GPT‑5 available to all ChatGPT Plus, Pro, Team and Free tier users. Enterprise and Education tier users are promised access this week. While GPT‑5 in the API platform is the reasoning model that powers maximum performance in ChatGPT, “GPT‑5 in ChatGPT is a system of reasoning, non-reasoning, and router models,” OpenAI explains. Continue reading OpenAI Announces Launch of GPT-5 Model Across All Tiers
By
Paula ParisiAugust 5, 2025
Google has upgraded Gemini 2.5 with a feature called Deep Think that “could be a powerful tool in creative problem solving,” according to the company. Deep Think uses parallel thinking techniques, letting Gemini “generate many ideas at once and consider them simultaneously, even revising or combining different ideas over time, before arriving at the best answer.” Its ability to reason through highly complex problems makes Deep Think mode a powerful tool for researchers, Google says, adding that it also excels at coding. Deep Think is available in the Gemini app to subscribers of Google AI Ultra, priced at $250 per month. Continue reading Google Intros Deep Think Reasoning Model for AI Ultra Subs
By
Paula ParisiJune 27, 2025
Anthropic has updated its Claude AI chatbot with the ability to build, host and share AI-powered apps directly in Claude. Launching in beta, the new function builds upon the Artifacts feature Anthropic introduced last year, allowing users to see and interact with what they asked Claude to create. “Now developers can iterate faster on their AI apps without worrying about the complexity and cost of scaling for a growing audience,” according to Anthropic. The San Francisco-based AI startup adds that millions people have already used Claude to create more than 500 million artifacts — from productivity tools to educational games. Continue reading Anthropic’s Claude Chatbot Is Now a No-Code App Developer
By
Paula ParisiJune 26, 2025
ElevenLabs is bringing its powerful AI voice tools to mobile. Previously, the company’s apps and voice libraries were only available via the Web. Now iOS and Android users can tap ElevenLabs tech on the go with a “faster, intuitive, more powerful experience built natively for mobile” rather than awkwardly through a mobile browser. Combining mobility with creativity, the app lets users create realistic voiceovers for social media or narrate video using ElevenLabs’ text-to-speech models — including Eleven v3, now in alpha, which lets users fine-tune vocalizations using tags. The company has also introduced a new voice assistant, 11ai. Continue reading ElevenLabs Text-to-Voice AI Tools Now Available for Mobile
By
Paula ParisiJune 3, 2025
DeepSeek-R1-0528 is here, and this latest iteration is generating almost as much stir as the initial open-source R1 reasoning model did in January. The Chinese startup, owned by quantitative analysis firm High-Flyer Capital, is touted by one media outlet as “near parity in reasoning capabilities with proprietary paid models such as OpenAI’s o3 and Google Gemini 2.5 Pro.” Promised are stronger capabilities in complex reasoning centered on math, science, business and coding, along with improved features for developers and researchers. As with the earlier release, the DeepSeek-R1-0528 is available under the MIT License, which supports commercial use and allows customization. Continue reading DeepSeek’s New Update Heightens Rivalry with U.S. AI Firms
By
Paula ParisiMay 30, 2025
Anthropic’s new mobile conversation voice mode for its large language model Claude lets it search Google Docs, Drive, Calendar and more on smartphones. Just a week after debuting two new LLMs — Claude Opus 4 and Sonnet 4 — Anthropic announced the mobile updates for its Claude AI chatbot for iOS and Android and said it is extending web search for all users on free Claude plans. While Claude’s conversational voice interface is currently available only in English and only via mobile, an API for desktop use and browser-based support are part of future plans. Amazon and Google both have investment stakes in San Francisco-based Anthropic. Continue reading Anthropic Touts Mobile Voice Mode, Free Search for Claude
By
Paula ParisiMay 28, 2025
OpenAI has upgraded its autonomous web browsing agent Operator to the new reasoning model OpenAI o3 from the prior GPT-4o multimodal LLM engine. The update is being released globally in research preview this month for those who subscribe to OpenAI’s ChatGPT Pro for $200 per month. Operator serves OpenAI’s “computer-using agent” (CUA), a model trained to interact with graphical interfaces that uses the Web to perform tasks for people. “Using its own browser, it can look at a webpage, and interact with it much like a human would by typing, clicking, scrolling and more,” OpenAI explains. Continue reading New Reasoning Model Improves Smarts of OpenAI Operator
By
Paula ParisiApril 28, 2025
Nvidia has released NeMo microservices into general availability with version 25.4, pivoting its profile from a modular toolkit for creating custom generative AI models to emphasizing it as a platform for building AI agents at scale. As AI agents have become an in-demand commodity, Nvidia is leveraging the fact that NeMo’s capabilities seem purpose built to help them grow and thrive. Built around the Kubernetes open-source container management system, NeMo microservices are offered as “an end-to-end developer platform for creating state-of-the-art agentic AI systems,” according to Nvidia. Continue reading Nvidia Positions Its NeMo Microservices for AI Agent-Building
By
Paula ParisiApril 18, 2025
Agentic AI company Moveworks has opened an AI Agent Marketplace that launches with more than 100 pre-built agents, enabling users to discover, install, and deploy AI assistants that automate business processes. Agentic AI is booming, as businesses seek to offload tasks from human workers to software. To support that, new companies and existing ones have started providing pre-built agents that are more convenient than building them from scratch. “What once took weeks to build can now be installed and deployed in mere minutes,” Moveworks says, touting its library offerings. Continue reading Moveworks Joins Competition in Offering Enterprise AI Agents
By
Paula ParisiApril 15, 2025
A new open-source code reasoning model called DeepCoder-14B-Preview has hit the market. Built atop DeepSeek-R1 and Qwen2.5 using reinforcement learning (RL), it aims to provide more flexibility by combining high-performance code generation with reasoning capabilities for real-world applications. Its performance is said to be comparable to OpenAI’s o3-mini, “but with a smaller footprint,” say its developers, the research-driven AI companies Together AI and Agentica. “We democratize the recipe for training a small model into a strong competitive coder,” explains Together AI. Continue reading Researchers Debut Preview of DeepCoder Reasoning Model
By
Paula ParisiApril 14, 2025
Google has debuted a new accelerator chip, Ironwood, a tensor processing unit designed specifically for inference — the ability of AI to predict things. Ironwood will power Google Cloud’s AI Hypercomputer, which runs the company’s Gemini models and is gearing up for the next generation of artificial intelligence workloads. Google’s TPUs are similar to the accelerator GPUs sold by Nvidia, but unlike the GPUs they’re designed for AI and geared toward speeding neural network tasks and mathematical operations. Google says when deployed at scale Ironwood is more than 24 times more powerful than the world’s fastest supercomputer. Continue reading Google Ironwood TPU is Made for Inference and ‘Thinking’ AI
By
Paula ParisiApril 10, 2025
San Francisco-based AI startup Deep Cogito has released five AI models in preview, making them available under an open-source license agreement. The models come in sizes 3B, 8B, 14B, 32B and 70B, with plans to release 109B, 400B and 671B versions in the weeks and months ahead. As for the current models, “each outperforms the best available open models of the same size, including counterparts from Meta, DeepSeek and Alibaba, across most standard benchmarks,” Deep Cogito claims, noting that the 70B model in particular “outperforms the newly released Llama 4 109B MoE model.” Continue reading Deep Cogito Is Out of Stealth with Hybrid Reasoning Models
By
Paula ParisiApril 10, 2025
Amazon has updated its Nova model series, with Nova Reel 1.1 now able to generate AI videos of up to two minutes as well as gaining a new ‘multi-shot’ feature. Announced in December, Nova Reel marked Amazon’s initial foray into generative video. AWS developer advocate Elizabeth Fuentes says that Nova Reel accommodates user prompts of up to 4,000 characters that can generate a series of six-second shots for a sequence totaling two minutes. The company also introduced the Nova Sonic real-time voice model that supports third-party enterprise development. Continue reading AWS Updates Nova Reels and Adds Nova Sonic Voice Model
By
Paula ParisiMarch 24, 2025
OpenAI has debuted three new models for transcription and voice generation — gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts. The text-to-speech and speech-to-text AI models are designed to help developers create AI agents with highly customizable voices. OpenAI claims these models will power natural and responsive voice agents, moving AI out of the text-based communications stage and into intuitive spoken conversations. The suite outperforms existing solutions in accuracy and reliability, OpenAI says, especially with “accents, noisy environments, and varying speech speeds,” making them well-suited for customer call centers and meeting notes. Continue reading OpenAI Pushes Conversational Agents with Three New Models
By
Paula ParisiMarch 21, 2025
A new Discord Social SDK allows developers to integrate the platform in-app for games. Discord is massively popular with gamers; the company estimates PC players alone spend more than 1.5 billion hours each month on the platform. This free SDK can extend the user experience beyond the third-party content in which it becomes embedded to reach the platform’s community of over 200 million monthly active users. “Developers can power friends lists, cross-platform messaging, voice and more for all players — with or without a Discord account,” the company announced. Continue reading New Discord Social SDK Integrates Platform In-App for Games