By Paula Parisi, September 2, 2025
OpenAI and Anthropic, rivals in the AI space who guard their proprietary systems, joined forces for a misalignment evaluation, safety-testing each other’s models to identify when and how they fall short of human values. Among the findings: reasoning models including Anthropic’s Claude Opus 4 and Sonnet 4 and OpenAI’s o3 and o4-mini resisted jailbreaks, while conversational models like GPT-4.1 were susceptible to prompts or techniques intended to bypass safety protocols. Although the test results were unveiled as users complain that chatbots have become overly sycophantic, the tests were “primarily interested in understanding model propensities for harmful action,” per OpenAI. Continue reading Anthropic and OpenAI Report Findings of Joint AI Safety Tests
By Paula Parisi, August 21, 2025
This week, DeepSeek-V3.1 dropped on Hugging Face. Media outlets immediately began citing benchmark scores that rival proprietary systems from OpenAI and Anthropic for a system that is available via a permissive license, facilitating wide access. The 685-billion parameter Mixture-of-Experts (MoE) model has 37 billion active parameters and is designed for efficiency. It builds on DeepSeek-pioneered processes like multi-head latent attention (MLA) and multi-token prediction (MTP) to optimize inference, enabling high-performance computing on both enterprise servers loaded with H100 GPUs and consumer hardware like a Mac Studio or comparably powered PC. Continue reading DeepSeek-V3.1 Offered with Improvements in Speed, Context
By Paula Parisi, August 19, 2025
Claude Opus 4 and 4.1 now have the ability to end “abusive” or “harmful” conversations in consumer chat interfaces. Anthropic says the feature was developed as part of its exploratory work on the protection and well-being of its AI models. The company also envisions broader safety uses, although it points out that having a model defensively terminate a chat is an extreme measure, intended for use in rare cases. “We’re working to identify and implement low-cost interventions to mitigate risks to model welfare,” Anthropic explains, while noting that it is unsure whether “such welfare is possible.” Continue reading New Anthropic Safety Updates Focus on Claude’s Well-Being
By Paula Parisi, August 13, 2025
Elon Musk’s xAI has made Grok 4 available on its free tiers as it seeks to take advantage of initial user dissatisfaction with OpenAI’s new GPT-5. The company has positioned Grok as freewheeling and uncensored, a contrast to GPT-5, which has been criticized on Reddit and other social platforms as a “corporate beige zombie” with too many guardrails. After its February debut, Grok 3 was reined in with checks including the removal of its native image generator in March. Grok 4 was released in July with integrated image and video features as well as a “Spicy” mode for creating risqué content. Continue reading Grok 4 Offered Free in xAI Move on ChatGPT-5 Market Share
By Paula Parisi, August 11, 2025
Anthropic has released Claude Opus 4.1, an upgrade to Opus 4 that reportedly improves on agentic tasks, computer coding and reasoning. Pricing has not increased from what customers were paying for Opus 4, and the company promises “substantially larger improvements to our models in the coming weeks.” The move comes as Anthropic nears a new funding round targeting $3 to $5 billion, which could place a valuation of up to $170 billion on the startup. Recurring revenue hit $5 billion as of late July, which could increase to $9 billion by the end of the year. Claude Opus 4.1 was released two days before OpenAI unleashed GPT-5, and performs comparably in coding benchmarks. Continue reading Anthropic Seeks to Raise $5 Billion, Debuts Claude Opus 4.1
By Paula Parisi, August 7, 2025
OpenAI is releasing two lower-cost, open-weight reasoning models in an effort to be more competitive with Meta, Mistral and DeepSeek, and they will be the first OpenAI models available from Amazon. The new offerings, gpt-oss-120b and gpt-oss-20b, will be among the model choices on AWS’s Bedrock and SageMaker AI services. Both models are said to be well-suited for agentic use. The gpt-oss-120b model performs comparably to OpenAI o4-mini on core reasoning and can run on a single 80GB GPU. The gpt-oss-20b model is compared to OpenAI o3‑mini and can run on edge devices with just 16GB of memory. Continue reading Open-Weight Models Are a First from OpenAI in AWS Catalog
By Paula Parisi, July 28, 2025
Alibaba’s Qwen team has launched Qwen3-Coder, which it calls its “most agentic code model to date.” While it will be made available in multiple sizes, the most powerful variant, Qwen3-Coder-480B-A35B-Instruct, is being released first. The 480 billion-parameter mixture-of-experts model has 35 billion active parameters and supports a context length of 256,000 tokens natively and 1 million tokens with extrapolation methods, for “exceptional performance in both coding and agentic tasks,” explains the group. It claims the quasi-open source model has agentic coding, agentic browser use and agentic tool use comparable to Anthropic’s proprietary Claude Sonnet 4. Continue reading Alibaba Is Rolling Out Its ‘Most Agentic Code Model to Date’
By Paula Parisi, July 17, 2025
Amazon Web Services has created the AI Agents and Tools virtual store within the AWS Marketplace, which has added Anthropic’s Claude for Enterprise as a full-featured software-as-a-service solution. In addition to deepening ties between Anthropic and investor Amazon, the move marks a significant expansion of accessibility to a powerful LLM previously available only through Anthropic or via API through Amazon’s Bedrock developer platform. As part of its agentic push, AWS unveiled AgentCore, a developer tool that helps enterprise developers create, deploy and implement AI agents using any framework or model, and announced a $100 million investment in agent development. Continue reading AWS on Agent Push, Adds Claude Enterprise in Marketplace
By Paula Parisi, July 7, 2025
China’s Baidu has added AI and voice features to its Ernie search engine and as of June 30 officially made its generative Ernie large language model open source in a direct challenge to OpenAI and Anthropic as well as local Chinese rival DeepSeek. “This isn’t just a China story. Every time a major lab open-sources a powerful model, it raises the bar for the entire industry,” University of Southern California Associate Professor of Computer Science Sean Ren told CNBC. Baidu is also giving the Ernie mobile app more chatbot-like functionality, enabling it to help with drawing, writing and travel tasks. Continue reading Baidu Delivers AI Updates to Search, Open-Sources Ernie 4.5
By Paula Parisi, June 27, 2025
Anthropic has updated its Claude AI chatbot with the ability to build, host and share AI-powered apps directly in Claude. Launching in beta, the new function builds upon the Artifacts feature Anthropic introduced last year, allowing users to see and interact with what they asked Claude to create. “Now developers can iterate faster on their AI apps without worrying about the complexity and cost of scaling for a growing audience,” according to Anthropic. The San Francisco-based AI startup adds that millions of people have already used Claude to create more than 500 million artifacts, from productivity tools to educational games. Continue reading Anthropic’s Claude Chatbot Is Now a No-Code App Developer
By Paula Parisi, June 27, 2025
In a move to attract more developers to Gemini, Google is releasing an open-source command line interface (CLI) that will be free for most developers. CLIs offer a means to communicate with operating systems, and can be used as alternatives or complements to an integrated development environment (IDE). Gemini CLI has agentic capabilities and can code and “so much more,” according to Google, which lists content generation, problem solving, deep research and task management among its uses. Gemini CLI provides “lightweight access to Gemini, giving you the most direct path from your prompt to our model.” Continue reading Google Bows Gemini Command Line Interface for Developers
By Paula Parisi, June 11, 2025
Determined to rebound from the Llama 4 Behemoth delay, Meta Platforms is swinging for the fences in its artificial intelligence efforts, with CEO Mark Zuckerberg personally supervising creation of a new AI lab focused on superintelligence. Meta has reportedly reached a deal to pay up to $15 billion for a stake in a startup called Scale AI that would bring its 28-year-old co-founder and CEO Alexandr Wang in-house to lead the effort. Other leading Scale AI staff would also come aboard, and Meta is said to be recruiting researchers from OpenAI and Google by dangling “seven- to nine-figure compensation packages,” reports indicate. Continue reading Meta Said to Be Planning Major Push into AI Superintelligence
By Paula Parisi, May 30, 2025
Anthropic’s new mobile conversation voice mode for its large language model Claude lets it search Google Docs, Drive, Calendar and more on smartphones. Just a week after debuting two new LLMs — Claude Opus 4 and Sonnet 4 — Anthropic announced the mobile updates for its Claude AI chatbot for iOS and Android and said it is extending web search for all users on free Claude plans. While Claude’s conversational voice interface is currently available only in English and only via mobile, an API for desktop use and browser-based support are part of future plans. Amazon and Google both have investment stakes in San Francisco-based Anthropic. Continue reading Anthropic Touts Mobile Voice Mode, Free Search for Claude
By Paula Parisi, May 20, 2025
OpenAI is releasing its Codex agentic coding tool in research preview. Codex lets developers delegate simple, routine programming tasks to software engineering agents that can generate production-ready code, documenting the work as they go. Codex can work on many tasks in parallel, doing things like writing software features, answering questions about a codebase, fixing bugs, and proposing pull requests for review. According to OpenAI, “each task runs in its own cloud sandbox environment,” preloaded with the user’s repository. OpenAI began releasing Codex last week to ChatGPT Pro, Enterprise, and Team users, with support for Plus and Edu coming soon. Continue reading OpenAI Adds Codex Software Agent to Some ChatGPT Plans
By Paula Parisi, May 15, 2025
Google has formally announced the launch of its AI Futures Fund to identify and invest in artificial intelligence startups. Participating startups will get early access to Google DeepMind’s latest models (Gemini, Imagen and Veo), “hands-on support from Google researchers” and Google Cloud credits. The AI Futures Fund has been ramping up these past several months, working with startups including Fal, Replit and Synthesia, among others. In identifying up-and-coming AI startups, Google is competing against Amazon and Microsoft, which have aggressively pursued nascent firms, most notably Anthropic and OpenAI, respectively. Continue reading Google Launches AI Futures Fund to Finance, Foster Startups