Anthropic Claude Sonnet 4.5 Good at Coding and Collegiality

Anthropic has released its latest frontier AI model, Claude Sonnet 4.5, built for high-performance coding and building complex agents. Anthropic claims it is “the best coding model in the world” and “the best model at using computers,” with substantial gains in reasoning and math. Claude Sonnet 4.5 is distinctive in that it can build “production-ready” applications and is not limited to prototypes, Anthropic says, describing it as a major gain over previous models. The new model also excels in research as well as enterprise basics like cybersecurity and finance, per Anthropic. Continue reading Anthropic Claude Sonnet 4.5 Good at Coding and Collegiality

California Enacts an AI Law Focused on Frontier Model Safety

California has become the first state in the nation to enact an AI safety law. The Transparency in Frontier Artificial Intelligence Act requires major AI firms to regularly report safety information, keeping government apprised of the guardrails imposed when building models as well as ongoing risks presented. California is home to leading AI companies including OpenAI, Alphabet, Anthropic, Meta Platforms, Nvidia and xAI, which means the law will be something of a national standard, as the rules imposed on those companies will have follow-through effects in all states. The law also bolsters whistle-blower protections for employees of the affected firms. Continue reading California Enacts an AI Law Focused on Frontier Model Safety

Google Adds Gemini AI Assistant to Chrome Browser in U.S.

Google is rolling out “Gemini in Chrome” to U.S. Mac and Windows desktop users. Business users will get it in the weeks to come, as will Android and iOS mobile devices. The immediate change integrates “Google AI into Chrome across multiple levels so it can better anticipate your needs, help you understand more complex information, and make you more productive when you browse the web.” There are a number of safety features that leverage AI to combat scams and handle things like automatic password resets. And Gemini in Chrome will soon be able to recall websites previously visited without requiring you to scroll through your browsing history. An agentic browsing assistant is also in the works. Continue reading Google Adds Gemini AI Assistant to Chrome Browser in U.S.

Anthropic and OpenAI Report Findings of Joint AI Safety Tests

OpenAI and Anthropic — rivals in the AI space who guard their proprietary systems — joined forces for a misalignment evaluation, safety testing each other’s models to identify when and how they fall short of human values. Among the findings: reasoning models including Anthropic’s Claude Opus 4 and Sonnet 4, and OpenAI’s o3 and o4-mini resist jailbreaks, while conversational models like GPT-4.1 were susceptible to prompts or techniques intended to bypass safety protocols. Although the test results were unveiled as users complain chatbots have become overly sycophantic, the tests were “primarily interested in understanding model propensities for harmful action,” per OpenAI. Continue reading Anthropic and OpenAI Report Findings of Joint AI Safety Tests

DeepSeek-V3.1 Offered with Improvements in Speed, Context

This week, DeepSeek-V3.1 dropped on Hugging Face. Media outlets immediately began citing benchmark scores that rival proprietary systems from OpenAI and Anthropic for a system that is available via a permissive license, facilitating wide access. The 685-billion parameter Mixture-of-Experts (MoE) model has 37 billion active parameters and is designed for efficiency. It builds on DeepSeek-pioneered processes like multi-head latent attention (MLA) and multi-token prediction (MTP) to optimize inference, enabling high-performance computing on both enterprise servers loaded with H100 GPUs and consumer hardware like a Mac Studio or comparably powered PC. Continue reading DeepSeek-V3.1 Offered with Improvements in Speed, Context

New Anthropic Safety Updates Focus on Claude’s Well-Being

Claude Opus 4 and 4.1 now have the discrete ability to end “abusive” or “harmful” conversations in consumer chat interfaces. Anthropic says the feature was developed as part of its exploratory work on the protection and well-being of its AI models. The company also envisions broader safety uses, although it does point out that having a model defensively terminate a chat is an extreme measure, intended for use in rare cases. “We’re working to identify and implement low-cost interventions to mitigate risks to model welfare,” Anthropic explains, qualifying it is unsure “such welfare is possible.” Continue reading New Anthropic Safety Updates Focus on Claude’s Well-Being

Grok 4 Offered Free in xAI Move on ChatGPT-5 Market Share

Elon Musk’s xAI has made Grok 4 available on its free tiers as it seeks to take advantage of initial user dissatisfaction with OpenAI’s new GPT-5. The company has positioned Grok as freewheeling and uncensored, a contrast to GPT-5, which has been criticized on Reddit and other social platforms as a “corporate beige zombie” with too many guardrails. After its February debut, Grok 3 was reined-in with checks including removal of its native image generator in March. Grok 4 was released in July with integrated image and video features as well as a “Spicy” mode for creating risqué content. Continue reading Grok 4 Offered Free in xAI Move on ChatGPT-5 Market Share

Anthropic Seeks to Raise $5 Billion, Debuts Claude Opus 4.1

Anthropic has released Claude Opus 4.1, an upgrade to Opus 4 that reportedly improves on agentic tasks, computer coding and reasoning. Pricing has not increased from what customers were paying for Opus 4, and the company promises “substantially larger improvements to our models in the coming weeks.” The move comes as Anthropic nears a new funding round targeting $3 to $5 billion, which could place a valuation of up to $170 billion on the startup. Recurring revenue hit $5 billion as of late July, which could increase to $9 billion by the end of the year. Claude Opus 4.1 was released two days before OpenAI unleashed GPT-5, and performs comparably in coding benchmarks. Continue reading Anthropic Seeks to Raise $5 Billion, Debuts Claude Opus 4.1

Open-Weight Models Are a First from OpenAI in AWS Catalog

OpenAI is releasing two lower-cost, open-weight reasoning models in an effort to be more competitive with Meta, Mistral and DeepSeek and they will be the first OpenAI models available from Amazon. The new offerings — gpt-oss-120b and gpt-oss-20b — will be among the model choices on AWS’s Bedrock and SageMaker AI services. Both models are said to be well-suited for agentic use. The gpt-oss-120b model performs comparably to OpenAI o4-mini on core reasoning and can run on a single 80GB GPU. The gpt-oss-20b model is compared to OpenAI o3‑mini and can run on edge devices with just 16GB of memory. Continue reading Open-Weight Models Are a First from OpenAI in AWS Catalog

Alibaba Is Rolling Out Its ‘Most Agentic Code Model to Date’

Alibaba’s Qwen team has launched Qwen3-Coder, which it calls its “most agentic code model to date.” While it will be made available in multiple sizes, the most powerful variant — Qwen3-Coder-480B-A35B-Instruct — is being released first. The 480 billion parameter mixture-of-experts model has 35 billion active parameters supporting a context length of 256,000 tokens natively and 1 million tokens with extrapolation methods for “exceptional performance in both coding and agentic tasks,” explains the group, which claims the quasi-open source model has agentic coding, agentic browser use, and agentic tool use comparable to Anthropic’s proprietary Claude Sonnet 4. Continue reading Alibaba Is Rolling Out Its ‘Most Agentic Code Model to Date’

AWS on Agent Push, Adds Claude Enterprise in Marketplace

Amazon Web Services has created the AI Agents and Tools virtual store within the AWS Marketplace, which has added Anthropic’s Claude for Enterprise as a full-featured software-as-a-service solution. In addition to deepening ties between Anthropic and investor Amazon, the move marks a significant expansion of accessibility to a powerful LLM previously available only through Anthropic or via API through Amazon’s Bedrock developer platform. As part of its agentic push, AWS unveiled AgentCore, a developer tool that helps enterprise developers create, deploy and implement AI agents using any framework or model, and announced a $100 million investment in agent development. Continue reading AWS on Agent Push, Adds Claude Enterprise in Marketplace

Baidu Delivers AI Updates to Search, Open-Sources Ernie 4.5

China’s Baidu has added AI and voice features to its Ernie search engine and as of June 30 officially made its generative Ernie large language model open source in a direct challenge to OpenAI and Anthropic as well as local Chinese rival DeepSeek. “This isn’t just a China story. Every time a major lab open-sources a powerful model, it raises the bar for the entire industry,” University of Southern California Associate Professor of Computer Science Sean Ren told CNBC. Baidu is also giving the Ernie mobile app more chatbot-like functionality, enabling it to help with drawing, writing and travel tasks. Continue reading Baidu Delivers AI Updates to Search, Open-Sources Ernie 4.5

Anthropic’s Claude Chatbot Is Now a No-Code App Developer

Anthropic has updated its Claude AI chatbot with the ability to build, host and share AI-powered apps directly in Claude. Launching in beta, the new function builds upon the Artifacts feature Anthropic introduced last year, allowing users to see and interact with what they asked Claude to create. “Now developers can iterate faster on their AI apps without worrying about the complexity and cost of scaling for a growing audience,” according to Anthropic. The San Francisco-based AI startup adds that millions people have already used Claude to create more than 500 million artifacts — from productivity tools to educational games. Continue reading Anthropic’s Claude Chatbot Is Now a No-Code App Developer

Google Bows Gemini Command Line Interface for Developers

In a move to attract more developers to Gemini, Google is releasing an open-source command line interface (CLI) that will be free for most developers. CLIs offer a means to communicate with operating systems, and can be used as alternatives or complementary to an integrated developer environment (IDE). Gemini CLI has agentic capabilities and can code and “so much more,” according to Google, which lists content generation, problem solving, deep research and task management among its uses. Gemini CLI provides “lightweight access to Gemini, giving you the most direct path from your prompt to our model.” Continue reading Google Bows Gemini Command Line Interface for Developers

Meta Said to Be Planning Major Push into AI Superintelligence

Determined to rebound from the Llama 4 Behemoth delay, Meta Platforms is swinging for the fences in its artificial intelligence efforts, with CEO Mark Zuckerberg personally supervising creation of a new AI lab focused on superintelligence. Meta has reportedly reached a deal to pay up to $15 billion for a stake in a startup called Scale AI that would bring its 28-year-old co-founder and CEO Alexandr Wang in-house to lead the effort. Other leading Scale AI staff would also come aboard, and Meta is said to be recruiting researchers from OpenAI and Google by dangling “seven- to nine-figure compensation packages,” reports indicate. Continue reading Meta Said to Be Planning Major Push into AI Superintelligence