Nvidia Says Rubin CPX Inference Accelerator Coming in 2026

Nvidia has designed a new class of GPU for massive-context inference, the Rubin CPX, due in late 2026. Purpose-built to speed the million-token applications used to generate video and create software, the Rubin CPX functions as a specialty accelerator, working in concert with Nvidia Vera CPUs and Rubin GPUs packaged inside the upcoming Vera Rubin NVL144 CPX rack platform. “The Vera Rubin platform will mark another leap in the frontier of AI computing,” said Nvidia CEO Jensen Huang, who expects it to revolutionize massive-context AI just as RTX did graphics and physical AI.

Europe’s Most Powerful Supercomputer Designed to Foster AI

Europe has entered the big leagues of supercomputing with Jupiter, which this month became the first European system to achieve the exascale threshold of more than one quintillion (a billion billion) operations per second. Jupiter is Europe’s most powerful compute platform and the fourth fastest worldwide. It is a hybrid platform that uses a combination of SiPearl and Nvidia chips, which respectively support HPC tasks, like simulations and data analysis, and AI workloads, such as training large language models. The platform also provides access to the Jupiter AI Factory (JAIF), a managed interface for developers and academics.

OpenAI Reportedly Turning to Broadcom for Custom AI Chips

OpenAI is said to be in talks with Broadcom about developing custom AI inference chips to run its models. On an earnings call last week, Broadcom disclosed that an AI developer had placed a $10 billion order for AI server racks using its chips. That new customer was reported to be OpenAI, which has relied primarily on hotly sought-after Nvidia GPUs for model training and deployment. Broadcom specializes in XPUs, accelerator chips designed for specific uses, like inference for ChatGPT. OpenAI CEO Sam Altman has publicly complained that a shortage of chips has impeded the company’s ability to get new models and products to market.

Nvidia Announces Continued Growth, $26 Billion in Q2 Profit

Santa Clara, California-based Nvidia reported its sales were $46.7 billion for the most recent quarter, marking 56 percent growth over the same period last year and up 6 percent sequentially. Profit rose more than 59 percent to $26.42 billion. The results, which surpassed estimates, reassured global analysts and investors that AI infrastructure spending remains strong, easing, though not erasing, anxieties about an AI bubble. This summer, the chipmaker became the first company to exceed a market cap of $4 trillion, and it is considered a global barometer for the overall health of the artificial intelligence sector.

SoftBank Invests $2 Billion in Intel as Government Mulls Stake

Japan’s SoftBank has committed to investing $2 billion in U.S. chipmaker Intel as the company struggles to gain traction in the exploding artificial intelligence space and catch up in the mobile market. SoftBank has agreed to purchase roughly 87 million Intel shares at $23 per share to become the company’s fifth- or sixth-largest shareholder. The move comes as the Trump administration deliberates converting the U.S. government’s CHIPS Act grants into a 10 percent equity stake in the company as part of its effort to revive American semiconductor manufacturing. Such a deal would make the government Intel’s largest stakeholder.

Genie 3 World Model Produces Minutes of Video in Real Time

Google DeepMind has unveiled Genie 3, a world-building model that uses text and image prompts to generate 3D environments in real time. Still in research preview, Genie 3 can output “several minutes” of video that can be navigated in real time at 24fps and a resolution of 720p. Because it remembers the rules of the world it creates, Genie 3 allows agents to predict how the environment evolves and how actions affect it. Google says world models are “a key steppingstone” to artificial general intelligence, or AGI, since they can train AI agents in “an unlimited curriculum of rich simulation.”

SIGGRAPH: Nvidia Touts Server Chip, Cosmos World Models

Nvidia has unveiled the Blackwell Server Edition GPU designed for enterprise servers. The reveal was made at the ACM SIGGRAPH 2025 computer graphics conference, which started Sunday and runs through Thursday in Vancouver. The company also introduced a host of resources for robotics developers that include a new AI family called the Cosmos World Foundation Models, or Cosmos WFMs, which generate “physics-aware” videos. Notable among them is Cosmos Reason, an open and customizable 7-billion-parameter reasoning vision language model (VLM) for physical AI and robotics.

Huawei May Challenge Nvidia with Its CloudMatrix AI System

At the World AI Conference that opened in Shanghai on Saturday, Huawei emerged as China’s best hope for driving a domestic hardware sector for advanced artificial intelligence workloads. There, Huawei debuted its CloudMatrix 384 AI system, powered by 384 of its high-performance processors, the Ascend 910C GPUs. The setup has drawn favorable comparisons to Nvidia’s flagship supercomputing platform, the GB200 NVL72, a rack-scale solution for on-site AI and HPC tasks. Huawei’s new hardware reportedly drew large crowds to its booth, but the company declined to share detailed comments or live benchmarks, suggesting a tightly controlled public presentation.

OpenAI and Oracle Confirm $30B Annual Data Center Contract

Oracle and OpenAI have confirmed the $30 billion per year AI data center contract reported earlier this month. The deal will provide OpenAI’s Stargate project with 4.5 gigawatts of additional data center capacity in the U.S. Oracle is a Stargate partner and has been working in partnership with OpenAI on the Stargate I site, coming online in Abilene, Texas. “This additional partnership with Oracle will bring us to over 5 gigawatts of Stargate AI data center capacity under development, which will run over 2 million chips,” OpenAI explains. The “investment will create new jobs, accelerate America’s reindustrialization, and help advance U.S. AI leadership.”

T-Mobile 5G Update to L4S Improves Gaming and Video Calls

T-Mobile has begun updating its 5G network to the L4S standard (Low Latency, Low Loss, Scalable), becoming the first mobile service to do so. The technology reduces latency, resulting in improved video calls and smoother cloud gaming. T-Mobile says the format is “a key step toward a smarter, programmable 5G,” describing L4S as consistently delivering “low latency, minimal packet loss and real-time responsiveness,” even under heavy traffic. The company says this marks a significant improvement in “performance-driven use cases where every millisecond matters,” including Extended Reality (XR) “and even remote driving” for driverless cars.

OpenAI Contracts Google Cloud and Debuts ChatGPT Agent

OpenAI is adding Google Cloud to its list of global infrastructure providers for ChatGPT after relying exclusively on Microsoft Azure from the chatbot’s 2022 launch until January 2025, when Stargate was announced. Oracle and CoreWeave are also OpenAI cloud providers. Oracle is a Stargate investor, as is Nvidia, which holds a minority interest in CoreWeave. OpenAI has been active as it heads toward a December deadline for transitioning to a for-profit company. Meanwhile, ChatGPT is integrating a payment system to receive commissions on sales it initiates, and yesterday OpenAI launched a new AI agent that can perform complex tasks within a user’s browser.

Perplexity Launches Comet AI Web Browser for Premium Subs

Nvidia-backed startup Perplexity AI is challenging Google with a new AI-powered web browser called Comet that is built on the company’s proprietary AI search engine. The new browser is initially available to those paying $200 per month to subscribe to the Perplexity Max plan and by invitation to those who register online for the company’s waitlist. The browser also comes with Comet Assistant, an agent that automates routine tasks such as summarizing emails and navigating webpages. Comet Assistant can be opened as a sidebar on any webpage to answer questions about the content being presented.

Twitch Announces New Features to Enhance Mobile Viewing

Twitch is upgrading its presentation capabilities with the addition of vertical viewing and dual-format streaming. The popular Amazon-owned gaming platform is also adding 2K video in open beta for all partners and affiliates, broadening accessibility for high-definition formats. In January 2024, Twitch began experimenting with HD including 4K, which remains in closed beta for select users. The format tests were part of Twitch’s Enhanced Broadcasting initiative with Nvidia and OBS Studio, purveyor of the open broadcaster format. Creators with access to Enhanced Broadcasting in Twitch Studio can use dual formatting to simultaneously stream in horizontal and vertical formats.

Dell Is Building Next DOE Supercomputer, Powered by Nvidia

The U.S. Department of Energy has commissioned Dell to deliver its next supercomputer, expected to come online in 2026. Referred to as Doudna, in honor of the Nobel Prize-winning biochemist Jennifer Doudna, it is also known as NERSC-10 for its home at the DOE’s National Energy Research Scientific Computing Center at Lawrence Berkeley National Laboratory in Berkeley, California. Powered by Nvidia’s new Vera Rubin platform, Doudna will be optimized for AI workloads and aims to deliver a greater than tenfold speed boost over NERSC’s current flagship machine, Perlmutter, while using only 2-3x the power.

Odyssey’s AI World Modeling Engine Streams Interactive 3D

Artificial intelligence startup Odyssey, which turns two this year, has unveiled an interactive streaming AI video model. Available on the web in research preview, the model generates a new video frame every 40 milliseconds, producing a stream that viewers can navigate through, much like interacting with a 3D-rendered video game, using a keyboard, game controller or smartphone. Odyssey describes the current experience as similar to “exploring a glitchy dream” and says that while “utility is limited for now” its breakthrough is based on the fact that “improvements won’t be driven by hand-built game engines, but rather by models and data.”