Microsoft Contracts with Nebius for $17.4 Billion in AI Capacity

AI infrastructure company Nebius Group NV has entered into a $17.4 billion deal to provide dedicated compute power to Microsoft from a new data center in Vineland, New Jersey. The five-year agreement could be worth up to $19.4 billion with additional capacity and services. The news sent Nebius shares surging by 49 percent on the Nasdaq, underscoring how the rapidly growing demand for AI support can influence the fate of companies. The deal added $1 billion to the value of Nebius founder Arkady Volozh’s stake. The Russian expatriate co-founded Yandex, often described as Russia’s equivalent of Google.

Nvidia Says Rubin CPX Inference Accelerator Coming in 2026

Nvidia has designed a new class of GPU for massive-context inference, the Rubin CPX, due in late 2026. Purpose-built to speed the million-token applications used to generate video and create software, the Rubin CPX functions as a specialty accelerator, working in concert with Nvidia Vera CPUs and Rubin GPUs packaged inside the upcoming Vera Rubin NVL144 CPX rack platform. “The Vera Rubin platform will mark another leap in the frontier of AI computing,” revolutionizing massive-context AI just as RTX did graphics and physical AI, said Nvidia CEO Jensen Huang.

OpenAI Reportedly Turning to Broadcom for Custom AI Chips

OpenAI is said to be in talks with Broadcom about developing custom AI inference chips to run its models. On an earnings call last week, Broadcom disclosed that an AI developer had placed a $10 billion order for AI server racks using its chips. That new customer was reported to be OpenAI, which has relied primarily on hotly sought-after Nvidia GPUs for model training and deployment. Broadcom specializes in XPUs — accelerator chips designed for specific uses, like inference for ChatGPT. OpenAI CEO Sam Altman has publicly complained that a shortage of chips has impeded the company’s ability to get new models and products to market.

Microsoft AI Introduces Proprietary Foundation, Voice Models

Microsoft is rolling out its first internally developed AI models. Branded Microsoft AI (MAI), the two initial releases are MAI-Voice-1, a “highly expressive and natural speech generation model,” and MAI-1-preview, a mixture-of-experts LLM designed for consumer-facing applications. The move demonstrates Microsoft’s intent to move beyond exclusive reliance on OpenAI models to power its Copilot assistant and other applications. By striking out on its own, Microsoft is paving a smoother road for OpenAI’s transition to a for-profit entity, which the company is scheduled to initiate by the end of the year.

Nvidia Announces Continued Growth, $26 Billion in Q2 Profit

Santa Clara, California-based Nvidia reported its sales were $46.7 billion for the most recent quarter, marking 56 percent growth over the same period last year and up 6 percent sequentially. Profit rose more than 59 percent to $26.42 billion. The results, which surpassed estimates, reassured global analysts and investors that AI infrastructure spending remains strong, easing — though not erasing — anxieties about an AI bubble. This summer, the chipmaker became the first company to exceed a market cap of $4 trillion, and it is considered a global barometer for the overall health of the artificial intelligence sector.

SoftBank Invests $2 Billion in Intel as Government Mulls Stake

Japan’s SoftBank has committed to investing $2 billion in U.S. chipmaker Intel as the company struggles to gain traction in the exploding artificial intelligence space and catch up in the mobile market. SoftBank has agreed to purchase roughly 87 million Intel shares at $23 per share to become the company’s fifth- or sixth-largest shareholder. The move comes as the Trump administration deliberates converting the U.S. government’s CHIPS Act grants into a 10 percent equity stake in the company as part of its effort to revive American semiconductor manufacturing. Such a deal would make the government Intel’s largest stakeholder.

SIGGRAPH: Nvidia Touts Server Chip, Cosmos World Models

Nvidia has unveiled the Blackwell Server Edition GPU designed for enterprise servers. The reveal was made at the ACM SIGGRAPH 2025 computer graphics conference, which started Sunday and runs through Thursday in Vancouver. The company also introduced a host of resources for robotics developers that include a new AI family called the Cosmos World Foundation Models, or Cosmos WFMs, which generate “physics-aware” videos. Notable among them is Cosmos Reason, an open and customizable 7-billion-parameter reasoning vision language model (VLM) for physical AI and robotics.

Odyssey’s AI World Modeling Engine Streams Interactive 3D

Artificial intelligence startup Odyssey, which turns two this year, has unveiled an interactive streaming AI video model. Available on the web in research preview, the model generates video streams every 40 milliseconds that viewers can navigate through — much like interacting with a 3D-rendered video game using a keyboard, game controller or smartphone. Odyssey describes the current experience as similar to “exploring a glitchy dream” and says that while “utility is limited for now” its breakthrough is based on the fact that “improvements won’t be driven by hand-built game engines, but rather by models and data.”

DGX Cloud Lepton: Nvidia’s New GPU Compute Marketplace

Nvidia is rolling out DGX Cloud Lepton, a platform that connects AI developers with GPU access available through various cloud providers. Nvidia calls it “a compute marketplace” that offers tens of thousands of GPUs through a global network that features Nvidia Cloud Partners (NCPs). Among them: CoreWeave, Crusoe, Firmus, Foxconn, GMI Cloud, Lambda, Nebius, Nscale, SoftBank Corp. and Yotta Data Services — offering Nvidia Blackwell and other architecture GPUs. Developers can tap into GPU compute capacity in specific regions for both on-demand and long-term computing, Nvidia says, adding that it expects leading cloud computing providers to eventually sign on.

Nvidia, Foxconn Plan to Build an AI Supercomputer in Taiwan

Nvidia is joining forces with Foxconn to build Taiwan’s first supercomputer. Foxconn, the world’s largest contract manufacturer of electronics, will implement the system through its subsidiary Big Innovation Company, which specializes in advanced tech solutions for enterprise. The supercomputer will leverage 10,000 Nvidia Blackwell GPUs, providing “orders-of-magnitude faster performance, compared with previous-generation systems,” said Nvidia CEO Jensen Huang in his Computex keynote. Huang also announced a new initiative that will let companies build semi-custom chips and talked up desktop supercomputers in the works with Acer and Asus.

Lightricks LTXV Makes Video Generation Faster and Cheaper

Lightricks, the company behind the Facetune and Videoleap apps, has released a new video model called LTX Video, or LTXV, that generates what the company describes as high-quality AI video at speeds up to 30 times faster than competing products, and does it using consumer-grade hardware. The open-source, 13-billion-parameter model achieves such efficiency by utilizing an approach called multiscale rendering, which generates video in progressively detailed layers. The program can run on high-end laptops and standard desktop computers, opening up generative video to an audience beyond those who have access to enterprise equipment.

Google Ironwood TPU is Made for Inference and ‘Thinking’ AI

Google has debuted a new accelerator chip, Ironwood, a tensor processing unit designed specifically for inference — the ability of AI to predict things. Ironwood will power Google Cloud’s AI Hypercomputer, which runs the company’s Gemini models and is gearing up for the next generation of artificial intelligence workloads. Google’s TPUs are similar to the accelerator GPUs sold by Nvidia, but unlike the GPUs they’re designed for AI and geared toward speeding neural network tasks and mathematical operations. Google says when deployed at scale Ironwood is more than 24 times more powerful than the world’s fastest supercomputer.

TSMC Reportedly Ready for Joint Venture with Intel Foundries

Semiconductor giant Intel has reached a tentative agreement with Taiwan’s TSMC and some U.S. firms to create a joint venture that would assume operating responsibility for Intel’s chip fabrication plants in the U.S. TSMC will reportedly hold a 20 percent stake in the JV, while Intel and the other investors would control the remaining 80 percent. This specific JV is limited to Intel’s foundry unit, which posted an operating loss of $13.4 billion in 2024 and is not expected to break even until 2027. New Intel CEO Lip-Bu Tan said at last week’s Intel Vision conference that he will spin off all non-core units.

Nintendo Switch 2 Out in June with 4K Support, In-Game Chat

Gamers have been waiting for years for Nintendo’s new gaming device and now they only have to wait until June 5 when the Switch 2 hits shelves starting at $450. The device has a larger, 1080p LCD screen, and supports 4K and in-game chat. Details were revealed during a Nintendo Direct online presentation. Nintendo, which announced details of the new console in January, will over the next few months be holding a series of global roadshow events aimed at letting people have a hands-on experience with Switch 2. The original has sold more than 150 million units.

Nvidia Forges AI Initiative to Streamline Production Workflows

During Nvidia’s GTC AI Conference in San Jose earlier this month, VP and GM of Media & Entertainment Richard Kerris presented the Nvidia Media2 initiative that builds on the company’s Blackwell GPU foundation to enable real-time AI solutions for all aspects of media production workflows. His talk showcased a broad range of generative AI breakthroughs in real-time ray tracing and VFX, video search and summarization, and musically based sound effects (SFX). Kerris also shared insights on the media industry’s reception of AI thus far and humbly implored the audience to consider using such technology as an effective new tool for storytelling.