GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell. Continue reading GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia Revenue and Profits Soar on Strength of AI Chip Sales

Demand for artificial intelligence computer chips drove Nvidia income up 769 percent to nearly $12.3 billion for Q4, year-over-year, and 286 percent — to just over $29.7 billion — for the full-year fiscal 2024 frame that ended January 28. Revenue was $22.1 billion (+265 percent) and $60.9 billion (+126 percent) for the respective periods. Data center sales hit record highs of $18.4 billion for the quarter, up 409 percent from the previous year, $47.5 billion for the fiscal year, an increase of 217 percent. Gaming revenue was flat for Q4, at $2.9 billion, and up 115 percent for the year. Continue reading Nvidia Revenue and Profits Soar on Strength of AI Chip Sales

Amazon Unveils Productivity Chatbot, Gets Nvidia Superchip

Amazon Web Services is introducing Amazon Q, an AI chatbot geared toward enterprise clients who can customize it to increase productivity for their specific business needs. AWS also announced that it has updated its homegrown Graviton4 chips for a 30 percent performance boost. AWS confirmed it will be the first Big Tech firm to deploy the latest version of Nvidia’s Grace Hopper Superchip AI accelerator, and additionally will become a data center host for Nvidia’s DGX Cloud service. The announcements were disclosed at the AWS re:Invent conference in Las Vegas. Continue reading Amazon Unveils Productivity Chatbot, Gets Nvidia Superchip

Nvidia Sales Surge as Rivals Circle and China Sanctions Loom

Nvidia logged another record quarter, with Q3 revenue of $18.12 billion, up 206 percent from a year ago and a 34 percent increase from Q2 that exceeded both its own and analyst projections. The surge, attributed to increasing demand for the chips that drive artificial intelligence, logged primarily under Nvidia’s data center results a record $14.51 billion, up 279 percent from the prior year and 41 percent from Q2. Profits swelled to $9.2 billion, a stunning 1,259 percent increase from 2022’s $680 million. The results for Nvidia’s Q3 were for the three-month period ending October 31. Continue reading Nvidia Sales Surge as Rivals Circle and China Sanctions Loom

Tech Titans Convene in Washington for First AI Insight Forum

The first U.S. Senate AI Insight Forum was a lively event, with xAI’s Elon Musk calling for a federal department of artificial intelligence while Meta’s Mark Zuckerberg emphasized a need for transparency and Google’s Sundar Pichai stressed AI’s potential to improve the human condition with regard to things like health and energy. The three-hour meeting was organized by Senate Majority Leader Chuck Schumer (D-New York) who said the crash course would address both how AI “enriches our world and opens the door to new prosperity” and how society can “minimize the very real risks.” Continue reading Tech Titans Convene in Washington for First AI Insight Forum

Google Takes on the Competition with Cloud and AI Services

Google is making many of its most powerful cloud computing tools available commercially for the first time, Google Cloud CEO Thomas Kurian shared at the company’s Cloud Next ’23 conference in San Francisco. In a bid to catch up with top AI rivals such as Amazon and Microsoft, the Google Distributed Cloud will open for general business including at the edge with Vertex AI and PaLM 2. Google Cloud will serve up AI from Anthropic, in which it is an investor, as well as from Meta Platforms. In addition, an AI-infused Gmail productivity suite is on the way. Continue reading Google Takes on the Competition with Cloud and AI Services

Demand for AI Chips Drives Nvidia to Revenue Record in Q2

Nvidia announced Q2 revenue of $13.51 billion, a 101 percent year-over-year increase that sets a new company record. The data center division — which accounts for the majority of AI chip sales — also established a new benchmark: $10.32 billion in Q2, a 171 percent leap over the prior fiscal Q2. Nvidia projects that revenue for the current quarter will hit $16 billion — about $3.5 billion above analysts’ expectations. Nvidia chips power OpenAI’s popular ChatGPT and other generative AI and cloud computing apps from companies including Amazon, Google, Meta Platforms, Microsoft and VMWare. Continue reading Demand for AI Chips Drives Nvidia to Revenue Record in Q2

Nvidia’s NeMo Delivers AI Customization to Snowflake Cloud

Bozeman, Montana-based DaaS firm Snowflake has partnered with Nvidia to let clients customize LLMs (large language models) using proprietary data in the Snowflake Data Cloud. Nvidia’s NeMo platform and GPU-accelerated computing will power the effort to tailor models to specific business use cases, such as chatbots with category expertise as opposed to generalists, search engines attuned to context or generative text deep knowledge. Since most companies are eager to harness brand-specific AI without having to build a model from scratch, this category of service is generating a lot of interest. Continue reading Nvidia’s NeMo Delivers AI Customization to Snowflake Cloud

Nvidia Announces a Wide Range of AI Initiatives at Computex

Nvidia CEO Jensen Huang’s keynote at Computex Taipei marked the official launch of the company’s Grace Hopper Superchip, a breakthrough in accelerated processing, designed for giant-scale AI and high-performance computing applications. Huang also raised the curtain on Nvidia’s new supercomputer, the DGX GH200, which connects 256 Hopper chips into a single data-center-sized GPU with 144 terabytes of scalable shared memory to build massive AI models at the enterprise level. Google, Meta and Microsoft are among the first in line to gain access to the DGX GH200, positioned as “a blueprint for future hyperscale generative AI infrastructure.” Continue reading Nvidia Announces a Wide Range of AI Initiatives at Computex

AI Helps Steer Nvidia Toward $1 Trillion Market Capitalization

Nvidia announced $7.19 billion in revenue for the first quarter ended April 30. That’s down 13 percent compared to the February through April frame in 2022, but up 19 percent from Q4, which ended January 29. Nvidia has forecast a stunning $11 billion in sales for Q2. That projected 64 percent increase puts Nvidia on track to be the first chip company with a $1 trillion valuation. CEO Jensen Huang attributes the sales spike to exploding demand for GPUs to run artificial intelligence systems. “We are significantly increasing our supply to meet surging demand for them,” Huang said of the processors. Continue reading AI Helps Steer Nvidia Toward $1 Trillion Market Capitalization

Nvidia Introduces Cloud Services to Leverage AI Capabilities

Nvidia is launching new cloud services to help businesses leverage AI at scale. Under the banner Nvidia AI Foundations, the company is providing tools to let clients build and run their own generative AI models that are custom trained on data specific to the intended task. The individual cloud offerings are Nvidia NeMo for language models and Nvidia Picasso for 3D visuals including video and images. Speaking at Nvidia’s annual GPU Technology Conference (GTC) last week, CEO Jensen Huang said “the impressive capabilities of generative AI have created a sense of urgency for companies to reimagine their products and business models.” Continue reading Nvidia Introduces Cloud Services to Leverage AI Capabilities

Nvidia Chief Suggests ChatGPT Marks an AI Inflection Point

Nvidia CEO Jensen Huang has declared OpenAI’s ChatGPT as creating an “iPhone moment for artificial intelligence.” Speaking at the Haas School of Business at Berkeley, Huang suggested that ChatGPT is revolutionary for engaging the imagination of millions and opening their eyes to the possibilities the technology holds, much as Apple’s iPhone did for mobile computing, ushering in a new era. ChatGPT has taken the world by storm, and it is the diversity of use that Huang feels makes it special — with some putting it to work to create code, while others use it to write fiction or plan meals and much more. Continue reading Nvidia Chief Suggests ChatGPT Marks an AI Inflection Point

TSMC’s Advanced Chipmaking Plans Leak Before Biden Visit

TSMC has revised plans for its Arizona chip plant, reportedly the result of pressure from customers including Apple, Nvidia and AMD, who urged the Taiwanese company to reconsider its plan to output 5-nanometer processors that will be old news by the time the $12 billion plant opens in 2024. TSMC is expected to announce during a scheduled Tuesday visit by President Biden and Commerce Secretary Gina Raimondo that it will output advanced 4-nanometer chips when production commences and will add a second nearby plant to manufacture even more sophisticated 3-nanometer chips. Continue reading TSMC’s Advanced Chipmaking Plans Leak Before Biden Visit

Nvidia Debuts New AI Model That Quickly Generates Objects

Nvidia Research is introducing a new AI model that largely automates the process of creating virtual worlds, making it easier for developers to populate games and VR experiences with a diverse array of 3D buildings, vehicles, characters and more. Trained using only 2D images, GET3D generates 3D shapes with high-fidelity textures and complex geometric details. GET3D can generate “a virtually unlimited number of 3D shapes based on the data it’s trained on,” according to Nvidia, which says the objects can be used in 3D representations of buildings or the great outdoors, in games or the metaverse. Continue reading Nvidia Debuts New AI Model That Quickly Generates Objects

Nvidia Introduces AI-Powered GPUs and Cloud LLM Services

“Computing is advancing at incredible speeds. Acceleration is propelling this rocket, and it’s fuel is AI,” Nvidia founder and CEO Jensen Huang said in his 2022 GTC conference keynote, announcing two new AI services: the Nvidia NeMo large language model service, which helps customize LLMs, and the Nvidia BioNeMo LLM service, aimed at bio researchers. Nvidia also unveiled its GeForce RTX 40 Series GPUs, shipping Q4. Powered by the company’s new architecture, Ada Lovelace, the two new models — GeForce RTX 4090 and GeForce RTX 4080 — offer better ray tracing performance and AI-based neural graphics. Continue reading Nvidia Introduces AI-Powered GPUs and Cloud LLM Services