By Paula Parisi
August 25, 2023
Nvidia announced Q2 revenue of $13.51 billion, a 101 percent year-over-year increase that sets a new company record. The data center division, which accounts for the majority of AI chip sales, also established a new benchmark: $10.32 billion in Q2, a 171 percent leap over the prior fiscal Q2. Nvidia projects that revenue for the current quarter will hit $16 billion, about $3.5 billion above analysts’ expectations. Nvidia chips power OpenAI’s popular ChatGPT and other generative AI and cloud computing apps from companies including Amazon, Google, Meta Platforms, Microsoft and VMware. Continue reading Demand for AI Chips Drives Nvidia to Revenue Record in Q2
By Paula Parisi
June 30, 2023
Bozeman, Montana-based DaaS firm Snowflake has partnered with Nvidia to let clients customize LLMs (large language models) using proprietary data in the Snowflake Data Cloud. Nvidia’s NeMo platform and GPU-accelerated computing will power the effort to tailor models to specific business use cases, such as chatbots with category expertise rather than general knowledge, search engines attuned to context, or generative text tools with deep subject knowledge. Since most companies are eager to harness brand-specific AI without having to build a model from scratch, this category of service is generating a lot of interest. Continue reading Nvidia’s NeMo Delivers AI Customization to Snowflake Cloud
By Paula Parisi
May 31, 2023
Nvidia CEO Jensen Huang’s keynote at Computex Taipei marked the official launch of the company’s Grace Hopper Superchip, a breakthrough in accelerated processing designed for giant-scale AI and high-performance computing applications. Huang also raised the curtain on Nvidia’s new supercomputer, the DGX GH200, which connects 256 Grace Hopper Superchips into a single data-center-sized GPU with 144 terabytes of scalable shared memory to build massive AI models at the enterprise level. Google, Meta and Microsoft are among the first in line to gain access to the DGX GH200, positioned as “a blueprint for future hyperscale generative AI infrastructure.” Continue reading Nvidia Announces a Wide Range of AI Initiatives at Computex
By Paula Parisi
May 26, 2023
Nvidia announced $7.19 billion in revenue for the first quarter ended April 30. That’s down 13 percent compared to the February-through-April period in 2022, but up 19 percent from Q4, which ended January 29. Nvidia has forecast a stunning $11 billion in sales for Q2. That projected 64 percent year-over-year increase puts Nvidia on track to be the first chip company with a $1 trillion valuation. CEO Jensen Huang attributes the sales spike to exploding demand for GPUs to run artificial intelligence systems. “We are significantly increasing our supply to meet surging demand for them,” Huang said of the processors. Continue reading AI Helps Steer Nvidia Toward $1 Trillion Market Capitalization
By Paula Parisi
March 29, 2023
Nvidia is launching new cloud services to help businesses leverage AI at scale. Under the banner Nvidia AI Foundations, the company is providing tools to let clients build and run their own generative AI models that are custom trained on data specific to the intended task. The individual cloud offerings are Nvidia NeMo for language models and Nvidia Picasso for visual content including images, video and 3D. Speaking at Nvidia’s annual GPU Technology Conference (GTC) last week, CEO Jensen Huang said “the impressive capabilities of generative AI have created a sense of urgency for companies to reimagine their products and business models.” Continue reading Nvidia Introduces Cloud Services to Leverage AI Capabilities
By Paula Parisi
February 27, 2023
Nvidia CEO Jensen Huang has declared that OpenAI’s ChatGPT has created an “iPhone moment for artificial intelligence.” Speaking at the Haas School of Business at Berkeley, Huang suggested that ChatGPT is revolutionary for engaging the imagination of millions and opening their eyes to the possibilities the technology holds, much as Apple’s iPhone did for mobile computing, ushering in a new era. ChatGPT has taken the world by storm, and it is the diversity of use that Huang feels makes it special: some put it to work creating code, while others use it to write fiction, plan meals and much more. Continue reading Nvidia Chief Suggests ChatGPT Marks an AI Inflection Point
By Paula Parisi
December 5, 2022
TSMC has revised plans for its Arizona chip plant, reportedly under pressure from customers including Apple, Nvidia and AMD. They urged the Taiwanese company to reconsider its plan to produce 5-nanometer processors, which will be old news by the time the $12 billion plant opens in 2024. TSMC is expected to announce during a scheduled Tuesday visit by President Biden and Commerce Secretary Gina Raimondo that it will produce advanced 4-nanometer chips when production commences and will add a second nearby plant to manufacture even more sophisticated 3-nanometer chips. Continue reading TSMC’s Advanced Chipmaking Plans Leak Before Biden Visit
By Paula Parisi
September 27, 2022
Nvidia Research is introducing a new AI model that largely automates the process of creating virtual worlds, making it easier for developers to populate games and VR experiences with a diverse array of 3D buildings, vehicles, characters and more. Trained using only 2D images, GET3D generates 3D shapes with high-fidelity textures and complex geometric details. GET3D can generate “a virtually unlimited number of 3D shapes based on the data it’s trained on,” according to Nvidia, which says the objects can be used in 3D representations of buildings or the great outdoors, in games or the metaverse. Continue reading Nvidia Debuts New AI Model That Quickly Generates Objects
By Paula Parisi
September 22, 2022
“Computing is advancing at incredible speeds. Acceleration is propelling this rocket, and its fuel is AI,” Nvidia founder and CEO Jensen Huang said in his 2022 GTC conference keynote, announcing two new AI services: the Nvidia NeMo large language model service, which helps customize LLMs, and the Nvidia BioNeMo LLM service, aimed at bio researchers. Nvidia also unveiled its GeForce RTX 40 Series GPUs, shipping Q4. Powered by the company’s new architecture, Ada Lovelace, the two new models (GeForce RTX 4090 and GeForce RTX 4080) offer better ray tracing performance and AI-based neural graphics. Continue reading Nvidia Introduces AI-Powered GPUs and Cloud LLM Services
By Paula Parisi
August 15, 2022
Nvidia founder and CEO Jensen Huang shared his vision for a computer graphics industry transformed by AI, the metaverse and digital humans. “The combination of AI and computer graphics will power the metaverse, the next evolution of the Internet,” Huang told attendees at SIGGRAPH 2022 in Vancouver. To support this transformation, Nvidia unveiled the Avatar Cloud Engine (ACE) and discussed plans to build out the Universal Scene Description (USD) industry standard, which Huang called “the language of the metaverse.” New extensions for Omniverse and graphics workflow optimizations using machine learning were also part of the mix.
Continue reading Nvidia Unveils New Tools for AI, the Metaverse at SIGGRAPH
By Paula Parisi
May 10, 2022
Nvidia has begun previewing its latest H100 Tensor Core GPU, promising “an order-of-magnitude performance leap for large-scale AI and HPC” over previous iterations, according to the company. Nvidia founder and CEO Jensen Huang announced the Hopper architecture earlier this year, and IT professionals’ website ServeTheHome recently had a chance to see an H100 SXM5 module demonstrated. Consuming up to 700W in an effort to deliver 60 FP64 Tensor teraflops, the module (which features 80 billion transistors and has 8448/16896 FP64/FP32 cores in addition to 528 Tensor cores) is described as “monstrous” in the best way. Continue reading Nvidia Touts New H100 GPU and Grace CPU Superchip for AI
By Paula Parisi
March 24, 2022
Nvidia CEO Jensen Huang announced a host of new AI tech geared toward data centers at the GTC 2022 conference this week. Available in Q3, the H100 Tensor Core GPUs are built on the company’s new Hopper GPU architecture. Huang described the H100 as the next “engine of the world’s AI infrastructures.” Hopper debuts in Nvidia DGX H100 systems designed for enterprise. With data centers, “companies are manufacturing intelligence and operating giant AI factories,” Huang said, speaking from a real-time virtual environment in the firm’s Omniverse 3D simulation platform. Continue reading Nvidia Introduces New Architecture to Power AI Data Centers
By Paula Parisi
February 9, 2022
Nvidia has scrapped plans to buy Arm from SoftBank Group due to “significant regulatory challenges preventing the consummation of the transaction,” according to a joint statement that indicates Arm will proceed with plans for an IPO. In what is being positioned as a coincidence of timing, Arm says Simon Segars has resigned as CEO, with Rene Haas, formerly president, stepping into the role. After being announced in September 2020, the $40 billion deal faced opposition from both the European Commission and the Federal Trade Commission, which in December sued to block the sale. Continue reading Nvidia Calls Off $40 Billion Acquisition of Arm from Softbank
By Paula Parisi
November 24, 2021
Nvidia is fast-tracking its cybersecurity efforts, emphasizing zero trust through new product integrations designed to protect enterprise customers from attack while supporting artificial intelligence, machine learning and server workloads that scale. At its GTC 2021 event earlier this month, Nvidia promoted its full-stack data center security solution: DOCA 1.2 accelerated software, running on BlueField-3 DPUs using the Morpheus AI framework. The configuration can “secure a data center at every touchpoint,” including users, devices and the data itself, Nvidia founder and CEO Jensen Huang explained. Continue reading Nvidia Introduces a Full-Stack Solution for Zero Trust Security
By Paula Parisi
November 11, 2021
Nvidia is mapping out a customer service future populated with real-time avatars that use natural-language AI to interact with real-world customers. The company, which has transformed from graphics powerhouse to AI authority in the just under 28 years since it was founded by CEO Jensen Huang, used this week’s GTC conference to emphasize full-stack computing. The speed and flexibility of the company’s three GPU chips offer general purpose enterprise potential, thanks to Nvidia’s parallel-processing platform, CUDA. Huang backed this assertion with a slide indicating Nvidia has deployed more than 150 SDKs to industries generating $1 trillion. Continue reading Nvidia Goes Full-Stack, Touts Artificial Intelligence and Cloud