
NVIDIA AI-Ready Servers From World's Leading System Manufacturers to Supercharge Generative AI for Enterprises

GFreeman

News Editor
Staff member
Joined
Mar 6, 2023
Messages
1,588 (2.39/day)
NVIDIA today announced that the world's leading system manufacturers will deliver AI-ready servers that support VMware Private AI Foundation with NVIDIA, announced separately today, to help companies customize and deploy generative AI applications using their proprietary business data. NVIDIA AI-ready servers will include NVIDIA L40S GPUs, NVIDIA BlueField-3 DPUs and NVIDIA AI Enterprise software to enable enterprises to fine-tune generative AI foundation models and deploy generative AI applications like intelligent chatbots, search and summarization tools. These servers also provide NVIDIA-accelerated infrastructure and software to power VMware Private AI Foundation with NVIDIA.

NVIDIA L40S-powered servers from leading global system manufacturers - Dell Technologies, Hewlett Packard Enterprise and Lenovo - will be available by year-end to accelerate enterprise AI. "A new computing era has begun," said Jensen Huang, founder and CEO of NVIDIA. "Companies in every industry are racing to adopt generative AI. With our ecosystem of world-leading software and system partners, we are bringing generative AI to the world's enterprises."



NVIDIA AI-ready servers are an ideal platform for businesses that will deploy VMware Private AI Foundation with NVIDIA.

"Generative AI is supercharging digital transformation, and enterprises need a fully integrated solution to more securely build applications that enable them to advance their business," said Raghu Raghuram, CEO of VMware. "Through the combined expertise of VMware, NVIDIA and our server manufacturer partners, businesses will be able to develop and deploy AI with data privacy, security and control."

Powering Generative AI Transformation in the Enterprise
NVIDIA AI-ready servers are designed to provide full-stack accelerated infrastructure and software for industries racing to adopt generative AI for a broad range of applications, including drug discovery, retail product descriptions, intelligent virtual assistants, manufacturing simulation and fraud detection.

The servers feature NVIDIA AI Enterprise, the operating system of the NVIDIA AI platform. The software provides production-ready enterprise support and security for over 100 frameworks, pretrained models, toolkits and software, including NVIDIA NeMo for LLMs, NVIDIA Modulus for simulations, NVIDIA RAPIDS for data science and NVIDIA Triton Inference Server for production AI.

Built to handle complex AI workloads with billions of parameters, L40S GPUs include fourth-generation Tensor Cores and an FP8 Transformer Engine, delivering over 1.45 petaflops of tensor processing power and up to 1.7x training performance compared with the NVIDIA A100 Tensor Core GPU.

For generative AI applications such as intelligent chatbots, assistants, search and summarization, the NVIDIA L40S enables up to 1.2x more generative AI inference performance than the NVIDIA A100 GPU.

Integrating NVIDIA BlueField DPUs drives further speedups by accelerating, offloading and isolating the tremendous compute load of virtualization, networking, storage, security and other cloud-native AI services.

NVIDIA ConnectX-7 SmartNICs offer advanced hardware offloads and ultra-low latency, delivering best-in-class, scalable performance for data-intensive generative AI workloads.

Broad Ecosystem to Speed Enterprise Generative AI Deployments
The world's leading computer makers are building NVIDIA AI-ready servers, including the Dell PowerEdge R760xa, HPE ProLiant Gen11 servers for VMware Private AI Foundation with NVIDIA, and Lenovo ThinkSystem SR675 V3.

"Generative AI is a catalyst for innovation, helping to solve some of the world's most pressing challenges," said Michael Dell, chairman and chief executive officer, Dell Technologies. "Dell Generative AI Solutions with NVIDIA AI-ready servers will play a critical role in advancing human progress by driving unprecedented levels of productivity and revolutionizing the way industries operate."

"Generative AI will usher in a new scale of productivity for enterprises, from powering chatbots and digital assistants to helping with the design and development of new solutions," said Antonio Neri, president and CEO of HPE. "We are pleased to continue working closely with NVIDIA to feature its GPUs and software in a range of enterprise tuning and inference workload solutions that will accelerate deployments of generative AI."

"Businesses are eager to adopt generative AI to power intelligent transformation," said Yang Yuanqing, chairman and CEO of Lenovo. "In collaboration with NVIDIA and VMware, Lenovo is further extending our leadership in generative AI and solidifying our unique position in helping customers in their AI journey."

Availability
NVIDIA AI-ready servers with L40S GPUs and BlueField DPUs will be available by year-end, with instances available from cloud service providers expected in the coming months.

 
Joined
May 18, 2009
Messages
2,986 (0.52/day)
Location
MN
System Name Personal / HTPC
Processor Ryzen 5900x / Ryzen 5600X3D
Motherboard Asrock x570 Phantom Gaming 4 /ASRock B550 Phantom Gaming
Cooling Corsair H100i / bequiet! Pure Rock Slim 2
Memory 32GB DDR4 3200 / 16GB DDR4 3200
Video Card(s) EVGA XC3 Ultra RTX 3080Ti / EVGA RTX 3060 XC
Storage 500GB Pro 970, 250 GB SSD, 1TB & 500GB Western Digital / lots
Display(s) Dell - S3220DGF & S3222DGM 32"
Case CoolerMaster HAF XB Evo / CM HAF XB Evo
Audio Device(s) Logitech G35 headset
Power Supply 850W SeaSonic X Series / 750W SeaSonic X Series
Mouse Logitech G502
Keyboard Black Microsoft Natural Elite Keyboard
Software Windows 10 Pro 64 / Windows 10 Pro 64
Glanced over this and I think I got the gist of it:

AI = glorified search engine; give us more money for our impressive use of "AI".
 
Joined
Jun 21, 2021
Messages
3,121 (2.42/day)
System Name daily driver Mac mini M2 Pro
Processor Apple proprietary M2 Pro (6 p-cores, 4 e-cores)
Motherboard Apple proprietary
Cooling Apple proprietary
Memory Apple proprietary 16GB LPDDR5 unified memory
Video Card(s) Apple proprietary M2 Pro (16-core GPU)
Storage Apple proprietary onboard 512GB SSD + various external HDDs
Display(s) LG UltraFine 27UL850W (4K@60Hz IPS)
Case Apple proprietary
Audio Device(s) Apple proprietary
Power Supply Apple proprietary
Mouse Apple Magic Trackpad 2
Keyboard Keychron K1 tenkeyless (Gateron Reds)
VR HMD Oculus Rift S (hosted on a different PC)
Software macOS Sonoma 14.7
Benchmark Scores (My Windows daily driver is a Beelink Mini S12 Pro. I'm not interested in benchmarking.)
What this really looks like is a continuation of the slew of announcements that they started at SIGGRAPH a couple of weeks ago. It's basically a cloud-based service like Amazon AWS. The emphasis here is on generative AI, which is one type of machine learning.

They're offering an easy way to get a start on AI without all of the expense and legwork of setting up your own hardware, software, and infrastructure. In that way, it's similar to EC2 offerings from ten years ago for general computing (I ran a few Windows applications on an EC2 instance during a couple of 1-year free trials).

In the Eighties, you would just buy time at the university computer lab. This is just the modern cloud-based version with ML GPUs and DPUs.

Nvidia already has an extensive developer ecosystem. This is yet another way to get more newcomers locked into that ecosystem and its APIs, so when they're ready to buy their own hardware, they'll pick Nvidia equipment for a smoother transition. AMD does not have the same ecosystem right now.
 