• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA and Microsoft Showcase Blackwell Preview, Omniverse Industrial AI and RTX AI PCs at Microsoft Ignite

GFreeman

News Editor
Staff member
Joined
Mar 6, 2023
Messages
1,583 (2.41/day)
NVIDIA and Microsoft today unveiled product integrations designed to advance full-stack NVIDIA AI development on Microsoft platforms and applications. At Microsoft Ignite, Microsoft announced the launch of the first cloud private preview of the Azure ND GB200 V6 VM series, based on the NVIDIA Blackwell platform. The Azure ND GB200 v6 will be a new AI-optimized virtual machine (VM) series and combines the NVIDIA GB200 NVL72 rack design with NVIDIA Quantum InfiniBand networking.

In addition, Microsoft revealed that Azure Container Apps now supports NVIDIA GPUs, enabling simplified and scalable AI deployment. Plus, the NVIDIA AI platform on Azure includes new reference workflows for industrial AI and an NVIDIA Omniverse Blueprint for creating immersive, AI-powered visuals. At Ignite, NVIDIA also announced multimodal small language models (SLMs) for RTX AI PCs and workstations, enhancing digital human interactions and virtual assistants with greater realism.



NVIDIA Blackwell Powers Next-Gen AI on Microsoft Azure
Microsoft's new Azure ND GB200 V6 VM series will harness the powerful performance of NVIDIA GB200 Grace Blackwell Superchips, coupled with advanced NVIDIA Quantum InfiniBand networking. This offering is optimized for large-scale deep learning workloads to accelerate breakthroughs in natural language processing, computer vision and more.

The Blackwell-based VM series complements previously announced Azure AI clusters with ND H200 V5 VMs, which provide increased high-bandwidth memory for improved AI inferencing. The ND H200 V5 VMs are already being used by OpenAI to enhance ChatGPT.

Azure Container Apps Enables Serverless AI Inference With NVIDIA Accelerated Computing
Serverless computing provides AI application developers increased agility to rapidly deploy, scale and iterate on applications without worrying about underlying infrastructure. This enables them to focus on optimizing models and improving functionality while minimizing operational overhead.

The Azure Container Apps serverless containers platform simplifies deploying and managing microservices-based applications by abstracting away the underlying infrastructure.

Azure Container Apps now supports NVIDIA-accelerated workloads with serverless GPUs, allowing developers to use the power of accelerated computing for real-time AI inference applications in a flexible, consumption-based, serverless environment. This capability simplifies AI deployments at scale while improving resource efficiency and application performance without the burden of infrastructure management.

Serverless GPUs allow development teams to focus more on innovation and less on infrastructure management. With per-second billing and scale-to-zero capabilities, customers pay only for the compute they use, helping ensure resource utilization is both economical and efficient. NVIDIA is also working with Microsoft to bring NVIDIA NIM microservices to serverless NVIDIA GPUs in Azure to optimize AI model performance.

NVIDIA Unveils Omniverse Reference Workflows for Advanced 3D Applications
NVIDIA announced reference workflows that help developers to build 3D simulation and digital twin applications on NVIDIA Omniverse and Universal Scene Description (OpenUSD) - accelerating industrial AI and advancing AI-driven creativity.

A reference workflow for 3D remote monitoring of industrial operations is coming soon to enable developers to connect physically accurate 3D models of industrial systems to real-time data from Azure IoT Operations and Power BI.

These two Microsoft services integrate with applications built on NVIDIA Omniverse and OpenUSD to provide solutions for industrial IoT use cases. This helps remote operations teams accelerate decision-making and optimize processes in production facilities.

The Omniverse Blueprint for precise visual generative AI enables developers to create applications that let nontechnical teams generate AI-enhanced visuals while preserving brand assets. The blueprint supports models like SDXL and Shutterstock Generative 3D to streamline the creation of on-brand, AI-generated images.

Leading creative groups, including Accenture Song, Collective, GRIP, Monks and WPP, have adopted this NVIDIA Omniverse Blueprint to personalize and customize imagery across markets.

Accelerating Gen AI for Windows With RTX AI PCs
NVIDIA's collaboration with Microsoft extends to bringing AI capabilities to personal computing devices.

At Ignite, NVIDIA announced its new multimodal SLM, NVIDIA Nemovision-4B Instruct, for understanding visual imagery in the real world and on screen. It's coming soon to RTX AI PCs and workstations and will pave the way for more sophisticated and lifelike digital human interactions.

Plus, updates to NVIDIA TensorRT Model Optimizer (ModelOpt) offer Windows developers a path to optimize a model for ONNX Runtime deployment. TensorRT ModelOpt enables developers to create AI models for PCs that are faster and more accurate when accelerated by RTX GPUs. This enables large models to fit within the constraints of PC environments, while making it easy for developers to deploy across the PC ecosystem with ONNX runtimes.

RTX AI-enabled PCs and workstations offer enhanced productivity tools, creative applications and immersive experiences powered by local AI processing.

Full-Stack Collaboration for AI Development
NVIDIA's extensive ecosystem of partners and developers brings a wealth of AI and high-performance computing options to the Azure platform.

SoftServe, a global IT consulting and digital services provider, today announced the availability of SoftServe Gen AI Industrial Assistant, based on the NVIDIA AI Blueprint for multimodal PDF data extraction, on the Azure marketplace. The assistant addresses critical challenges in manufacturing by using AI to enhance equipment maintenance and improve worker productivity.

At Ignite, AT&T will showcase how it's using NVIDIA AI and Azure to enhance operational efficiency, boost employee productivity and drive business growth through retrieval-augmented generation and autonomous assistants and agents.

View at TechPowerUp Main Site | Source
 
Joined
Sep 30, 2024
Messages
111 (1.34/day)
Does anybody know if Blackwell actually offers any additional features over the previous architecture, beyond just performance?
 
Joined
Dec 14, 2011
Messages
1,086 (0.23/day)
Location
South-Africa
Processor AMD Ryzen 9 5900X
Motherboard ASUS ROG STRIX B550-F GAMING (WI-FI)
Cooling Noctua NH-D15 G2
Memory 32GB G.Skill DDR4 3600Mhz CL18
Video Card(s) ASUS GTX 1650 TUF
Storage SAMSUNG 990 PRO 2TB
Display(s) Dell S3220DGF
Case Corsair iCUE 4000X
Audio Device(s) ASUS Xonar D2X
Power Supply Corsair AX760 Platinum
Mouse Razer DeathAdder V2 - Wireless
Keyboard Corsair K70 PRO - OPX Linear Switches
Software Microsoft Windows 11 - Enterprise (64-bit)
Do we have any new news on the availability of the RTX5000 series GPUs? Wondering if reviews/models will be in-time for Christmas. You are hiding a few under your floorboards, aren't you? :roll: @W1zzard
 

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
27,963 (3.72/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit
Wondering if reviews/models will be in time for Christmas
That seems extremely unlikely, everyone expects that NVIDIA will reveal this at CES
 
Joined
Dec 14, 2011
Messages
1,086 (0.23/day)
Location
South-Africa
Processor AMD Ryzen 9 5900X
Motherboard ASUS ROG STRIX B550-F GAMING (WI-FI)
Cooling Noctua NH-D15 G2
Memory 32GB G.Skill DDR4 3600Mhz CL18
Video Card(s) ASUS GTX 1650 TUF
Storage SAMSUNG 990 PRO 2TB
Display(s) Dell S3220DGF
Case Corsair iCUE 4000X
Audio Device(s) ASUS Xonar D2X
Power Supply Corsair AX760 Platinum
Mouse Razer DeathAdder V2 - Wireless
Keyboard Corsair K70 PRO - OPX Linear Switches
Software Microsoft Windows 11 - Enterprise (64-bit)
That seems extremely unlikely, everyone expects that NVIDIA will reveal this at CES
It's a shame, last I heard, they tried for a December launch, well, it's was just rumours, guess we will find out, I was hoping you might have heard something on the grape-vine by now. ^_^
 
Joined
Dec 1, 2020
Messages
491 (0.33/day)
Processor Ryzen 5 7600X
Motherboard ASRock B650M PG Riptide
Cooling Noctua NH-D15
Memory DDR5 6000Mhz CL28 32GB
Video Card(s) Nvidia Geforce RTX 3070 Palit GamingPro OC
Storage Corsair MP600 Force Series Gen.4 1TB
Does anybody know if Blackwell actually offers any additional features over the previous architecture, beyond just performance?
Hopefully not. I am sick of new "features" locked for the latest GPUs or "features" that barrely improve the picture quallity while the fps is halfed
 
Joined
May 10, 2023
Messages
352 (0.59/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
Does anybody know if Blackwell actually offers any additional features over the previous architecture, beyond just performance?
Improved tensor cores, support for smaller data sizes (meaning that quantized models will run even faster), Trusted execution (so resources sharing the GPU can't peek into the data from one another), dedicated decompression hardware, and better failure detection (which is REALLY important in large deployments).

I believe only the first two will end up in the consumer lineup, with the others being limited to the x100 chips.

Hopefully not. I am sick of new "features" locked for the latest GPUs or "features" that barrely improve the picture quallity while the fps is halfed
This product is not meant for consumers, nor mean to display any "pictures" whatsoever.
 
Joined
Sep 30, 2024
Messages
111 (1.34/day)
Improved tensor cores, support for smaller data sizes (meaning that quantized models will run even faster), Trusted execution (so resources sharing the GPU can't peek into the data from one another), dedicated decompression hardware, and better failure detection (which is REALLY important in large deployments).

I believe only the first two will end up in the consumer lineup, with the others being limited to the x100 chips.


This product is not meant for consumers, nor mean to display any "pictures" whatsoever.
Yeah, so it offers nothing new for gaming. I'm assuming that DLSS 4.0 will be some kind of "new" feature, and probably locked to the 50x0 series. I'm assuming it will use some kind of fake "A.I." marketing bs.

Such a shame when there is simply no competition whatsoever.
 
Joined
May 10, 2023
Messages
352 (0.59/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
Yeah, so it offers nothing new for gaming.
Those datacenter products never showed anything new for gaming, they are not meant for that.
You'll need for the consumer Blackwell release to know if there'll be anything like a DLSS 4.0 or not.
 
Joined
Sep 30, 2024
Messages
111 (1.34/day)
Those datacenter products never showed anything new for gaming, they are not meant for that.
You'll need for the consumer Blackwell release to know if there'll be anything like a DLSS 4.0 or not.
I understand that, but the chip is the same, so if there were large changes in that version, those changes would also be in the consumer version (5090).

I know there are parts of the chip that are not anything to do with AI/enterprise workloads that are of interest to gamers, such as the display output, NVenc/dec etc.
 
Joined
May 10, 2023
Messages
352 (0.59/day)
Location
Brazil
Processor 5950x
Motherboard B550 ProArt
Cooling Fuma 2
Memory 4x32GB 3200MHz Corsair LPX
Video Card(s) 2x RTX 3090
Display(s) LG 42" C2 4k OLED
Power Supply XPG Core Reactor 850W
Software I use Arch btw
but the chip is the same
It is not, really. If you want to compare, just take a look at preivous gens. A100 was way too different from the GA102, AD102 had little in common with H100, and so on and so on.
Usually the x100 chips even get a different CUDA capability number compared to the consumer variants (albeit a minor revision difference).
 
Top