
NVIDIA Releases Digital Human Microservices, Paving Way for Future of Generative AI Avatars

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
46,891 (7.62/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
NVIDIA today announced the general availability of NVIDIA ACE generative AI microservices to accelerate the next wave of digital humans, as well as new generative AI breakthroughs coming soon to the platform. Companies in customer service, gaming and healthcare are the first to adopt ACE technologies to simplify creating, animating and operating lifelike digital humans across customer service, telehealth, gaming and entertainment.

The suite of NVIDIA ACE digital human generative AI technologies now generally available includes:
  • NVIDIA Riva ASR, TTS and NMT—for automatic speech recognition, text-to-speech conversion and translation
  • NVIDIA Nemotron LLM—for language understanding and contextual response generation
  • NVIDIA Audio2Face—for realistic facial animation based on audio tracks
  • NVIDIA Omniverse RTX—for real-time, path-traced realistic skin and hair
Newly announced technologies include:
  • NVIDIA Audio2Gesture — for generating body gestures based on audio tracks, available soon
  • NVIDIA Nemotron-3 4.5B — a new small language model (SLM) purpose-built for low-latency, on-device RTX AI PC inference
"Digital humans will revolutionize industries," said Jensen Huang, founder and CEO of NVIDIA. "Breakthroughs in multi-modal large language models and neural graphics, delivered by NVIDIA ACE to our ecosystem of developers, are bringing us closer to a future of intent-driven computing, where interacting with computers is as natural as interacting with humans."

Digital Humans Come to 100 Million RTX AI PCs
To date, NVIDIA has provided ACE as NIM microservices for developers to operate in data centers. Now NVIDIA is building ACE PC NIM microservices for deployment across the installed base of 100 million RTX AI PCs and laptops.

These include NVIDIA Nemotron-3 4.5B, the company's first SLM, which has been purpose-built to run on device with similar levels of precision and accuracy as large language models (LLMs) running in the cloud. Nemotron-3 4.5B SLM is now in early access. NVIDIA Audio2Face and NVIDIA Riva ASR on-device models will be available soon in early access.

The new NVIDIA AI Inference Manager software development kit simplifies the deployment of ACE to PCs. It preconfigures the PC with the necessary AI models, engines and dependencies while orchestrating AI inference seamlessly across PCs and the cloud.

An updated version of the Covert Protocol tech demo, developed in collaboration with Inworld AI, is being shown at the COMPUTEX trade show. Using Audio2Face and Riva ASR running locally on GeForce RTX PCs, the demo allows players to interact with and influence digital-human non-playable characters (NPCs) using conversational language to complete their mission.

Digital Human Ecosystem Expands With Latest ACE Technologies
ACE is gaining traction with developers building a variety of applications at companies such as Aww Inc., Dell Technologies, Gumption, Hippocratic AI, Inventec, OurPalm, Perfect World Games, Reallusion, ServiceNow, Soulbotix, SoulShell and UneeQ.

Aww Inc., a pioneering virtual human company based in Japan, launched its first virtual celebrity, Imma, in 2018. Imma has since become the face of major global brands in more than 50 countries. Now, Aww Inc. plans to leverage ACE Audio2Face microservices for real-time animation, enabling a highly interactive communication experience with its users.

Perfect World Games, a game developer and publisher, is adopting ACE in its new mythological wilderness tech demo, Legends. Players can converse with a realistic, multilingual AI NPC in both English and Mandarin. Using NVIDIA Audio2Face NIM, the character's audio responses generate realistic facial animation in real time.

Inventec, a major technology company that is investing heavily in AI, is using NVIDIA Audio2Face NIM to enhance its healthcare AI agent within the VRSTATE platform. The integration provides a more engaging, comforting virtual consultation experience. At COMPUTEX, Inventec is showcasing an AI agent that can help patients access information about their health.

ServiceNow, the AI platform for business transformation, recently showcased ACE NIM in a generative AI service agent demo for its Now Assist Gen AI Experience, highlighting the potential for digital avatars to enhance customer and employee interactions across industries including retail, travel and more.

Dell Technologies unveiled its cutting-edge Dell Generative AI Solution for Digital Assistants at Dell Technologies World last month. The offering allows businesses to leverage intelligent digital assistants that engage customers through natural conversations across various industries such as retail, healthcare and customer service.

NVIDIA Celebrates Digital Human Startups at COMPUTEX 2024
NVIDIA art teams used generative AI tools built on ACE, including Synthesia and Hour One, to produce a "digital Jensen" avatar that was generated as video from text.

The multilingual avatar featured Huang's unique voice and style, generated by ElevenLabs' proprietary AI speech and voice technology in Mandarin Chinese and English. NVIDIA also collaborated with Voicemod, an NVIDIA Inception member specializing in AI voice technology, to compose the ending theme song of Huang's keynote.

ACE NIM Now Available
NVIDIA ACE NIM microservices for server deployments including Riva and Audio2Face are now in production, adding NVIDIA AI Enterprise software for developers to receive enterprise-class support. Register for early access to ACE NIM microservices that run on RTX AI PCs.

 
Joined
Nov 27, 2023
Messages
1,729 (6.60/day)
System Name The Workhorse
Processor AMD Ryzen R9 5900X
Motherboard Gigabyte Aorus B550 Pro
Cooling CPU - Noctua NH-D15S Case - 3 Noctua NF-A14 PWM at the bottom, 2 Fractal Design 180mm at the front
Memory GSkill Trident Z 3200CL14
Video Card(s) NVidia GTX 1070 MSI QuickSilver
Storage Adata SX8200Pro
Display(s) LG 32GK850G
Case Fractal Design Torrent
Audio Device(s) FiiO E-10K DAC/Amp, Samson Meteorite USB Microphone
Power Supply Corsair RMx850 (2018)
Mouse Razer Viper (Original)
Keyboard Cooler Master QuickFire Rapid TKL keyboard (Cherry MX Black)
Software Windows 11 Pro (23H2)
I don’t think that this passes the “not uncanny valley” test. If anything, I feel it will be far creepier in practice than it seems during presentations.
 
Joined
Jan 3, 2021
Messages
3,081 (2.33/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
More rogue copilots.
 
Joined
May 29, 2012
Messages
520 (0.12/day)
System Name CUBE_NXT
Processor i9 12900K @ 5.0Ghz all P-cores with E-cores enabled
Motherboard Gigabyte Z690 Aorus Master
Cooling EK AIO Elite Cooler w/ 3 Phanteks T30 fans
Memory 64GB DDR5 @ 5600Mhz
Video Card(s) EVGA 3090Ti Ultra Hybrid Gaming w/ 3 Phanteks T30 fans
Storage 1 x SK Hynix P41 Platinum 1TB, 1 x 2TB, 1 x WD_BLACK SN850 2TB, 1 x WD_RED SN700 4TB
Display(s) Alienware AW3418DW
Case Lian-Li O11 Dynamic Evo w/ 3 Phanteks T30 fans
Power Supply Seasonic PRIME 1000W Titanium
Software Windows 11 Pro 64-bit
Looks absolutely fucking garbage like all of the NPC "AI" crap nvidia keeps showing off. Worthless trash.
 
Joined
Jul 29, 2022
Messages
450 (0.60/day)
I don’t think that this passes the “not uncanny valley” test. If anything, I feel it will be far creepier in practice than it seems during presentations.
The 600 series Terminators had rubber skin, we could spot them easily...

 
Joined
Aug 20, 2007
Messages
21,078 (3.40/day)
System Name Pioneer
Processor Ryzen R9 7950X
Motherboard GIGABYTE Aorus Elite X670 AX
Cooling Noctua NH-D15 + A whole lotta Sunon and Corsair Maglev blower fans...
Memory 64GB (4x 16GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s) XFX RX 7900 XTX Speedster Merc 310
Storage Intel 905p Optane 960GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s) 55" LG 55" B9 OLED 4K Display
Case Thermaltake Core X31
Audio Device(s) TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply FSP Hydro Ti Pro 850W
Mouse Logitech G305 Lightspeed Wireless
Keyboard WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software Gentoo Linux x64 / Windows 11 Enterprise IoT 2024
AI generate a shit for me to give.
 