NVIDIA Announces Tesla T4 Tensor Core GPU

btarunr · Sep 13, 2018

Fueling the growth of AI services worldwide, NVIDIA today launched an AI data center platform that delivers the industry's most advanced inference acceleration for voice, video, image and recommendation services. The NVIDIA TensorRT Hyperscale Inference Platform features NVIDIA Tesla T4 GPUs based on the company's breakthrough NVIDIA Turing architecture and a comprehensive set of new inference software.

Delivering the fastest performance with lower latency for end-to-end applications, the platform enables hyperscale data centers to offer new services, such as enhanced natural language interactions and direct answers to search queries rather than a list of possible results. "Our customers are racing toward a future where every product and service will be touched and improved by AI," said Ian Buck, vice president and general manager of Accelerated Business at NVIDIA. "The NVIDIA TensorRT Hyperscale Platform has been built to bring this to reality - faster and more efficiently than had been previously thought possible."

Every day, massive data centers process billions of voice queries, translations, images, videos, recommendations and social media interactions. Each of these applications requires a different type of neural network residing on the server where the processing takes place.

To optimize the data center for maximum throughput and server utilization, the NVIDIA TensorRT Hyperscale Platform includes both real-time inference software and Tesla T4 GPUs, which process queries up to 40x faster than CPUs alone.

NVIDIA estimates that the AI inference industry is poised to grow in the next five years into a $20 billion market.

Industry's Most Advanced AI Inference Platform
The NVIDIA TensorRT Hyperscale Platform includes a comprehensive set of hardware and software offerings optimized for powerful, highly efficient inference. Key elements include:

NVIDIA Tesla T4 GPU - Featuring 320 Turing Tensor Cores and 2,560 CUDA cores, this new GPU provides breakthrough performance with flexible, multi-precision capabilities, from FP32 to FP16 to INT8, as well as INT4. Packaged in an energy-efficient, 75-watt, small PCIe form factor that easily fits into most servers, it offers 65 teraflops of peak performance for FP16, 130 teraflops for INT8 and 260 teraflops for INT4.
NVIDIA TensorRT 5 - An inference optimizer and runtime engine, NVIDIA TensorRT 5 supports Turing Tensor Cores and expands the set of neural network optimizations for multi-precision workloads.
NVIDIA TensorRT inference server - This containerized microservice software enables applications to use AI models in data center production. Freely available from the NVIDIA GPU Cloud container registry, it maximizes data center throughput and GPU utilization, supports all popular AI models and frameworks, and integrates with Kubernetes and Docker.
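
As a rough illustration of the Tesla T4 figures quoted above, the sketch below works out how peak throughput scales as precision drops and how much peak throughput that is per watt. It uses only the numbers from the announcement; the variable and function names are illustrative, and these are theoretical peaks, not measured performance.

```python
# Peak Tensor Core throughput quoted in the announcement (tera-operations per second).
PEAK_TOPS = {"FP16": 65, "INT8": 130, "INT4": 260}
BOARD_POWER_W = 75  # board power quoted in the announcement


def scaling_vs_fp16(peak):
    """Throughput of each precision relative to FP16."""
    return {precision: value / peak["FP16"] for precision, value in peak.items()}


def peak_per_watt(peak, watts):
    """Theoretical peak tera-operations per second per watt of board power."""
    return {precision: value / watts for precision, value in peak.items()}


scaling = scaling_vs_fp16(PEAK_TOPS)
per_watt = peak_per_watt(PEAK_TOPS, BOARD_POWER_W)

for precision in PEAK_TOPS:
    print(f"{precision}: {scaling[precision]:.0f}x FP16, "
          f"{per_watt[precision]:.2f} tera-ops/W peak")
```

Each halving of operand width doubles the quoted peak, which is why INT8 and INT4 matter for inference workloads that tolerate reduced precision.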

Supported by Technology Leaders Worldwide
Support for NVIDIA's new inference platform comes from leading consumer and business technology companies around the world.

"We are working hard at Microsoft to deliver the most innovative AI-powered services to our customers," said Jordi Ribas, corporate vice president for Bing and AI Products at Microsoft. "Using NVIDIA GPUs in real-time inference workloads has improved Bing's advanced search offerings, enabling us to reduce object detection latency for images. We look forward to working with NVIDIA's next-generation inference hardware and software to expand the way people benefit from AI products and services."

Chris Kleban, product manager at Google Cloud, said: "AI is becoming increasingly pervasive, and inference is a critical capability customers need to successfully deploy their AI models, so we're excited to support NVIDIA's Turing Tesla T4 GPUs on Google Cloud Platform soon."

More information, including details on how to request early access to T4 GPUs on Google Cloud Platform, is available here.

Additional companies, including all major server manufacturers, voicing support for the NVIDIA TensorRT Hyperscale Platform include:

"Cisco's UCS portfolio delivers policy-driven, GPU-accelerated systems and solutions to power every phase of the AI lifecycle. With the NVIDIA Tesla T4 GPU based on the NVIDIA Turing architecture, Cisco customers will have access to the most efficient accelerator for AI inference workloads - gaining insights faster and accelerating time to action."
- Kaustubh Das, vice president of product management, Data Center Group, Cisco

"Dell EMC is focused on helping customers transform their IT while benefiting from advancements such as artificial intelligence. As the world's leading provider of server systems, Dell EMC continues to enhance the PowerEdge server portfolio to help our customers ultimately achieve their goals. Our close collaboration with NVIDIA and historical adoption of the latest GPU accelerators available from their Tesla portfolio play a vital role in helping our customers stay ahead of the curve in AI training and inference."
- Ravi Pendekanti, senior vice president of product management and marketing, Servers & Infrastructure Systems, Dell EMC

"Fujitsu plans to incorporate NVIDIA's Tesla T4 GPUs into our global Fujitsu Server PRIMERGY systems lineup. Leveraging this latest, high-efficiency GPU accelerator from NVIDIA, we will provide our customers around the world with servers highly optimized for their growing AI needs."
- Hideaki Maeda, vice president of the Products Division, Data Center Platform Business Unit, Fujitsu Ltd.

"At HPE, we are committed to driving intelligence at the edge for faster insight and improved experiences. With the NVIDIA Tesla T4 GPU, based on the NVIDIA Turing architecture, we are continuing to modernize and accelerate the data center to enable inference at the edge."
- Bill Mannel, vice president and general manager, HPC and AI Group, Hewlett Packard Enterprise

"IBM Cognitive Systems is able to deliver 4x faster deep learning training times as a result of a co-optimized hardware and software on a simplified AI platform with PowerAI, our deep learning training and inference software, and IBM Power Systems AC922 accelerated servers. We have a history of partnership and innovation with NVIDIA, and together we co-developed the industry's only CPU-to-GPU NVIDIA NVLink connection on IBM Power processors, and we are excited to explore the new NVIDIA T4 GPU accelerator to extend this state of the art leadership for inference workloads."
- Steve Sibley, vice president of Power Systems Offering Management, IBM

"We are excited to see NVIDIA bring GPU inference to Kubernetes with the NVIDIA TensorRT inference server, and look forward to integrating it with Kubeflow to provide users with a simple, portable and scalable way to deploy AI inference across diverse infrastructures."
- David Aronchick, co-founder and product manager of Kubeflow

"Open source cross-framework inference is vital to production deployments of machine learning models. We are excited to see how the NVIDIA TensorRT inference server, which brings a powerful solution for both GPU and CPU inference serving at scale, enables faster deployment of AI applications and improves infrastructure utilization."
- Kash Iftikhar, vice president of product development, Oracle Cloud Infrastructure

"Supermicro is innovating to address the rapidly emerging high-throughput inference market driven by technologies such as 5G, Smart Cities and IOT devices, which are generating huge amounts of data and require real-time decision making. We see the combination of NVIDIA TensorRT and the new Turing architecture-based T4 GPU accelerator as the ideal combination for these new, demanding and latency-sensitive workloads and plan to aggressively leverage them in our GPU system product line."
- Charles Liang, president and CEO, Supermicro


First Strike · Sep 13, 2018

Real intriguing, seems to be a TU104 cutdown. Cut from 48 SM to 40 SM, 6 GPC to 5 GPC. What's the point?

First Strike said:
Real intriguing, seems to be a TU104 cutdown. Cut from 48 SM to 40 SM, 6 GPC to 5 GPC. What's the point?

Oh 75W TDP, super-binned I see.
@btarunr You seemed to forget to mention the memory side of things. :confused:

Arjai · Sep 13, 2018

inference. Used 11 times, throughout this article..

"or assumed to be true"
"includes hypotheses"

BS, to me.

cucker tarlson · Sep 13, 2018

Arjai said:
inference. Used 11 times, throughout this article..

"or assumed to be true"
"includes hypotheses"

BS, to me.

why ?
don't know much about AI acceleration,but isn't it all about creating the most accurate outcome based on statistical data ?

First Strike · Sep 13, 2018

Arjai said:
inference. Used 11 times, throughout this article..
"or assumed to be true"
"includes hypotheses"

BS, to me.

Ya, you just trolled a piece of scientific jargon that is quite fundamental in the neural network AI field.
A neural network AI = training model & algorithm + inferencing algorithm. Now you called the second part BS.

notb · Sep 13, 2018

Arjai said:
inference. Used 11 times, throughout this article..

BS, to me.

"Inference" is a mathematical term. That's how we call the thing neural networks do.

Hence, it appears a lot in a text about a product designed for neural network inference.

Open a text about a gaming GPU and check how many times words "game" and "gaming" appear. Is that also BS?

ZoneDymo · Sep 13, 2018

things are getting pretty tense

techy1 · Sep 13, 2018

ZoneDymo said:
things are getting pretty tense

*ba dum tss*
:roll:

DeathtoGnomes · Sep 13, 2018

This is all fine and dandy until Dave opens the airlock.

techy1 · Sep 13, 2018

DeathtoGnomes said:
This is all fine and dandy until Dave opens the airlock.

I am sorry Dave I'm afraid I can't do that

silentbogo · Sep 13, 2018

First Strike said:
@btarunr You seemed to forget to mention the memory side of things.

16GB GDDR6 256-bit wide.
https://www.ixbt.com/news/2018/09/13/nvidia-tesla-t4.html

TheoneandonlyMrK · Sep 13, 2018

Could this be the pro incarnation of the 2060?

jabbadap · Sep 13, 2018

theoneandonlymrk said:
Could this be the pro incarnation of the 2060?

uhm, how about no? RTX 2070 has 2304 ccs this one has 2560.

Vayra86 · Sep 13, 2018

ZoneDymo said:
things are getting pretty tense

Real Tense.

Arjai said:
inference. Used 11 times, throughout this article..

"or assumed to be true"
"includes hypotheses"

BS, to me.

This is pure gold. BS to you doesn't make it BS in the real world. But it's telling - also the person who gave you a big +3 on that post doesn't surprise me one bit...

The fact is, if RT and deep learning have a right to exist, it's precisely in this segment of the market (and not gaming GPUs). You did notice this isn't a GeForce release, I hope?

medi01 · Sep 13, 2018

Something something, every day, massive, billions, even more expensive, something.

TheoneandonlyMrK · Sep 13, 2018

jabbadap said:
uhm, how about no? RTX 2070 has 2304 ccs this one has 2560.

awe did a question cause you to be sarcastic umn No, you decided to answer like a tool.

DeathtoGnomes · Sep 13, 2018

theoneandonlymrk said:
awe did a question cause you to be sarcastic umn No, you decided to answer like a tool.

come on keep it civil, no name calling, Mr. Tool. :nutkick:

jabbadap · Sep 13, 2018

theoneandonlymrk said:
awe did a question cause you to be sarcastic umn No, you decided to answer like a tool.

Well let's try this way. No it's not. 1.) it's server part not a pro(quadro) part, 2.) it has more cuda cores than 2070.

Well, one thing: it might have been a sort of 2060 if a full TU106 had 2560 CCs (which I doubt, because 3 GPCs with the Turing SM structure is 2304 CCs, and I doubt NVIDIA would change that). And there are rumors that the RTX 2070 is TU106, not a cut-down TU104, which would make the RTX 2070 the successor of the GTX 1060, not the GTX 1070.
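
For anyone following the core-count argument, here is a minimal sketch of the arithmetic. It assumes the publicly documented Turing SM layout of 64 CUDA cores and 8 Tensor Cores per SM; the SM and GPC counts are the ones quoted in this thread, and the names in the code are illustrative only.

```python
CUDA_CORES_PER_SM = 64    # assumption: Turing SM layout
TENSOR_CORES_PER_SM = 8   # assumption: Turing SM layout


def cuda_cores(sm_count):
    """CUDA core count for a given number of Turing SMs."""
    return sm_count * CUDA_CORES_PER_SM


def tensor_cores(sm_count):
    """Tensor Core count for a given number of Turing SMs."""
    return sm_count * TENSOR_CORES_PER_SM


# SM counts as discussed in the thread.
configs = {
    "Tesla T4 (40 SM)": 40,
    "full TU104 (48 SM)": 48,
    "3 GPCs x 12 SM": 3 * 12,
}

for name, sms in configs.items():
    print(f"{name}: {cuda_cores(sms)} CUDA cores, {tensor_cores(sms)} Tensor Cores")
```

With 40 SMs this reproduces the 2,560 CUDA cores and 320 Tensor Cores from the announcement, while 3 GPCs of 12 SMs each gives the 2304 figure mentioned above.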

TheoneandonlyMrK · Sep 13, 2018

DeathtoGnomes said:
come on keep it civil, no name calling, Mr. Tool.

Hope you realise that's just you ,i said Like a tool.
@jabbadap that is better ty.

Vayra86 · Sep 13, 2018

medi01 said:
Something something, every day, massive, billions, even more expensive, something.

Fantastic contribution! Thanks man, where would we be without your wisdom.

T4C Fantasy · Sep 13, 2018

jabbadap said:
uhm, how about no? RTX 2070 has 2304 ccs this one has 2560.

Yea and 2070 is TU106 which maxes at 2304

Its a fact that 2070 is TU106, not a rumor xD

Arjai · Sep 13, 2018

For the sake of toast. Will some of you take a deep breath, maybe engage brain, before instantly become some know it all prick?

BS, to me. Perhaps it is not, to you. Maybe, since everyone here has become sooooo sensitive, I should have phrased it, "No offense to you, or you, or you but, this all seems like a pile of malarkey, to me."

In fact, keep an eye out for my new phrase... :shadedshu:

cucker tarlson · Sep 13, 2018

T4C Fantasy said:
Yea and 2070 is TU106 which maxes at 2304

Its a fact that 2070 is TU106, not a rumor xD

Weird, but seems true. A full 106 with 14 Gbps GDDR6 chips will breathe down the neck of a cut 104 xx80 card this time. Unless 8 SM per GPC vs 12 on 106 will make a difference.

Arjai said:
For the sake of toast. Will some of you take a deep breath, maybe engage brain, before instantly become some know it all prick?

BS, to me. Perhaps it is not, to you. Maybe, since everyone here has become sooooo sensitive, I should have phrased it, "No offense to you, or you, or you but, this all seems like a pile of malarkey, to me."

In fact, keep an eye out for my new phrase...

If you complain that everyone's so sensitive these days, why do you seem to be the only one triggered by anything that comes from nvidia or intel ?

Fluffmeister · Sep 13, 2018

This card certainly packs a punch for something that fits in a 75 watt package.

cucker tarlson · Sep 13, 2018

8.1TFlops FP32 at 75W,crazy.
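
A quick back-of-the-envelope check of that figure, assuming the usual convention that peak FP32 throughput is 2 x CUDA cores x clock (one fused multiply-add per core per cycle, counted as two operations). The clock computed here is only what the 8.1 TFLOPS number implies, not an official specification, and the names are illustrative.

```python
CUDA_CORES = 2560        # from the announcement
PEAK_FP32_TFLOPS = 8.1   # figure quoted in the post above
BOARD_POWER_W = 75       # from the announcement
FLOPS_PER_CORE_PER_CYCLE = 2  # assumption: one FMA per cycle, counted as 2 FLOPs

# Clock needed to reach the quoted peak: peak = 2 * cores * clock.
implied_clock_ghz = (PEAK_FP32_TFLOPS * 1e12) / (
    FLOPS_PER_CORE_PER_CYCLE * CUDA_CORES
) / 1e9

# Theoretical peak efficiency at the quoted board power.
tflops_per_watt = PEAK_FP32_TFLOPS / BOARD_POWER_W

print(f"Implied clock: {implied_clock_ghz:.3f} GHz")
print(f"Peak FP32 efficiency: {tflops_per_watt:.3f} TFLOPS/W")
```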
