Tachyum Demo Shows Prodigy Will Be Faster Than NVIDIA and Intel Chips

btarunr · Aug 11, 2020

Tachyum Inc. today announced that it has successfully completed a demonstration showing its Prodigy Universal Processor running faster than any other processor, HPC or AI chip, including ones from NVIDIA and Intel. This is the latest of many recent milestones achieved by Tachyum as the company continues its march towards Prodigy's product release next year.

Using an industry-standard Verilog simulation of the actual Prodigy post-layout hardware, Tachyum demonstrated that its computational operation and the speed of its product design are superior to current competitive offerings. Not only does Prodigy execute instructions at very high speeds, but Tachyum has now implemented infrastructure that automatically checks the results from the Verilog RTL for correctness. These automated tests compare Verilog output against Tachyum's C-model, which was used to measure performance and is now the 'Golden Model' for the Verilog hardware simulation, to ensure the simulation produces identical, step-by-step results.
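To make the 'Golden Model' idea concrete, here is a minimal sketch of a step-by-step trace comparison. It is an illustration only, not Tachyum's tooling: the `Step` record, its fields and the `first_divergence` helper are hypothetical names, and a real flow would parse traces emitted by the C-model and the Verilog simulator.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Step:
    """One retired instruction in a trace (hypothetical format)."""
    pc: int      # program counter of the instruction
    reg: str     # register the instruction wrote
    value: int   # value written to that register


def first_divergence(golden: Iterable[Step], rtl: Iterable[Step]) -> Optional[int]:
    """Return the index of the first step where the traces differ, or None."""
    golden, rtl = list(golden), list(rtl)
    for i, (g, r) in enumerate(zip(golden, rtl)):
        if g != r:
            return i
    # One trace is a strict prefix of the other: they diverge where it ends.
    if len(golden) != len(rtl):
        return min(len(golden), len(rtl))
    return None
```

Reporting the first mismatching step, rather than only pass/fail, points a debugging session at the instruction where the hardware simulation first departs from the software model.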

This verification milestone dramatically increases Tachyum's productivity and its ability to test the Prodigy hardware design efficiently in order to find bugs and correct them prior to tape-out. With this latest accomplishment, Tachyum has now automated constrained random test generation, which further adds to its productivity.
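Constrained random test generation, in general terms, means producing random test programs that still obey rules the design requires. A minimal sketch follows, assuming a made-up four-opcode toy ISA with eight registers and a made-up constraint that register 0 is never written; none of these details come from Tachyum.

```python
import random

OPCODES = ("add", "sub", "and", "or")  # hypothetical toy ISA, not Prodigy's
NUM_REGS = 8                           # assumed register count


def random_program(seed: int, length: int = 10):
    """Generate a reproducible random instruction list that obeys the constraints."""
    rng = random.Random(seed)  # a fixed seed lets a failing test be replayed
    program = []
    for _ in range(length):
        op = rng.choice(OPCODES)
        dest = rng.randrange(1, NUM_REGS)  # constraint: never write register 0
        src1 = rng.randrange(NUM_REGS)
        src2 = rng.randrange(NUM_REGS)
        program.append((op, dest, src1, src2))
    return program
```

Each generated program would then be run on both the software model and the hardware simulation, with the two results compared.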

Tachyum's previous hardware design milestone was to build components and interconnect them, which was successfully completed in April. The most recent hardware design milestone, and the resulting tool, concern the Prodigy processor producing correct results and its performance on test programs. Prodigy now handles branch mispredictions and compiler mispredictions of memory dependencies: it detects them, recovers, and produces correct results.

Thanks to Tachyum's IP suppliers, the company is now able to do reads and writes from the Prodigy communications mesh to its DDR5 DIMM hardware memory models. The global clock is now connected from the PLL to Prodigy cores. RAMBIST and other manufacturability features are now integrated into the Prodigy hardware design in large part due to Tachyum's physical design partner.

"This latest hardware milestone is a testament to the diligent work of our engineering team and the vast human resources we have been able to assemble to complete a revolutionary solution never before seen," said Dr. Radoslav Danilak, Tachyum founder and CEO. "We set out to produce the highest performance, lowest energy and most cost-efficient processor for the hyperscale, HPC and AI marketplace, and these milestones are proving that we are achieving those goals. With a product that is faster than the fastest Intel Xeon or NVIDIA A100 Chips, Prodigy is nearing all of its stated objectives and remains on track to make its debut as planned next year."

Tachyum's Prodigy can run HPC applications, convolution AI, explainable AI, general AI, bio AI and spiking neural networks, as well as normal data center workloads on a single homogeneous processor platform with its simple programming model. Using CPU, GPU, TPU and other accelerators in lieu of Prodigy for these different types of workloads is inefficient. A heterogeneous processing fabric, with unique hardware dedicated to each type of workload (e.g. data center, AI, HPC), results in underutilization of hardware resources, and a more challenging programming environment. Prodigy's ability to seamlessly switch among these various workloads dramatically changes the competitive landscape and the economics of data centers.

Prodigy significantly improves computational performance, energy consumption, hardware (server) utilization and space requirements compared to existing chips provisioned in hyperscale data centers today. It will also allow Edge developers for IoT to exploit its low power and high performance, along with its simple programming model to deliver AI to the edge.

Prodigy is truly a universal processor. In addition to native Prodigy code, it also runs legacy x86, ARM and RISC-V binaries. And, with a single, highly efficient processor architecture, Prodigy delivers industry-leading performance across data center, AI, and HPC workloads. Prodigy, the company's flagship Universal Processor, will enter volume production in 2021. In April the Prodigy chip successfully proved its viability with a complete chip layout exceeding speed targets. In August the processor is able to correctly execute short programs, with results automatically verified against the software model, while exceeding the target clock speeds. The next step is to get a manufactured wholly functional FPGA prototype of the chip later this year, which is the last milestone before tape-out.

Prodigy outperforms the fastest Xeon processors at 10x lower power on data center workloads, and outperforms NVIDIA's fastest GPU on HPC, AI training and inference. 125 HPC Prodigy racks can deliver 32 tensor EXAFLOPS. Prodigy's 3X lower cost per MIPS and 10X lower power translate to a 4X lower data center Total Cost of Ownership (TCO), enabling billions of dollars of savings for hyperscalers such as Google, Facebook, Amazon, Alibaba, and others. Since Prodigy is the world's only processor that can switch between data center, AI and HPC workloads, unused servers can be used as a CAPEX-free AI or HPC cloud, because the servers have already been amortized.

For demo resources and videos, visit this page.

Vya Domus · Aug 11, 2020

Call it a gut feeling but this seems like the Theranos of silicon.

Assimilator · Aug 11, 2020

News day so slow you have to post about vapourware, huh?

TheoneandonlyMrK · Aug 11, 2020

Yet we see no proof of it actually doing anything, disappointed.

ebivan · Aug 11, 2020

When will they actually release something? So far all I hear is "We can do anything better" (but only in internal tests...)

Verpal · Aug 11, 2020

Verilog RTL.

OK.

Call me again when you got silicon sampled and start running benchmark on it.

mahirzukic2 · Aug 11, 2020

Verpal said:
Verilog RTL.

OK.

Call me again when you got silicon sampled and start running benchmark on it.

As per:

Prodigy, the company's flagship Universal Processor, will enter volume production in 2021. In April the Prodigy chip successfully proved its viability with a complete chip layout exceeding speed targets. In August the processor is able to correctly execute short programs, with results automatically verified against the software model, while exceeding the target clock speeds. The next step is to get a manufactured wholly functional FPGA prototype of the chip later this year, which is the last milestone before tape-out.

We need to wait for 2021, I would guess H1.

windwhirl · Aug 11, 2020

Vya Domus said:
Call it a gut feeling but this seems like the Theranos of silicon.

I would like it if you were wrong, it means more competition, etc., but I'm kinda getting the same vibe here :laugh:

Steevo · Aug 11, 2020

How much security is built in?

If every application were trusted and didn't have any faults processors would be simple to build.

windwhirl · Aug 11, 2020

Steevo said:
How much security is built in?

If every application were trusted and didn't have any faults processors would be simple to build.

You inb4 massive security vulnerabilities take away 90% of the performance? :roll:

Frick · Aug 11, 2020

Beating Nvidia and Intel in custom AI is ... realistic. Chips built for specific tasks are way better at that task than general purpose stuff, and if you (like Amazon) build code specific for that chip ... yeah it'll be faster than anything Intel and Nvidia has to offer, for that specific task. This thing can be faster in very specific workloads, which is the point. That it also runs "legacy x86, ARM and RISC-V binaries" is ... I have no idea if it's a good idea or not. The performance will be terrible. Is there a market for that? I have no idea. Maybe?

Anyway these guys seem to be made up by some SandForce (remember SandForce?) people and some Wave Computing (uh-oh) people. At least they're serious about it, and even though I doubt anything that isn't actual numbers from a real product it will be interesting to see how it pans out.

Also there is a myriad of AI compute companies out there right now, and I quite like it.

Some more details

Creating the Universal Processor

“If I had to calculate 100% certainty on every deal I did, I would do zero deals.” – Chip Gaines “Prodigy” is an appropriate name for the new microprocessor under development at Tachyum, a startup …

www.eejournal.com

Vya Domus · Aug 11, 2020

Frick said:
Also there is a myriad of AI compute companies out there right now, and I quite like it.

I kind of don't, they are a waste of manpower and investor cash. "AI" will be the next dot-com bubble, mark my words.

bug · Aug 11, 2020

theoneandonlymrk said:
Yet we see no proof of it actually doing anything, disappointed.

Well, it can appear in the news today, so there's that.

Frick · Aug 11, 2020

Vya Domus said:
I kind of don't, they are a waste of manpower and investor cash. "AI" will be the next dot-com bubble, mark my words.

Lots of them will go bust fo sho, but otoh it is always good that the existing dragons have some fires under them, and if the dragons buy out the upcomers (which is likely if the tech is good) it will find its way to consumers anyway (maybe). And it's still an emergent market, so lots of things can happen before history decides what the outcome turned out to be. It's easy to tell what failed after the fact. Also things rarely happen in isolation, progress here might be applicable to there.

TheoneandonlyMrK · Aug 11, 2020

bug said:
Well, it can appear in the news today, so there's that.

It was in the news the other day for claiming 10x a Xeon or Ampere at a 100th of the power or something.

In any workload, on any instruction set.

It was probably less boastful but that's still an accurate review of the last PR piece and this one.

Facts/benches=zero.

Yawn, I would love a third decent architecture, but vapour isn't worth much time.

kapone32 · Aug 11, 2020

They will sign a licence with Biostar to distribute pre-built systems for everything from Ethereum to DOTA. GOG will invest in a hardware solution for their platform (if you know what I mean) and the CPU market will become ultra competitive. Nvidia or Intel would buy one another and AMD, Sony and MS would form an alliance to respond with a 5nm APU with Zen5 CPU cores and RDNA3 GPU cores. Intel/Nvidia would come with a chip about the size of TR4 with 2 3080TIs, 64 PCIe 4.0 lanes and 24 7nm CPU cores that run at a stable 6.7 GHz with a 7 GHz boost. Of course all of this is as possible as building a livable colony on the Moon.

Steevo · Aug 11, 2020

windwhirl said:
You inb4 massive security vulnerabilities take away 90% of the performance?

Yep. Cache handling, predictive branch speculation and symmetric threads all need hardware management that can communicate locks to programs, define addressing spaces for each and manage security in tandem with the operating system, so I don't see this happening.

Want to see how much time each thread/core spends waiting? Those metrics are available in Windows. The only reason consoles, for example, run slightly to moderately faster than PCs is software trust and closed ecosystems. But no one is going to tell me that chip A @ 4 GHz is 10 times faster than chip B @ 4 GHz in a branch-heavy, dependent, out-of-order serial workload. The only programs that run faster with more cores are those whose workloads don't depend on current results.

Wshlist · Aug 11, 2020

Seems a fair company in the sense that they don't mention AMD and nicely avoid that comparison

Also don't see Google's AI chips mentioned come to think of it.

It's a pity though that they flirt with the DoD and 'intelligence orgs' needs.

aQi · Aug 11, 2020

That's like someone pretending to be what they are not.
Without anything “solid” I can't agree to any of that.
Releasing next year? Sure, take 100, just make sure to bring silicon samples with you.

TheoneandonlyMrK · Aug 11, 2020

Wshlist said:
Seems a fair company in the sense that they don't mention AMD and nicely avoid that comparison
Also don't see Google's AI chips mentioned come to think of it.

It's a pity though that they flirt with the DoD and 'intelligence orgs' needs.

I'm not bothered who they compare with, just show some proof with the claims.

AsRock · Aug 12, 2020

Frick said:
Beating Nvidia and Intel in custom AI is ... realistic. Chips built for specific tasks are way better at that task than general purpose stuff, and if you (like Amazon) build code specific for that chip ... yeah it'll be faster than anything Intel and Nvidia has to offer, for that specific task. This thing can be faster in very specific workloads, which is the point. That it also runs "legacy x86, ARM and RISC-V binaries" is ... I have no idea if it's a good idea or not. The performance will be terrible. Is there a market for that? I have no idea. Maybe?

Anyway these guys seem to be made up by some SandForce (remember SandForce?) people and some Wave Computing (uh-oh) people. At least they're serious about it, and even though I doubt anything that isn't actual numbers from a real product it will be interesting to see how it pans out.

Also there is a myriad of AI compute companies out there right now, and I quite like it.

Oh, I remember those: the first and ONLY SSD to fail on me.

ExcuseMeWtf · Aug 12, 2020

March next year seems HIGHLY optimistic, if they only have that to show.
