That's where you have to balance pipeline length against the clock speed you want to reach; it's a balancing act, and it's where good branch predictors come into play. If your branch predictor is good, or at least can learn along the way much like Ryzen's branch predictor can, you can have a long pipeline without incurring a performance penalty. However, with a bad branch predictor like the one in the old Intel Pentium 4 Prescott, a long pipeline can result in a severe loss in performance.
In fact, you don't even need branches or cache misses for pipeline length to cause stalls. Code is full of data dependencies. Say you have a simple calculation like this:
Code:
d = a + b + c;
e = a + d;
f = e + b;
g = f + c;
You have multiple dependencies here, which have to be resolved sequentially. Each dependent instruction has to wait for the previous one to be completely executed, so the length of the pipeline affects the length of the stall. Since the 90s, CPUs have tried to work around this with out-of-order execution, and longer pipelines also mean the dependencies have to be executed even earlier to prevent stalls. But eventually this makes branching an even larger problem, since each misprediction causes all those in-flight calculations to be discarded. So if there are dependencies after the branch, you not only get a larger stall because of the flushing, but also because you then have to execute multiple dependencies without any benefit of out-of-order execution. This is why the penalties of long pipelines and mispredictions multiply.
Skylake does in fact have better branch prediction than Ryzen; even old Sandy Bridge does it better. But branch prediction can only help so much, since it's basically just statistics about which conditionals usually evaluate to true and which do not. If a conditional is 99% true and 1% false, the predictor will start guessing 99% correctly after a few iterations. But if a conditional is ~50% true and ~50% false, the CPU will only guess half of them correctly, and that is in fact the theoretical maximum. If you want to improve performance beyond this, you're left with trying to reduce the penalty costs, or rewriting the software.
And one final note: the branch predictor (and prefetcher in general) was much better in Prescott than in the Athlon64, but the severe penalties of the super-long pipeline outweighed the benefits of the better prefetcher. There are limits to what a good prefetcher can do, so even with the best prefetcher, Intel was crushed by a much simpler design.