• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Works on At Least Three Radeon RX Vega SKUs, Slowest Faster than GTX 1070?

Joined
Mar 6, 2017
Messages
3,330 (1.18/day)
Location
North East Ohio, USA
System Name My Ryzen 7 7700X Super Computer
Processor AMD Ryzen 7 7700X
Motherboard Gigabyte B650 Aorus Elite AX
Cooling DeepCool AK620 with Arctic Silver 5
Memory 2x16GB G.Skill Trident Z5 NEO DDR5 EXPO (CL30)
Video Card(s) XFX AMD Radeon RX 7900 GRE
Storage Samsung 980 EVO 1 TB NVMe SSD (System Drive), Samsung 970 EVO 500 GB NVMe SSD (Game Drive)
Display(s) Acer Nitro XV272U (DisplayPort) and Acer Nitro XV270U (DisplayPort)
Case Lian Li LANCOOL II MESH C
Audio Device(s) On-Board Sound / Sony WH-XB910N Bluetooth Headphones
Power Supply MSI A850GF
Mouse Logitech M705
Keyboard Steelseries
Software Windows 11 Pro 64-bit
Benchmark Scores https://valid.x86.fr/liwjs3
That's where you have to balance the length of the pipeline vs the clock speed you want to reach, it's a balancing act. That's where good branch predictors come into play. If your branch predictor is good or at least can learn along the way much like Ryzen's branch predictor can, you can have a long pipeline and not incur a performance penalty. However, if you have a bad branch predictor like what the old Intel Pentium 4 Prescott had a long pipeline can result in a sever loss in performance.
 
Joined
Sep 2, 2011
Messages
1,019 (0.21/day)
Location
Porto
System Name No name / Purple Haze
Processor Phenom II 1100T @ 3.8Ghz / Pentium 4 3.4 EE Gallatin @ 3.825Ghz
Motherboard MSI 970 Gaming/ Abit IC7-MAX3
Cooling CM Hyper 212X / Scythe Andy Samurai Master (CPU) - Modded Ati Silencer 5 rev. 2 (GPU)
Memory 8GB GEIL GB38GB2133C10ADC + 8GB G.Skill F3-14900CL9-4GBXL / 2x1GB Crucial Ballistix Tracer PC4000
Video Card(s) Asus R9 Fury X Strix (4096 SP's/1050 Mhz)/ PowerColor X850XT PE @ (600/1230) AGP + (HD3850 AGP)
Storage Samsung 250 GB / WD Caviar 160GB
Display(s) Benq XL2411T
Audio Device(s) motherboard / Creative Sound Blaster X-Fi XtremeGamer Fatal1ty Pro + Front panel
Power Supply Tagan BZ 900W / Corsair HX620w
Mouse Zowie AM
Keyboard Qpad MK-50
Software Windows 7 Pro 64Bit / Windows XP
Benchmark Scores 64CU Fury: http://www.3dmark.com/fs/11269229 / X850XT PE http://www.3dmark.com/3dm05/5532432
That's where you have to balance the length of the pipeline vs the clock speed you want to reach, it's a balancing act. That's where good branch predictors come into play. If your branch predictor is good or at least can learn along the way much like Ryzen's branch predictor can, you can have a long pipeline and not incur a performance penalty. However, if you have a bad branch predictor like what the old Intel Pentium 4 Prescott had a long pipeline can result in a sever loss in performance.

I still love my Pentium 4 3.4 EE Gallatin :D
 
Joined
Jun 10, 2014
Messages
2,987 (0.78/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
That's where you have to balance the length of the pipeline vs the clock speed you want to reach, it's a balancing act. That's where good branch predictors come into play. If your branch predictor is good or at least can learn along the way much like Ryzen's branch predictor can, you can have a long pipeline and not incur a performance penalty. However, if you have a bad branch predictor like what the old Intel Pentium 4 Prescott had a long pipeline can result in a sever loss in performance.
In fact, you don't even need branches or cache misses for more stalls occur due to pipeline length. Code is full of data dependencies, let's say you'll have a simple calculation like this:
Code:
d = a + b + c;
e = a + d;
f = e + b;
g = f + c;
You have multiple dependencies here, which has to be resolved sequentially. Each dependency has to wait for the instruction to be completely executed, meaning the length of the pipeline will affect the length of the stall. CPUs have since the 90s tried to work around this by out-of-order execution, and longer pipelines also means the dependencies has to be executed even earlier to prevent stalls. But eventually this means that branching is going to become a even larger problem, since each misprediction causes all calculations to be discarded. So if there are dependencies after the branching, you'll not only get a larger stall because of the flushing, but also because you'll then have to execute multiple dependencies without any benefit of out-of-order execution. This is why the penalties of long pipelines and mispredictions multiply.

Skylake does in fact have better branch prediction than Ryzen, even old Sandy-Bridge does it better. But branch prediction can only help a bit, since it's basically just statistics about which conditionals usually evaluates to true and which does not. If a conditional is 99% true and 1% false, it will start guessing 99% correct after a few iterations. But if a conditional is ~50% true and ~50% false, the CPU will only guess half of them correctly, and that is in fact the theoretical maximum. If you want to improve performance beyond this, you're left with trying to reduce the penalty costs, or rewriting the software :p

And one final note; the branch predictor (and prefetcher in general) were much better in Prescott than Athlon64, but the severe penalties of the super-long pipeline outweighed the benefits of a better prefetcher. There are limits to what a good prefetcher can do, so even with the best prefetcher Intel was crushed by a much more simple design.
 
Last edited:
Joined
May 31, 2016
Messages
4,437 (1.43/day)
Location
Currently Norway
System Name Bro2
Processor Ryzen 5800X
Motherboard Gigabyte X570 Aorus Elite
Cooling Corsair h115i pro rgb
Memory 32GB G.Skill Flare X 3200 CL14 @3800Mhz CL16
Video Card(s) Powercolor 6900 XT Red Devil 1.1v@2400Mhz
Storage M.2 Samsung 970 Evo Plus 500MB/ Samsung 860 Evo 1TB
Display(s) LG 27UD69 UHD / LG 27GN950
Case Fractal Design G
Audio Device(s) Realtec 5.1
Power Supply Seasonic 750W GOLD
Mouse Logitech G402
Keyboard Logitech slim
Software Windows 10 64 bit
In fact, you don't even need branches or cache misses for more stalls occur due to pipeline length. Code is full of data dependencies, let's say you'll have a simple calculation like this:
Code:
d = a + b + c;
e = a + d;
f = e + b;
g = f + c;
And one final note; the branch predictor (and prefetcher in general) were much better in Prescott than Athlon64, but the severe penalties of the super-long pipeline outweighed the benefits of a better prefetcher. There are limits to what a good prefetcher can do, so even with the best prefetcher Intel was crushed by a much more simple design.

It wasn't about the pipeline or prefetcher mainly that intel failed. It was Intel underestimation of the clock speed and the heat and voltage that would be needed to mitigate the longer pipeline in Prescott to match Athlon's speed. Longer pipeline gave more clock speed but the heat and voltage increased also. They didn't realize it would be that much and that is why Prescott would burst into flames without good cooler which in those times were not that efficient as they are now.
After that Intel dropped netburst architecture.
 
Joined
Mar 23, 2005
Messages
4,086 (0.57/day)
Location
Ancient Greece, Acropolis (Time Lord)
System Name RiseZEN Gaming PC
Processor AMD Ryzen 7 5800X @ Auto
Motherboard Asus ROG Strix X570-E Gaming ATX Motherboard
Cooling Corsair H115i Elite Capellix AIO, 280mm Radiator, Dual RGB 140mm ML Series PWM Fans
Memory G.Skill TridentZ 64GB (4 x 16GB) DDR4 3200
Video Card(s) ASUS DUAL RX 6700 XT DUAL-RX6700XT-12G
Storage Corsair Force MP500 480GB M.2 & MP510 480GB M.2 - 2 x WD_BLACK 1TB SN850X NVMe 1TB
Display(s) ASUS ROG Strix 34” XG349C 180Hz 1440p + Asus ROG 27" MG278Q 144Hz WQHD 1440p
Case Corsair Obsidian Series 450D Gaming Case
Audio Device(s) SteelSeries 5Hv2 w/ Sound Blaster Z SE
Power Supply Corsair RM750x Power Supply
Mouse Razer Death-Adder + Viper 8K HZ Ambidextrous Gaming Mouse - Ergonomic Left Hand Edition
Keyboard Logitech G910 Orion Spectrum RGB Gaming Keyboard
Software Windows 11 Pro - 64-Bit Edition
Benchmark Scores I'm the Doctor, Doctor Who. The Definition of Gaming is PC Gaming...
Are they expensive? Hell yes!

Are they over-priced? Not at all.

So long as people keep buying them at their current prices the market will allow for Nvidia to charge more and more with each new generation.

If they were not selling and stores were left with stock on the shelves and in their warehouses, THEN they would be over-priced. That's just how free-markets work. If you don't like it, speak with your money and hope others follow suit.
Will have to respectfully disagree. Nvidia doesn't have competition in the high end and enthusiast line. Yet they price these cards by picking RipOff prices from the sky.

Intel has been doing the very same thing for many years, Over Pricing it's CPU's because Bulldozer wasn't competitive.

Look what Ryzen did to those Over Priced Intel Processors. Intel is in damage control. Has been since ZEN.

Nvidia has absolutely no measure in how to price it's High end GPU's. AMD's GPU Line Up RX480/580 aren't competitive enough. Don't confuse a company ripping people off with over prices to actual economics.

If you disagree, then we will agree to disagree.
 
Joined
May 14, 2017
Messages
23 (0.01/day)
Processor R7 3700X W/ PBO on
Motherboard TUF B550 Plus
Cooling Scythe Fuma 2
Memory 2x16 Trident Z Neo 3600 CL16
Video Card(s) EVGA RTX 3060 Ti FTW3
Storage SK Hynix P41 1TB + Crucial MX500 1TB 2.5+ Teamgroup MS30 1TB M.2+ WD Blue 4TB 5400RPM
Display(s) Dell S2721DGF, Dell P2317H
Case Cougar MX330-G
Audio Device(s) beyerdynamic DT770 PRO 32Ω, Fifine K669
Power Supply Corsair CX650M 2017
Mouse Logitech G502 Hero Wired
Keyboard Logitech G213
VR HMD Oculus Rift S
Pro Duo is not a consumer gaming card...
Nobody said that. The article even called it what it actually is: a halo product.
 
Joined
May 31, 2016
Messages
4,437 (1.43/day)
Location
Currently Norway
System Name Bro2
Processor Ryzen 5800X
Motherboard Gigabyte X570 Aorus Elite
Cooling Corsair h115i pro rgb
Memory 32GB G.Skill Flare X 3200 CL14 @3800Mhz CL16
Video Card(s) Powercolor 6900 XT Red Devil 1.1v@2400Mhz
Storage M.2 Samsung 970 Evo Plus 500MB/ Samsung 860 Evo 1TB
Display(s) LG 27UD69 UHD / LG 27GN950
Case Fractal Design G
Audio Device(s) Realtec 5.1
Power Supply Seasonic 750W GOLD
Mouse Logitech G402
Keyboard Logitech slim
Software Windows 10 64 bit
Nobody said that. The article even called it what it actually is: a halo product.
Nobody said also that it will be. Halo means more of a top notch product best of whatever is offered by AMD but it doesn't mean it must be a consumer card. Might as well be business purpose or both. Who knows.
 
Top