Friday, June 26th 2015
AMD Didn't Get the R9 Fury X Wrong, but NVIDIA Got its GTX 980 Ti Right
This has been a roller-coaster month for high-end PC graphics. The timing of NVIDIA's GeForce GTX 980 Ti launch had us putting the finishing touches on its review before our bags for Taipei were even packed. When it launched, the GTX 980 Ti set AMD a performance target and a price target. Then began a three-week wait for AMD to launch its Radeon R9 Fury X graphics card. The dance is done, the dust has settled, and we know who has won - nobody. AMD didn't get the R9 Fury X wrong, but NVIDIA got its GTX 980 Ti right. At best, this stalemate yielded a 4K-capable single-GPU graphics option from each brand at $650. You already had those in the form of the $650-ish Radeon R9 295X2, or a pair of GTX 970 cards. Those with no plans for a 4K display already had great options in the form of the GTX 970 and the price-cut R9 290X.
The Radeon R9 290 series launch from Fall 2013 stirred up the high-end graphics market in a big way. The $399 R9 290 made NVIDIA look comically evil for asking $999 for the card it beat, the GTX TITAN; the R9 290X remained the fastest single-GPU option, at $550, until NVIDIA launched the $699 GTX 780 Ti to get people back to paying through their noses for the extra performance. Then there were two UFO sightings in the form of the GTX TITAN Black and the GTX TITAN-Z, which made no tangible contributions to consumer choice. Sure, they gave you full double-precision floating point (DPFP) performance, but DPFP is of no use to gamers. So what could have been the calculation at AMD and NVIDIA as June 2015 approached? Here's a theory.
Image credit: Mahspoonis2big, Reddit
AMD's HBM Gamble
The "Fiji" silicon is formidable. It made performance/Watt gains over "Hawaii," despite a lack of significant shader architecture performance improvements between GCN 1.1 and GCN 1.2 (at least nowhere of the kind between NVIDIA's "Kepler" and "Maxwell.") AMD could do a 45% increase in stream processors for the Radeon R9 Fury X, at the same typical board power as its predecessor, the R9 290X. The company had to find other ways to bring down power consumption, and one way to do that, while not sacrificing performance, was implementing a more efficient memory standard, High Bandwidth Memory (HBM).
Implementing HBM, right now, is not as easy as GDDR5 was when it was new. HBM is more efficient than GDDR5, but it trades clock speed for bus width, and a wider bus entails more pins (connections), which would have meant an insane amount of PCB wiring around the GPU in AMD's case. The company had to co-develop the industry's first mass-producible interposer (a silicon die that acts as a substrate for other dies), relocate the memory to the GPU package, and still make do with the design limitation of first-generation HBM capping out at 1 GB (8 Gb) per stack, or 4 GB across the four stacks on AMD's silicon, after having laid down a 4096-bit wide memory bus. This was a bold move.
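To put that clock-speed-for-bus-width trade in rough numbers, here's a quick back-of-the-envelope sketch in Python. The bus widths and per-pin data rates are the publicly listed figures for each card (500 MHz HBM, DDR, so 1 Gbps per pin; 1250 MHz GDDR5, so 5 Gbps per pin); treat the arithmetic as illustrative, not a measured result.

# Peak theoretical memory bandwidth = bus width (bits) * per-pin data rate (Gbps) / 8
def peak_bandwidth_gbs(bus_width_bits: int, data_rate_gbps: float) -> float:
    """Peak theoretical bandwidth in GB/s."""
    return bus_width_bits * data_rate_gbps / 8

hbm_fiji = peak_bandwidth_gbs(4096, 1.0)      # four 1024-bit HBM stacks at 1 Gbps/pin
gddr5_hawaii = peak_bandwidth_gbs(512, 5.0)   # "Hawaii's" 512-bit GDDR5 at 5 Gbps/pin

print(f"Fiji HBM:     {hbm_fiji:.0f} GB/s")      # 512 GB/s
print(f"Hawaii GDDR5: {gddr5_hawaii:.0f} GB/s")  # 320 GB/s

So despite running at a fraction of GDDR5's clock speed, the sheer width of the bus puts Fiji's peak bandwidth well ahead - which is exactly the trade described above.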
Reviews show that 4 GB of HBM isn't Fiji's Achilles' heel. The card still competes in the same league as the 6 GB GTX 980 Ti at 4K Ultra HD (the resolution most taxing on video memory), where it is just 2% slower. Its performance/Watt is significantly higher than the R9 290X's. We reckon this outcome would have been impossible had AMD not gambled on HBM and instead stuck to the 512-bit wide GDDR5 interface of "Hawaii," just as it stuck to a similar front-end and render back-end configuration (the front-end is similar to that of "Tonga," while the ROP count is the same as "Hawaii's").
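For reference, here's how such a performance-per-Watt comparison works out. The relative-performance and board-power figures below are illustrative assumptions for the sake of the arithmetic, not measured review data:

# Rough perf/Watt comparison with placeholder numbers:
# assume the Fury X delivers ~1.4x the 290X's 4K performance (normalized),
# at typical board powers of ~275 W and ~290 W respectively.
def perf_per_watt(relative_perf: float, board_power_w: float) -> float:
    return relative_perf / board_power_w

fury_x = perf_per_watt(1.40, 275.0)
r9_290x = perf_per_watt(1.00, 290.0)

print(f"Fury X vs 290X perf/Watt: {fury_x / r9_290x:.2f}x")  # ~1.48x

Under those assumptions, a card that is faster at slightly lower board power comes out roughly 1.5x ahead on perf/Watt, even with no big shader-architecture gains - the efficiency has to come from somewhere, and HBM is a plausible part of the answer.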
NVIDIA Accelerated GM200
NVIDIA's big "Maxwell" silicon, the GM200, wasn't expected to come out as soon as it did. The GTX 980 and the 5 billion-transistor GM204 silicon are just 9 months old in the market, NVIDIA has sold a lot of these; and given how the company milked its predecessor, the GK104, for a year in the high-end segment before bringing out the GK110 with the TITAN; something similar was expected of the GM200. Its March 2015 introduction - just six months following the GTX 980 - was unexpected. What was also unexpected, was NVIDIA launching the GTX 980 Ti, as early as it did. This card has effectively cannibalized the TITAN X, just 3 months post its launch. The GTX TITAN X is a halo product, overpriced at $999, and hence not a lot of GM200 chips were expected to be in production. We heard reports throughout Spring, that launch of a high-volume, money-making SKU based on the GM200 could be expected only after Summer. As it turns out, NVIDIA was preparing a welcoming party for the R9 Fury X, with the GTX 980 Ti.
The GTX 980 Ti was more likely designed with R9 Fury X performance, rather than a target price, as the pivot. The $650 price tag is likely something NVIDIA came up with later, after achieving a performance lead over the R9 Fury X by stripping down the GM200 as far as it could while still getting there. How NVIDIA figured out R9 Fury X performance ahead of time is anybody's guess. It's more likely that the R9 Fury X would have been priced differently if the GTX 980 Ti weren't around than the other way around.
Who Won?
Short answer - nobody. The high-end graphics card market isn't as shaken up as it was right after the R9 290 series launch. The "Hawaii" twins held their own and continued to offer great bang for the buck until NVIDIA stepped in with the GTX 970 and GTX 980 last September. $300 gets you not much more than it did a month ago. At least now you have a choice between the GTX 970 and the R9 390 (which appears to have caught up); at $430, the R9 390X offers competition to the $499 GTX 980; and then there are leftovers from the previous generation, such as the R9 290 series and the GTX 780 Ti, but these aren't really the high-end we were looking for. It was a joy to watch the $399 R9 290 dethrone the $999 GTX TITAN in Fall 2013, as people upgraded their rigs for Holiday 2013. We didn't see that kind of spectacle this month. There is a silver lining, though: a rather big gap between the GTX 980 and GTX 980 Ti is just waiting to be filled.
Hopefully July will churn out something exciting (and bona fide high-end) around the $500 mark.
223 Comments on AMD Didn't Get the R9 Fury X Wrong, but NVIDIA Got its GTX 980 Ti Right
Obviously if some are getting 49°C and others are getting 60°C in open test benches the claim of <50°C typical load temps isn't true. Since, you know, typically people put their computers in actual computer cases...typically. Yep, and I can say a Prius is the fastest car in the world if I cherry pick tests. I'd be lying, but I could do it.
There is cherry-picking tests and showing those in press releases, and then there is flat-out making false claims without showing where you are getting those claims from. If you say you have the fastest GPU in the world with no further explanation, it had better damn well be faster than every other GPU overall, not just in cherry-picked tests; otherwise it is a lie. Now, if they said they have the fastest GPU in the world based on 3DMark Firestrike results, then that would be fine. And if they say it is the most efficient GPU in the world - and there is no debating that one is a lie - then it had better give the best performance per watt of anything on the market, and the 980Ti simply crushes the Fury X in that department.
I'm not saying the Fury X is better than the 980ti, because it's not, for all the reasons you say (if I was in the market for a high end GPU I'd get the Fury X for the hell of it). But to say AMD is lying and berating them for it is overreacting. If you take a company's PR speak literally, I consider that to be your problem. AMD is doing it too much, sure that I can agree with (because they totally do), but again, if you take every PR slide at face value that is indeed your problem.
I mean, look at the 980ti and Fury X at 4K - they usually are around 40-45 FPS avg with higher settings. No thanks, I'd rather play Full HD or 1440p with everything maxed while keeping min FPS at around 60, than take what the current high-end can offer at 4K.
In 2 years we might get to that point, and that FreeSync/G-Sync nonsense might even be sorted out by then.
I'm sure Nvidia and AMD had a dual gpu stand off some time back when Nvidia claimed they had the fastest dual card but it really was head to head. Both sides have done it.
Their biggest fat lie was the part where they said it was an overclocker's dream. Now that WAS a lie, unless you class a nightmare as a dream. The overclock is what is driving me to a 980ti.
And who knows what the ASIC quality is on the one they gave him? Well...besides him.
And then there's this: www.guru3d.com/articles_pages/amd_radeon_r9_fury_x_review,37.html
Their sample did 1125MHz. So we can also guesstimate W1z's might do 1275MHz. That's what you call an overclocker's dream. I know, I am one. And I still insist it'll go higher than that actually. Someone, with the skills, will push one well over 1400MHz. Mark my words.
Direct quote, courtesy of @HumanSmoke's link. Why do you feel the illogical need to jump to attack what I said?
All the reviews had poor overclock results. Just deal with it.
Or is it the way of AMD to release products that don't work as planned at launch? The 980ti didn't have this problem at launch. It overclocked just fine.
You're clinging onto some hope by using the tired line of, "once they get it right it'll be good". But really? Really?
Sorry man, your posts are really quick to attack what is known and evidenced thus far by ALL the reviews. I've praised the card; it's great and it's a good implementation, but for me the shitty overclock is a deal breaker.
Oh, FTR, my PowerColor 7970 LCS hit 1300 core (the other was BIOS locked at 1225 max). My 780ti hit shy of 1400. I look forward to a 980ti at 1500+.
You are also just focusing on one of the lies they told; I listed several. It is very simple: if the Fury X were faster than the 980Ti at 4K, then the Fury X would be higher than the 980Ti on this graph:
But it isn't. If their statement was true I'd expect the Fury X to have no problem topping the 980Ti in overall performance.
The overwhelming marketing impact of a new card is its launch, where it is the focus of the consumer tech world. Either AMD don't know a thing about selling their own product, or they voltage-locked the card for a reason. Maybe they are aware that adding voltage brings adverse results (a steep ramp in temps affecting GPU/HBM? overwhelming the AIO, resulting in GPU throttling and lower benchmark results than stock settings? As an engineer, don't you concede that both these outcomes might be possible?)
Bearing in mind that the overclocks achieved have been done with no voltage increase, and the power envelope has been raised in accordance with clockspeed...
...maybe AMD already know how the card acts under more extreme conditions.
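To put rough numbers on that concern: dynamic power scales approximately with frequency and the square of voltage (P ~ C * V^2 * f), so a voltage bump loads the cooler much harder than a clock bump alone. A quick sketch - the ratios are made up for illustration, not AMD's actual specs:

# Dynamic power relative to stock, given voltage and clock ratios (P ~ C * V^2 * f).
def relative_power(voltage_ratio: float, clock_ratio: float) -> float:
    """Dynamic power relative to stock settings."""
    return (voltage_ratio ** 2) * clock_ratio

# ~8% overclock at stock voltage (roughly what reviews achieved):
print(f"+8% clock, stock volts: {relative_power(1.00, 1.08):.2f}x power")  # 1.08x

# The same overclock with a hypothetical 10% voltage bump:
print(f"+8% clock, +10% volts:  {relative_power(1.10, 1.08):.2f}x power")  # ~1.31x

A ~30% jump in heat output for the same clock gain is exactly the kind of thing a fixed-capacity AIO could struggle with.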
- You weren't the only one I quoted, don't single yourself out on my account.
- If asking you questions about your reply in a straight-forward and civil manner is illogical, and/or what you call being attacked...see a psychologist/psychiatrist. You've got some issues to deal with.
- Poor overclocking results is a conclusion reached by someone who has no real idea how this works.
- Yes really! Really!
- ALL the reviews support my statements. It's you who's misinterpreting them.
- 7970 liquid cooled @ 1300 core...meh...not bad. Sounds pretty typical.
- 7970 BIOS locked @ 1225 Max? Trying to prove you really don't know what the hell you're talking about...or what? Hint: there's no such thing as a BIOS lock on overclocking. It's a hardware thing.
- 780 Ti @ ~ 1400. Don't doubt it. Doesn't prove anything.
- 980 Ti @ 1500+? Possibly. I'd like to see it. Go for it! Then ask yourself this: How much performance have I gained relative to a similarly overclocked Fury X? And no, not a Fury X @ the same core speed. Since given equal core speeds = YOU LOSE BADLY!!!
Homework people, homework. I'm not really as big a dufus as you think I am. Or that I let on to be. It's not wise to display one's full power to just anyone and everyone. Then you could prepare yourself for what I actually have in store. And possibly reserve your best tricks for last too. My A-game is saved for ONLY when absolutely required. Shoot...I've said too much. That is all.
@HumanSmoke I'll concede that. But we're still in the realm of suppositions and assumptions here, as opposed to hypotheses. Or better yet, theories.
Theory>Hypothesis>supposition/assumption>conjecture
As a comparison...
From the Gigabyte GTX 980 Ti G1 review at HardOCP ...and Gigabyte's G1 isn't the fastest 980 Ti out there. Word is that Galaxy's HOF is an overclocking monster, while the HOF LN2 edition is shaping up as a serious contender for the Classified KPE.
BTW. Re: the relative overclocking merits. Here's Hexus's OC'ed Fury X (8.6% OC w/ no voltage adjustment) and Gigabyte's G1 980 Ti (8.9% OC w/ no voltage adjustment).
Also, I'm not a hardware or software engineer. Hell, I can't even do basic coding - shit, ANY coding. I'm a hobbyist gamer who tinkers with water cooling and likes to have a wee push at overclocking now and again.
I bow to your awesome knowledge base; I prostrate myself at your epic magnitude. And we can both watch the overclocking Halls of Fame to see which turns out to be the better card.
It's the same crap I (we) are facing at the company where I work. The workforce and the leadership seem to be entirely disconnected from one another. When you give them feedback based on experience you have with customers, they don't do shit, and 10 minutes later they again demand you improve things because results aren't good enough. I don't understand this world...
I bet it was the same at AMD...
If the board designers aren't fully aware of certain parameters, it's quite conceivable PR haven't got a clue. PR isn't technical - it's pure smoke and mirrors.