Thursday, January 19th 2023
NVIDIA GeForce RTX 4060 Ti Possible Specs Surface—160 W Power, Debuts AD106 Silicon
NVIDIA's next GeForce RTX 40-series "Ada" graphics card launch is widely expected to be the GeForce RTX 4070 (non-Ti), and as we approach Spring 2023, the company is expected to ramp up to the meat of its new generation with the xx60 segment, beginning with the GeForce RTX 4060 Ti. This new performance-segment SKU debuts the 4 nm "AD106" silicon. A set of leaks by kopite7kimi, a reliable source for NVIDIA leaks, sheds light on possible specifications.
The RTX 4060 Ti is based on the AD106 silicon, which is expected to be much smaller than the AD104 powering the RTX 4070 series. The reference board developed at NVIDIA, codenamed PG190, is reportedly tiny, and yet it features the 16-pin ATX 12VHPWR connector. This is probably set for 300 W at its signal pins, and adapters included with graphics cards could convert two 8-pin PCIe connectors into one 300 W 16-pin connector. The RTX 4060 Ti is expected to come with a typical graphics power value of 160 W.

At this point we don't know whether the RTX 4060 Ti maxes out the AD106, but its rumored specs read as follows: 4,352 CUDA cores across 34 streaming multiprocessors (SM), 34 RT cores, 136 Tensor cores, 136 TMUs, and an unknown ROP count. The GPU is expected to feature a 128-bit wide GDDR6/X memory interface, and 8 GB could remain the standard memory size. NVIDIA is expected to use JEDEC-standard 18 Gbps GDDR6 memory, which should yield 288 GB/s of memory bandwidth. It will be very interesting to see how much faster the RTX 4060 Ti is than its predecessor, the RTX 3060 Ti, given that it has barely two-thirds the memory bandwidth. NVIDIA has made several architectural improvements to the memory sub-system with "Ada," and the AD106 is expected to get a large 32 MB L2 cache.
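As a quick sanity check on those numbers, here is a minimal sketch of how the 288 GB/s figure follows from the rumored bus width and data rate; the RTX 3060 Ti values (256-bit, 14 Gbps GDDR6, 448 GB/s) are the predecessor's published specs rather than part of the leak:

    # Peak GDDR6 bandwidth (GB/s) = bus width (bits) / 8 * per-pin data rate (Gbps)
    def gddr6_bandwidth(bus_width_bits: int, data_rate_gbps: float) -> float:
        """Peak memory bandwidth in GB/s for a GDDR6 interface."""
        return bus_width_bits / 8 * data_rate_gbps

    rtx_4060_ti = gddr6_bandwidth(128, 18)  # rumored: 128-bit at 18 Gbps
    rtx_3060_ti = gddr6_bandwidth(256, 14)  # predecessor: 256-bit at 14 Gbps

    print(rtx_4060_ti)                # 288.0 GB/s
    print(rtx_3060_ti)                # 448.0 GB/s
    print(rtx_4060_ti / rtx_3060_ti)  # ~0.64, i.e. "barely two-thirds"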
Sources:
kopite7kimi (Twitter), VideoCardz
164 Comments on NVIDIA GeForce RTX 4060 Ti Possible Specs Surface—160 W Power, Debuts AD106 Silicon
It is getting harder and harder to shrink transistors, and R&D costs have gone up significantly as well. But 7 nm to 5 nm is still just a 60% increase, not 113%.
3 nm wafers are said to cost $20k. By comparison, 28 nm wafers cost around $3k a decade ago.
Costs are going up, but not to the degree NVIDIA and AMD are trying to convince us they are.
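To put rough numbers on that, here's a back-of-the-envelope sketch using only the wafer prices quoted above; the density multiplier is a hypothetical placeholder, since the actual 28 nm-to-3 nm density gain isn't given here:

    # Rough cost-per-transistor check using the wafer prices quoted above.
    # Same wafer size assumed; density_gain is a hypothetical input, not a confirmed figure.
    WAFER_COST_28NM = 3_000   # USD, quoted figure from ~a decade ago
    WAFER_COST_3NM = 20_000   # USD, quoted figure for 3 nm

    def cost_per_transistor_change(old_cost: float, new_cost: float, density_gain: float) -> float:
        """How the cost of one transistor changes when wafer price and transistor
        density both change (transistors per wafer scale with density)."""
        return (new_cost / old_cost) / density_gain

    # Example: with a hypothetical 10x density improvement between the two nodes,
    # the per-transistor cost would actually drop:
    print(cost_per_transistor_change(WAFER_COST_28NM, WAFER_COST_3NM, density_gain=10))  # ~0.67

Whether per-transistor cost really went up or down hinges on that density figure, which is why wafer prices alone don't tell the whole story.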
But I feel there's something else at play here. Technically, with the 4080 and 4070 Ti they started cannibalizing/undercutting the previous generation, yet some people think it's bad value. And by not announcing "official" price cuts they also keep AIBs happy-ish, and let them keep the margins on the old-gen models for those who prefer to save some cash and go for the 3080, which in some places has similar perf/$ to the 4070 Ti.
None of the "mitigating circumstances" in Nvidia's favor are any different from any other generational/process change, yet in the Kepler/Maxwell example they cut prices rather than raising them.
www.techpowerup.com/298151/nvidia-ceo-confirms-rtx-40-series-ada-reveal-in-september-launch-aimed-at-not-cannibalizing-ampere The first products announced were the 4090, 4080 16GB, and 4080 12GB (later 4070Ti), and at least 2 out of those 3 undercut 3000 series models.
I wasn't saying they didn't cannibalize in the past, but some people in this thread are implying that the 4060Ti will bring very little to the table and I'm trying to show reasons to be a little bit more optimistic.
My apologies for not being clear enough before.
If the 4060 Ti is $400, following the 3060 Ti, we have a $400 gap for a single card to fill - the 4070. Not even plausible.
If the 4060 Ti is $500, that is still a massive jump to the 4070, and another pretty big one to the 4070 Ti. Even so, where does that leave those of us who don't want to pay massive amounts of money? Used/older GPUs. Which should be well below MSRP, but due to the lack of replacements, are not. Heck, even a quick PCPartPicker search shows 0 models of the 3060 available at MSRP at the moment.
If there are no replacements, prices will not fall. This will simply begin a vicious cycle of people being priced out of PC gaming and onto consoles, leading to fewer GPUs sold, fewer available, rinse and repeat.
That is saying nothing about older GPUs lacking modern features - the most popular ones out there (1660, RX 580, 1060) are locked out of DLSS, and if there are no sub-$300 Ada cards? DLSS 3 will only exist to excuse crappy optimization, not help those who actually NEED more frames.
But while we're in "fantasy land", please follow me:
- 3080 (700~750$) > 3080Ti > 3090 > 3090Ti (1500$) > 4080 (1200$)
- 3070Ti (600$) > 3080 > 3080Ti > 3090 (~950$?) > 4070Ti (800$)
Looking at this example, going by the naming scheme, although prices increased between gens, the new cards also "overtake" three of the previous models (I'm putting the 3080 10 GB and 12 GB together). So then:
- 3060Ti (~400$) > 3070 (500$) > 3070Ti (600$) > 3080 (700~750$) > 4060Ti (500$ ???)
500$ for 3080 performance is still a bit more than I would consider spending, but otherwise "reasonable". Also, I think that could be too much to ask from a 160 W card, but dreaming is still free! That's why I'm somewhat confident it should match the 3070Ti, and I hope in that case they'll ask for less than 500$.
In this scenario the 4060 could go for ~400$ and should slot above the 3070. As for an RTX 4050 / 4050 Ti: seeing how they botched the 3050 (slower than the 2060), I don't expect much; AMD will probably be the one to cater to that crowd.
If they do go ahead with the 4050, just for fun, how much VRAM would it get and how wide would the bus be? 6 GB at 96-bit? :roll:
Have you looked at the expected specs of the 3060 vs the Ti model? Something goofy there: the 3060 is shown as outperforming the Ti by having nearly double the clock speed.
Manufacturers should consider following Intel's example and not necessarily go to a smaller node for each generation if they can still work with the one they're on and make improvements. I may hate Intel, but it's incredible how they managed to pull performance gains seemingly out of thin air at 10 nm over and over and over again. Intel showed that it can be done, so what's stopping everyone else from doing it?
Here's a perfect example of what would have been better:
AMD created the RX 6500 XT which, while a far better deal than anything nVidia had at that price point, was a dog. The previous-generation's RX 5500 XT was a superior card all-around because it wasn't limited to PCI-Express x4 and it had encoders. It was built on TSMC's 7nm process and had an MSRP of $169. With the pandemic and the silicon shortage, it would've been far better to continue production of the RX 5500 XT instead of wasting 6nm allocation (which was more expensive) on a card with that level of performance.
The same could've especially been said for the RX 5700 XT in the place of the RX 6600. Could you imagine just how many of those cards AMD could've sold during the mining craze if they were in production? If I were AMD, I would've been scrambling to find a way to produce the RX 5700 XT again, even if it meant going to GloFo. That would've alleviated so much of the strain on the new stock because the preferred mining cards were the RX 5700 XT and the RTX 3060 Ti.
We see nVidia still producing the GTX 1600-series and that makes a lot of sense. When availability is an issue, continuing to produce older models just makes sense, especially since the older models will be lower-end. Hell, I wouldn't even mind it if models were re-branded and sold at a lower price. The RX 6600's MSRP was $329 while the RX 5700 XT's MSRP was $399. Ignoring the mining craze, if AMD re-branded the 5700 XT as the 6600 and sold it at $300, people would've gone nuts for it. Even reviewers would've accepted it because while, sure, it was just a re-branded last-gen part, the price would've fallen by $100. The 7nm process would've been more mature and yields would've most likely been even better than before, only adding to the other economic benefits.
For all those weirdos saying "But, but, ray-tracing!" I would point out that all RT performance below the RTX 3070 is essentially garbage so who cares? There were a lot of gaps in availability that could've easily been filled by previous-gen silicon. It would've been cheaper and easier to produce and people would've been happy that they were getting reasonable performance for a great price.
If AMD decided to re-brand the RX 6800 and RX 6800 XT respectively as the RX 7600 and RX 7600 XT and sold them for $350-$400 each, I don't think that anyone could complain as long as they were transparent about it. It would also free up more of the new node capacity for the higher-tier products. There would still be plenty of use for the imperfect dice because they'd be perfect for mobile GPUs as the smaller nodes are more power-efficient, something that isn't a huge deal on the desktop. I think that this would be a great answer to the lack of stock issue, would increase sales and profits dramatically and would help AMD avoid pitfalls like the RX 6500 XT. After all, nobody complained when the HD 7970 was re-branded as the R9-280X because AMD admitted it openly. In fact, they dropped the price of all cards that were branded as the HD 7970, added three free games to choose per card and that's why I bought two of them at the same time (well, also for ARMA III). :laugh:
It's not like it hasn't been done before and it's such a simple concept that I can't believe I'm the first one to bring it up, but here we are. :rolleyes: I think that it mostly boils down to this. ;)
Overall it seems like they need to scale down GPUs, at least the ones for gaming. Ada is great for professional applications, and they can charge thousands of dollars for those cards.
But gaming GPUs should go back to 250 W for the top-end model. Progress in visuals is so slow now, but they are still trying to push it; games take forever to develop, and most companies are starting to have financial trouble.
The gaming industry is in a really bad place right now. We need affordable hardware and games that are actually polished and fun to play. Graphics are the least important thing in gaming, and it's all going in the wrong direction.
We wouldn't have this issue of low stock anymore either because lots of people would be buying the previous-gen cards under their new branding, leaving the high-end new-gen cards for the fools and the fanboys (what I call "early adopters"). :laugh:
The other main issue is feature lockout. RT is not an issue (no low-end card has any business playing in that market), but things like DLSS are. Low-end is where that kind of stuff is actually needed.
Edit:
They can also rebrand the RTX 2060 as the RTX 4050, since it's still an upgrade over the RTX 3050...
What we have now is clearly no longer working and this would alleviate the biggest problem that we're facing, price.