• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD to Unveil Next-Generation APUs on November 11

Joined
Sep 30, 2013
Messages
14 (0.00/day)
AMD needs to address TDP issues for both their CPU and GPU. It's way too high, which is why they are losing to nVidia badly in the mobile market.

Got any recent data that shows that "they are losing to nVidia badly in the mobile market" ?

John Peddie


(AMD).. APUs declined 9.6% from Q1 and increased an astounding 47.1% in notebooks. The company’s overall PC graphics shipments increased 10.9%.
Nvidia’s desktop discrete shipments were down 8.9% from last quarter; and, the company’s mobile discrete shipments decreased 7.1%
 
Last edited:
Joined
Mar 10, 2010
Messages
11,878 (2.20/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
AMD needs to address TDP issues for both their CPU and GPU. It's way too high, which is why they are losing to nVidia badly in the mobile market.

I find it strange that members with little to say all year can be assed throwing a little bait in Amd threads.
cheers for the input though Dwade however , this being TPU , most readers here fecked efficiency right out the window on day 1 of their new build or old rebuild when they turned Eist / cool and quite off and all other eco features off then overclocked the snot out of it and left it like that eternally or until instability shows up only to step it back a bit:confused:

MOOOOOAAAARR POWERSSSS not less pls:D
 
Joined
Apr 16, 2010
Messages
2,070 (0.39/day)
System Name iJayo
Processor i7 14700k
Motherboard Asus ROG STRIX z790-E wifi
Cooling Pearless Assasi
Memory 32 gigs Corsair Vengence
Video Card(s) Nvidia RTX 2070 Super
Storage 1tb 840 evo, Itb samsung M.2 ssd 1 & 3 tb seagate hdd, 120 gig Hyper X ssd
Display(s) 42" Nec retail display monitor/ 34" Dell curved 165hz monitor
Case O11 mini
Audio Device(s) M-Audio monitors
Power Supply LIan li 750 mini
Mouse corsair Dark Saber
Keyboard Roccat Vulcan 121
Software Window 11 pro
Benchmark Scores meh... feel me on the battle field!
funny thing is that as bad as the reputation for the 9590s have been, Intel are no better when clocked to 5Ghz..

http://pctuning.tyden.cz/ilustrace3/obermaier/4770K/scaling_sandy.pnghttp://i.imgur.com/gzBNZwN.png

With Haswell things are not much better.. +18% clock costs over 60% more power (3900 to 4600mhz).. and that last 400Mhz is going to cost another 50%+ in power consumption to reach 5Ghz.

fair enough.... gotta admit though even if it was intel or nvidia in that picture.... it still would've been hilarious :laugh: And anyone who buy that level of cpu dont care about power consumption and tend to water cool so its all good
 
Joined
May 18, 2010
Messages
3,427 (0.64/day)
System Name My baby
Processor Athlon II X4 620 @ 3.5GHz, 1.45v, NB @ 2700Mhz, HT @ 2700Mhz - 24hr prime95 stable
Motherboard Asus M4A785TD-V EVO
Cooling Sonic Tower Rev 2 with 120mm Akasa attached, Akasa @ Front, Xilence Red Wing 120mm @ Rear
Memory 8 GB G.Skills 1600Mhz
Video Card(s) ATI ASUS Crossfire 5850
Storage Crucial MX100 SATA 2.5 SSD
Display(s) Lenovo ThinkVision 27" (LEN P27h-10)
Case Antec VSK 2000 Black Tower Case
Audio Device(s) Onkyo TX-SR309 Receiver, 2x Kef Cresta 1, 1x Kef Center 20c
Power Supply OCZ StealthXstream II 600w, 4x12v/18A, 80% efficiency.
Software Windows 10 Professional 64-bit
In the same boat,.. X6 1100T @ 4.2Ghz, see no reason to upgrade..

I can say the same thing with an Athlon II X4 @ 3.6Ghz.

AMD needs to address TDP issues for both their CPU and GPU. It's way too high, which is why they are losing to nVidia badly in the mobile market.

Hence why AMD/ATI currently have a larger market share than Nvidia in the mobile market.
 
Last edited:
Joined
Dec 6, 2012
Messages
148 (0.03/day)
Processor Ryzen 5 2600X
Motherboard ASRock X470 Taichi
Cooling Wraith Max HSF
Memory 2 x 8GB G.Skill FlareX @ 3400 MT/s CL14
Video Card(s) EVGA GTX 1080 Ti
Storage Samsung 970 Evo 250GB/Western Digital SN550 Blue 1TB/Crucial MX500 500GB (Ubuntu)/Toshiba 2TB HDD
Display(s) LG 27UD68-P
Case Fractal Design Define R6 TG
Power Supply EVGA SuperNOVA G2 750W
Mouse Razer Deathadder Chroma
Keyboard Corsair K75 (Cherry Brown Switches)
funny thing is that as bad as the reputation for the 9590s have been, Intel are no better when clocked to 5Ghz..

http://pctuning.tyden.cz/ilustrace3/obermaier/4770K/scaling_sandy.pnghttp://i.imgur.com/gzBNZwN.png

With Haswell things are not much better.. +18% clock costs over 60% more power (3900 to 4600mhz).. and that last 400Mhz is going to cost another 50%+ in power consumption to reach 5Ghz.

Not doubting the accuracy of this, but it's a little hard to take these charts seriously when they can't even spell what they're measuring.
 
Joined
Oct 4, 2013
Messages
58 (0.01/day)
I can say the same thing with an Athlon II X4 @ 3.6Ghz.


Didn't we (customers) wanted our CPU to lastttt long? Funny, enough some complaint AMD socket, too! :banghead:

I gotta said, my hat is down for AMD lonnng last as good enough CPU! :respect:
 
Joined
May 18, 2010
Messages
3,427 (0.64/day)
System Name My baby
Processor Athlon II X4 620 @ 3.5GHz, 1.45v, NB @ 2700Mhz, HT @ 2700Mhz - 24hr prime95 stable
Motherboard Asus M4A785TD-V EVO
Cooling Sonic Tower Rev 2 with 120mm Akasa attached, Akasa @ Front, Xilence Red Wing 120mm @ Rear
Memory 8 GB G.Skills 1600Mhz
Video Card(s) ATI ASUS Crossfire 5850
Storage Crucial MX100 SATA 2.5 SSD
Display(s) Lenovo ThinkVision 27" (LEN P27h-10)
Case Antec VSK 2000 Black Tower Case
Audio Device(s) Onkyo TX-SR309 Receiver, 2x Kef Cresta 1, 1x Kef Center 20c
Power Supply OCZ StealthXstream II 600w, 4x12v/18A, 80% efficiency.
Software Windows 10 Professional 64-bit
I can say the same thing with an Athlon II X4 @ 3.6Ghz.


Didn't we (customers) wanted our CPU to lastttt long? Funny, enough some complaint AMD socket, too! :banghead:

I gotta said, my hat is down for AMD lonnng last as good enough CPU! :respect:

100% agree. When AMD was churning out CPUs they are saying slow down we just upgraded, damn corporate greed milking the consumer. Then they give us hardware which almost half a decade and people complain for something new.

Not doubting the accuracy of this, but it's a little hard to take these charts seriously when they can't even spell what they're measuring.

Yh power "consuption". Think its the authors second language.
 
Joined
Sep 30, 2013
Messages
14 (0.00/day)
100% agree. When AMD was churning out CPUs they are saying slow down we just upgraded, damn corporate greed milking the consumer. Then they give us hardware which almost half a decade and people complain for something new.



Yh power "consuption". Think its the authors second language.

Iirc, the author is Czech/Slovak.. I assume that you communicate Czechoslovakian as well as they do English ???!
 
Joined
Dec 6, 2012
Messages
148 (0.03/day)
Processor Ryzen 5 2600X
Motherboard ASRock X470 Taichi
Cooling Wraith Max HSF
Memory 2 x 8GB G.Skill FlareX @ 3400 MT/s CL14
Video Card(s) EVGA GTX 1080 Ti
Storage Samsung 970 Evo 250GB/Western Digital SN550 Blue 1TB/Crucial MX500 500GB (Ubuntu)/Toshiba 2TB HDD
Display(s) LG 27UD68-P
Case Fractal Design Define R6 TG
Power Supply EVGA SuperNOVA G2 750W
Mouse Razer Deathadder Chroma
Keyboard Corsair K75 (Cherry Brown Switches)
Iirc, the author is Czech/Slovak.. I assume that you communicate Czechoslovakian as well as they do English ???!

I assumed as much, but that is beside the point. If I was publishing data in Czechoslovakian and trying to be taken seriously, you bet your ass I'd make sure there weren't any spelling errors. Once again, I'm not saying the data is false or inaccurate, just a little unprofessional.

Anyways, back on topic. 832 GCN cores seems like a waste of space/power if they're just going to be held back by the memory bandwidth anyways. I'm thinking it'll be between 384 and 512 cores.
 
Last edited:
Joined
May 18, 2010
Messages
3,427 (0.64/day)
System Name My baby
Processor Athlon II X4 620 @ 3.5GHz, 1.45v, NB @ 2700Mhz, HT @ 2700Mhz - 24hr prime95 stable
Motherboard Asus M4A785TD-V EVO
Cooling Sonic Tower Rev 2 with 120mm Akasa attached, Akasa @ Front, Xilence Red Wing 120mm @ Rear
Memory 8 GB G.Skills 1600Mhz
Video Card(s) ATI ASUS Crossfire 5850
Storage Crucial MX100 SATA 2.5 SSD
Display(s) Lenovo ThinkVision 27" (LEN P27h-10)
Case Antec VSK 2000 Black Tower Case
Audio Device(s) Onkyo TX-SR309 Receiver, 2x Kef Cresta 1, 1x Kef Center 20c
Power Supply OCZ StealthXstream II 600w, 4x12v/18A, 80% efficiency.
Software Windows 10 Professional 64-bit
Iirc, the author is Czech/Slovak.. I assume that you communicate Czechoslovakian as well as they do English ???!

Nope. But I wasn't criticising the authors spelling Ralfies was. I was just point out the mistake so everyone knew what Ralfies was talking about.
 
Joined
Dec 16, 2010
Messages
1,668 (0.33/day)
Location
State College, PA, US
System Name My Surround PC
Processor AMD Ryzen 9 7950X3D
Motherboard ASUS STRIX X670E-F
Cooling Swiftech MCP35X / EK Quantum CPU / Alphacool GPU / XSPC 480mm w/ Corsair Fans
Memory 96GB (2 x 48 GB) G.Skill DDR5-6000 CL30
Video Card(s) MSI NVIDIA GeForce RTX 4090 Suprim X 24GB
Storage WD SN850 2TB, Samsung PM981a 1TB, 4 x 4TB + 1 x 10TB HGST NAS HDD for Windows Storage Spaces
Display(s) 2 x Viotek GFI27QXA 27" 4K 120Hz + LG UH850 4K 60Hz + HMD
Case NZXT Source 530
Audio Device(s) Sony MDR-7506 / Logitech Z-5500 5.1
Power Supply Corsair RM1000x 1 kW
Mouse Patriot Viper V560
Keyboard Corsair K100
VR HMD HP Reverb G2
Software Windows 11 Pro x64
Benchmark Scores Mellanox ConnectX-3 10 Gb/s Fiber Network Card
I'm not expecting much of anything regarding hardware announcements, even during the press conference. All the seminars involve software, and I expect that will be the theme of the conference.

Anyways, back on topic. 832 GCN cores seems like a waste of space/power if they're just going to be held back by the memory bandwidth anyways. I'm thinking it'll be between 384 and 512 cores.

That bandwidth constraint is the issue. I don't understand why mobile processors haven't gotten wider memory buses to compensate. I can understand socketed desktop CPUs needing too many pins to support a wider memory bus, and also DIMM placement is an issue with a wide bus. But modern notebooks use BGA CPUs and soldered down memory. Theoretically a 256-bit DDR3 bus shouldn't require much more space in a laptop than a 128-bit bus, and the only increase in cost might be a PCB with a few more layers. In exchange graphics performance would scale immensely. Microsoft and Sony can do it for the APUs in their consoles and AMD's graphics division does it all the time for its GPUs, so I don't see why it isn't done for the mass market APUs.
 
Joined
May 13, 2008
Messages
762 (0.13/day)
System Name HTPC whhaaaat?
Processor 2600k @ 4500mhz
Motherboard Asus Maximus IV gene-z gen3
Cooling Noctua NH-C14
Memory Gskill Ripjaw 2x4gb
Video Card(s) EVGA 1080 FTW @ 2037/11016
Storage 2x512GB MX100/1x Agility 3 128gb ssds, Seagate 3TB HDD
Display(s) Vizio P 65'' 4k tv
Case Lian Li pc-c50b
Audio Device(s) Denon 3311
Power Supply Corsair 620HX
That bandwidth constraint is the issue. I don't understand why mobile processors haven't gotten wider memory buses to compensate. I can understand socketed desktop CPUs needing too many pins to support a wider memory bus, and also DIMM placement is an issue with a wide bus. But modern notebooks use BGA CPUs and soldered down memory. Theoretically a 256-bit DDR3 bus shouldn't require much more space in a laptop than a 128-bit bus, and the only increase in cost might be a PCB with a few more layers. In exchange graphics performance would scale immensely. Microsoft and Sony can do it for the APUs in their consoles and AMD's graphics division does it all the time for its GPUs, so I don't see why it isn't done for the mass market APUs.

You're asking the questions that have boggled my mind since the conception of the APU, and certainly what I find the most interesting challenge moving forward.

There are many conceivable answers, a wider bus among them (256-bit ddr3 would be sufficient for a ~512sp design), although perhaps less probable as we move to ddr4 and it's 1dimm-per-channel restriction and larger, more demanding iGPUs that will quickly outpace a 128-bit ddr4 bus. Certainly there is bga, but I wonder if amd is really willing to take that leap with their larger designs (as a consumer platform, ie not the ps4 or iterations of bobcat).

Hypertransport, if not a discrete (or optional) gddr5 bus to a gpu cache (ala what used to be called Sideport Memory in the discrete IGP days) seemed like a realistic option even up to this generation. While 32-bit, with the max bandwidth of a link resting somewhere near what gddr5 is capable on AMD's current gpu controllers, and meshing fairly nicely with being around half of what a 32/28nm iGPU would need (and twice what a 128-bit ddr3 bus could deliver), that would have more-or-less made sense. Obviously moving past this gen it would be less so, unless itself coupled with a ddr4 bus (ie ddr4 + gddrX).

From there, we have the possibilities of larger/faster caches (like the X1's on-die ram) offsetting what is needed externally. There is also the possibility of things like on-package off-die caches (not unlike Intel's Iris) as well stacked dram like Volta.

Whatever their solution, they need to do it yesterday. Their strength is (and has always been) in the floating point computation per mm (per process/cost) their designs deliver. While HSA capitalizes on this fact, as it should, with each passing node they lose that (realistic) advantage to intel, whom can ramp clocks higher until they reach parity in design (and then clock them lower to save power) even as their priority lies in improving their cpu cores. With each passing gpu gen nvidia grows closer to parity, as they are clearly receding from purely thinking of their designs as efficient gpus to rather more-or-less a floating point core (that makes sense as such unit with or without the shell of a cpu). The scary thing about all that is...intel and nvidia, those least dependant on memory bandwidth currently, have shown their plans for going forward. AMD, whom already is restricted on all fronts by this reality, has not (outside the ps4.)

I find that sincerely troubling. No doubt they have an answer...I just hope it comes sooner rather than later.
 
Joined
Dec 16, 2010
Messages
1,668 (0.33/day)
Location
State College, PA, US
System Name My Surround PC
Processor AMD Ryzen 9 7950X3D
Motherboard ASUS STRIX X670E-F
Cooling Swiftech MCP35X / EK Quantum CPU / Alphacool GPU / XSPC 480mm w/ Corsair Fans
Memory 96GB (2 x 48 GB) G.Skill DDR5-6000 CL30
Video Card(s) MSI NVIDIA GeForce RTX 4090 Suprim X 24GB
Storage WD SN850 2TB, Samsung PM981a 1TB, 4 x 4TB + 1 x 10TB HGST NAS HDD for Windows Storage Spaces
Display(s) 2 x Viotek GFI27QXA 27" 4K 120Hz + LG UH850 4K 60Hz + HMD
Case NZXT Source 530
Audio Device(s) Sony MDR-7506 / Logitech Z-5500 5.1
Power Supply Corsair RM1000x 1 kW
Mouse Patriot Viper V560
Keyboard Corsair K100
VR HMD HP Reverb G2
Software Windows 11 Pro x64
Benchmark Scores Mellanox ConnectX-3 10 Gb/s Fiber Network Card
You're asking the questions that have boggled my mind since the conception of the APU, and certainly what I find the most interesting challenge moving forward.

There are many conceivable answers, a wider bus among them (256-bit ddr3 would be sufficient for a ~512sp design), although perhaps less probable as we move to ddr4 and it's 1dimm-per-channel restriction and larger, more demanding iGPUs that will quickly outpace a 128-bit ddr4 bus. Certainly there is bga, but I wonder if amd is really willing to take that leap with their larger designs (as a consumer platform, ie not the ps4 or iterations of bobcat).

Hypertransport, if not a discrete (or optional) gddr5 bus to a gpu cache (ala what used to be called Sideport Memory in the discrete IGP days) seemed like a realistic option even up to this generation. While 32-bit, with the max bandwidth of a link resting somewhere near what gddr5 is capable on AMD's current gpu controllers, and meshing fairly nicely with being around half of what a 32/28nm iGPU would need (and twice what a 128-bit ddr3 bus could deliver), that would have more-or-less made sense. Obviously moving past this gen it would be less so, unless itself coupled with a ddr4 bus (ie ddr4 + gddrX).

From there, we have the possibilities of larger/faster caches (like the X1's on-die ram) offsetting what is needed externally. There is also the possibility of things like on-package off-die caches (not unlike Intel's Iris) as well stacked dram like Volta.

Whatever their solution, they need to do it yesterday. Their strength is (and has always been) in the floating point computation per mm (per process/cost) their designs deliver. While HSA capitalizes on this fact, as it should, with each passing node they lose that (realistic) advantage to intel, whom can ramp clocks higher until they reach parity in design (and then clock them lower to save power) even as their priority lies in improving their cpu cores. With each passing gpu gen nvidia grows closer to parity, as they are clearly receding from purely thinking of their designs as efficient gpus to rather more-or-less a floating point core (that makes sense as such unit with or without the shell of a cpu). The scary thing about all that is...intel and nvidia, those least dependant on memory bandwidth currently, have shown their plans for going forward. AMD, whom already is restricted on all fronts by this reality, has not (outside the ps4.)

I find that sincerely troubling. No doubt they have an answer...I just hope it comes sooner rather than later.

From what I've read about AMD's goals, they don't want a heterogeneous memory pool like the XBOX One where different memory addresses have different bandwidths and latencies. AMD is pushing to have all memory addresses the same speed and latency in order to avoid the need for software to shuffle memory among different addresses in order to optimize bandwidth, sort of what is one with a discrete GPU today. This doesn't eliminate an algorithm implemented in the core hardware managing more levels of cache (like what Intel does with Crystalwell), but AMD wants this to be transparent to the developer.

As far as DDR4, the doubled bandwidth will stave off the bandwidth limitation for a while but even without the need for more bandwidth the 1 DIMM/channel limitation will encourage wider memory buses. The people who want lots of memory for the desktop or mobile will now need lots double the memory channels to achieve the same capacity with DDR4 as DDR3. The server market already moved in this direction with DDR3; the reason for the migration to 256-bit buses were more for the sheer memory capacity of that many memory channels rather than the increased bandwidth.
 
Joined
Sep 19, 2012
Messages
615 (0.14/day)
System Name [WIP]
Processor Intel Pentium G3420 [i7-4790K SOON(tm)]
Motherboard MSI Z87-GD65 Gaming
Cooling [Corsair H100i]
Memory G.Skill TridentX 2x8GB-2400-CL10 DDR3
Video Card(s) [MSI AMD Radeon R9-290 Gaming]
Storage Seagate 2TB Desktop SSHD / [Samsung 256GB 840 PRO]
Display(s) [BenQ XL2420Z]
Case [Corsair Obsidian 750D]
Power Supply Corsair RM750
Software Windows 8.1 x64 Pro / Linux Mint 15 / SteamOS
Again people with the same concerns and mentality... *sigh*

Let me put it simple... HSA > pure iGPU for games and crap.

HSA is ment as a revolution in x86... and possibly the only thing that can save it from a slow and painful death by ARM.

Seriously, while the iGPU part should be beastly, even if with the new IMC and faster DDR3 support, it'll still come short of it's potential... the great iGPU is far from the (only) point of Kaveri...

And I'm sure, on paper at least, adding an extra 192-256 ALUs make much more performance sense to AMD, than adding 2 extra cores.
 
Joined
Nov 4, 2005
Messages
12,013 (1.72/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives
Display(s) 55" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
From what I've read about AMD's goals, they don't want a heterogeneous memory pool like the XBOX One where different memory addresses have different bandwidths and latencies. AMD is pushing to have all memory addresses the same speed and latency in order to avoid the need for software to shuffle memory among different addresses in order to optimize bandwidth, sort of what is one with a discrete GPU today. This doesn't eliminate an algorithm implemented in the core hardware managing more levels of cache (like what Intel does with Crystalwell), but AMD wants this to be transparent to the developer.

As far as DDR4, the doubled bandwidth will stave off the bandwidth limitation for a while but even without the need for more bandwidth the 1 DIMM/channel limitation will encourage wider memory buses. The people who want lots of memory for the desktop or mobile will now need lots double the memory channels to achieve the same capacity with DDR4 as DDR3. The server market already moved in this direction with DDR3; the reason for the migration to 256-bit buses were more for the sheer memory capacity of that many memory channels rather than the increased bandwidth.

No, they DO want it. It improves their performance in all facets.

http://arstechnica.com/information-...orm-memory-access-coming-this-year-in-kaveri/


Instead of having software decide where to run the process from, the hardware decides in real time which is more efficient, and then runs it. Addresses are the same, so no latency penalty for transporting it around. Hugely improved performance in DSP and other filtered data, serial data still run on the CPU cores.
 
Joined
Sep 19, 2012
Messages
615 (0.14/day)
System Name [WIP]
Processor Intel Pentium G3420 [i7-4790K SOON(tm)]
Motherboard MSI Z87-GD65 Gaming
Cooling [Corsair H100i]
Memory G.Skill TridentX 2x8GB-2400-CL10 DDR3
Video Card(s) [MSI AMD Radeon R9-290 Gaming]
Storage Seagate 2TB Desktop SSHD / [Samsung 256GB 840 PRO]
Display(s) [BenQ XL2420Z]
Case [Corsair Obsidian 750D]
Power Supply Corsair RM750
Software Windows 8.1 x64 Pro / Linux Mint 15 / SteamOS
Apparently AMD has indirectly confirmed the naming scheme for desktop Kaveri.



So as I suspected, Ax-7x00x. Like A10-7800K for the next top tier model.

Edit: As well as the existance of next Athlon CPUs... Like Athlon II X4 770K or 850K? I guess.
 
Last edited:
Joined
Dec 16, 2010
Messages
1,668 (0.33/day)
Location
State College, PA, US
System Name My Surround PC
Processor AMD Ryzen 9 7950X3D
Motherboard ASUS STRIX X670E-F
Cooling Swiftech MCP35X / EK Quantum CPU / Alphacool GPU / XSPC 480mm w/ Corsair Fans
Memory 96GB (2 x 48 GB) G.Skill DDR5-6000 CL30
Video Card(s) MSI NVIDIA GeForce RTX 4090 Suprim X 24GB
Storage WD SN850 2TB, Samsung PM981a 1TB, 4 x 4TB + 1 x 10TB HGST NAS HDD for Windows Storage Spaces
Display(s) 2 x Viotek GFI27QXA 27" 4K 120Hz + LG UH850 4K 60Hz + HMD
Case NZXT Source 530
Audio Device(s) Sony MDR-7506 / Logitech Z-5500 5.1
Power Supply Corsair RM1000x 1 kW
Mouse Patriot Viper V560
Keyboard Corsair K100
VR HMD HP Reverb G2
Software Windows 11 Pro x64
Benchmark Scores Mellanox ConnectX-3 10 Gb/s Fiber Network Card
No, they DO want it. It improves their performance in all facets.

http://arstechnica.com/information-...orm-memory-access-coming-this-year-in-kaveri/

Instead of having software decide where to run the process from, the hardware decides in real time which is more efficient, and then runs it. Addresses are the same, so no latency penalty for transporting it around. Hugely improved performance in DSP and other filtered data, serial data still run on the CPU cores.

I don't understand how that article refutes what I said; I think you agree with me but didn't understand what I said. I wasn't referring to dedicated memory for the GPU and GPU, which is obviously going away. I was referring AMD not wanting something like a NUMA where different memory addresses have different bandwidths and latencies.

When programming for the XBOX One, programmers have to write their code so that the most latency and bandwidth sensitive parts are sent to the small SRAM while the rest of the data is written to the larger but slower main memory. AMD doesn't want to have developers worrying about swapping data between the SRAM versus main memory, so they want a unified memory architecture like the PS4.

This is why I don't see something like alwayssts said occurring, where there is a small, high speed, on chip cache managed by software. The whole point of AMD's heterogeneous computing initiative is to make it as easy as possible for programmers to utilize heterogeneous computing. If there is to be a large SRAM cache at all, AMD wants something more like Crystalwell where the cache is managed by hardware and it is transparent to the developer.
 
Last edited:
Joined
Mar 10, 2010
Messages
11,878 (2.20/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
I don't understand how that article refutes what I said; I think you agree with me but didn't understand what I said. I wasn't referring to dedicated memory for the GPU and GPU, which is obviously going away. I was referring AMD not wanting something like a NUMA where different memory addresses have different bandwidths and latencies.

When programming for the XBOX One, programmers have to write their code so that the most latency and bandwidth sensitive parts are sent to the small SRAM while the rest of the data is written to the larger but slower main memory. AMD doesn't want to have developers worrying about swapping data between the SRAM versus main memory, so they want a unified memory architecture like the PS4.

This is why I don't see something like alwayssts said occurring, where there is a small, high speed, on chip cache managed by software. The whole point of AMD's heterogeneous computing initiative is to make it as easy as possible for programmers to utilize heterogeneous computing. If there is to be a large SRAM cache at all, AMD wants something more like Crystalwell where the cache is managed by hardware and it is transparent to the developer.

thats exactly it and exactly where i think they are all going, stacked chips with multi layered memory and in centralising the memory rescource it only makes sense to up the bandwidth of each route to it and remove some of the intermediary caches to bring back some latency.
Im thinking quad module for Amd but per layer and effectively 4x ddr4 imc per layer x2 for 16 logic cores from 8 tied across an 8 channel ddr4 interface to 8 gig of Tsv connected dram, drop the sytem ram too at this point and the year is,,, ,likely 2015:cool:
 
Top