• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Vega 20 GPU Could Implement PCI-Express gen 4.0

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,252 (7.54/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
The "Vega 20" silicon will be significantly different from the "Vega 10" which powers the company's current Radeon RX Vega series. AMD CEO Dr. Lisa Su unveiled the "Vega 20" silicon at the company's 2018 Computex event, revealing that the multi-chip module's 7 nm GPU die is surrounded by not two, but four HBM2 memory stacks, making up to 32 GB of memory. Another key specification is emerging thanks to the sharp eyes at ComputerBase.de - system bus.

A close inspection of the latest AMDGPU Linux driver includes PCI-Express link speed definitions for PCI-Express gen 4.0, which offers 256 Gbps of bandwidth per direction at x16 bus width, double that of PCI-Express gen 3.0. "Vega 20" got its first PCIe gen 4.0 support confirmation from a leak slide that surfaced around CES 2018. AMD "Vega" architecture slides from last year hinted at a Q3/Q4 launch of the first "Vega 20" based product. The same slide also hinted that the next-generation EPYC processor, which we know are "Zen 2" based and not "Zen+," could feature PCI-Express gen 4.0 root-complexes. Since EPYC chips are multi-chip modules, it could also hint at the likelihood of PCIe gen 4.0 on "Zen 2" based 3rd generation Ryzen processor family.



View at TechPowerUp Main Site
 

bug

Joined
May 22, 2015
Messages
13,794 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
A checkbox feature for the time being. No CPU currently supports PCIe 4.0 and to design a GPU that actually uses all that bandwidth, would mean it would run crippled unless paired with upcoming CPUs.
No biggie, we've been there before with PCIe 2.0 and 3.0.

What would be really neat would be for the card to run PCIe 4.0 x8 and free up 8 more lanes for NVMe storage.
 
Joined
Sep 14, 2017
Messages
625 (0.24/day)
A checkbox feature for the time being. No CPU currently supports PCIe 4.0 and to design a GPU that actually uses all that bandwidth, would mean it would run crippled unless paired with upcoming CPUs.
No biggie, we've been there before with PCIe 2.0 and 3.0.

What would be really neat would be for the card to run PCIe 4.0 x8 and free up 8 more lanes for NVMe storage.


I think that might be the point. Even 5.0 coming up, that pure x16 bandwidth is not as necessary (aside from say maybe HPC NICs) but how x1 lane would be enough now for a lot more devices per chassis. So at 3.0 it takes x16 lanes. But at 4.0 it takes x8 lanes for the GPU. Plus it's backwards compatible so doesn't hurt.
 
Last edited:
Joined
Aug 13, 2010
Messages
5,474 (1.05/day)
IDC if it would be PCIE3.0
All i wanna see is good graphical performance for a fair price.

By this time I wouldn't even care if the damn thing consumes 300 watts. We've come to this very reasonable and minimalist request. This is all there's left to ask.
 
Joined
Jun 29, 2018
Messages
537 (0.23/day)
A checkbox feature for the time being. No CPU currently supports PCIe 4.0 and to design a GPU that actually uses all that bandwidth, would mean it would run crippled unless paired with upcoming CPUs.

That is incorrect. IBM POWER9 implements PCIe 4.0 (and nVidia NVLink 2.0) since 2017 and is used among other things in the Talos II Workstation. Granted that is not a gaming computer, but still it does use 4.0.

Edit: Phoronix has some benchmarks under Linux of it.
 
Joined
Dec 16, 2017
Messages
2,922 (1.15/day)
System Name System V
Processor AMD Ryzen 5 3600
Motherboard Asus Prime X570-P
Cooling Cooler Master Hyper 212 // a bunch of 120 mm Xigmatek 1500 RPM fans (2 ins, 3 outs)
Memory 2x8GB Ballistix Sport LT 3200 MHz (BLS8G4D32AESCK.M8FE) (CL16-18-18-36)
Video Card(s) Gigabyte AORUS Radeon RX 580 8 GB
Storage SHFS37A240G / DT01ACA200 / ST10000VN0008 / ST8000VN004 / SA400S37960G / SNV21000G / NM620 2TB
Display(s) LG 22MP55 IPS Display
Case NZXT Source 210
Audio Device(s) Logitech G430 Headset
Power Supply Corsair CX650M
Software Whatever build of Windows 11 is being served in Canary channel at the time.
Benchmark Scores Corona 1.3: 3120620 r/s Cinebench R20: 3355 FireStrike: 12490 TimeSpy: 4624
Okay. So it is likely that next-gen graphics card (Vega 20) and whatever comes after Volta will switch to PCIe 4.0 for the high end products.

This also makes me think that I should probably put on hold any upgrade plans and wait until motherboards and CPUs with support for PCI 4.0 reach consumer market, for better future proofing...
 
Joined
Jun 14, 2010
Messages
632 (0.12/day)
Location
City 217
Processor AMD Phenom II X4 925
Motherboard Asus M4A78LT-M
Cooling Ice Hammer IH-4***
Memory 2x4GB DDR3 Corsair
Video Card(s) Asus HD7870 2GB
Storage 500GB SATAII Samsung | 500GB SATAII Seagate
Display(s) 23" LG 23EA63V-P
Case Thermaltake V3 Black Edition
Audio Device(s) VIA VT1708S
Power Supply Corsair TX650W
Software Windows 10 x64
Backwards compatibility and futureproofing. This is good stuff.
 
Joined
May 12, 2016
Messages
259 (0.08/day)
Processor Intel Core i7 11700
Motherboard Asus b560-i ROG
Cooling Thermalright Assassin King Mini
Memory G.Skill Trident Z 3600
Video Card(s) RTX 3080 FE
Display(s) Dell S2721DGF
Case Ncase M1
Power Supply Corsair SF750
Mouse HyperX
Keyboard HyperX
Okay. So it is likely that next-gen graphics card (Vega 20) and whatever comes after Volta will switch to PCIe 4.0 for the high end products.

This also makes me think that I should probably put on hold any upgrade plans and wait until motherboards and CPUs with support for PCI 4.0 reach consumer market, for better future proofing...

Exactly my strategy as well. Wait until 7nm cpus and gpus arrive with all the nice new features and make a quality upgrade. Should be there circa 2019 or 2020. In a meantime - just keep using 6700k and 1080.
 

eidairaman1

The Exiled Airman
Joined
Jul 2, 2007
Messages
42,346 (6.65/day)
Location
Republic of Texas (True Patriot)
System Name PCGOD
Processor AMD FX 8350@ 5.0GHz
Motherboard Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling Scythe Ashura, 2Ă—BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory 16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s) AMD Radeon 290 Sapphire Vapor-X
Storage Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s) NEC Multisync LCD 1700V (Display Port Adapter)
Case AeroCool Xpredator Evil Blue Edition
Audio Device(s) Creative Labs Sound Blaster ZxR
Power Supply Seasonic 1250 XM2 Series (XP3)
Mouse Roccat Kone XTD
Keyboard Roccat Ryos MK Pro
Software Windows 7 Pro 64
I thought pcie 4 was relegated to servers and 5.0 would be the next major version for desktops...
 
Joined
Sep 17, 2014
Messages
22,491 (6.03/day)
Location
The Washing Machine
System Name Tiny the White Yeti
Processor 7800X3D
Motherboard MSI MAG Mortar b650m wifi
Cooling CPU: Thermalright Peerless Assassin / Case: Phanteks T30-120 x3
Memory 32GB Corsair Vengeance 30CL6000
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Lexar NM790 4TB + Samsung 850 EVO 1TB + Samsung 980 1TB + Crucial BX100 250GB
Display(s) Gigabyte G34QWC (3440x1440)
Case Lian Li A3 mATX White
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse Steelseries Aerox 5
Keyboard Lenovo Thinkpad Trackpoint II
VR HMD HD 420 - Green Edition ;)
Software W11 IoT Enterprise LTSC
Benchmark Scores Over 9000
Okay. So it is likely that next-gen graphics card (Vega 20) and whatever comes after Volta will switch to PCIe 4.0 for the high end products.

This also makes me think that I should probably put on hold any upgrade plans and wait until motherboards and CPUs with support for PCI 4.0 reach consumer market, for better future proofing...

Not at all, really. You can run a 1080ti at 2.0 x16 and not notice a difference at all.

This only really benefits us the other way round: you can run GPU at x8 and save lanes for other stuff such as storage, which is a real thing these days for us lowly consumers.
 

eidairaman1

The Exiled Airman
Joined
Jul 2, 2007
Messages
42,346 (6.65/day)
Location
Republic of Texas (True Patriot)
System Name PCGOD
Processor AMD FX 8350@ 5.0GHz
Motherboard Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling Scythe Ashura, 2Ă—BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory 16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s) AMD Radeon 290 Sapphire Vapor-X
Storage Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s) NEC Multisync LCD 1700V (Display Port Adapter)
Case AeroCool Xpredator Evil Blue Edition
Audio Device(s) Creative Labs Sound Blaster ZxR
Power Supply Seasonic 1250 XM2 Series (XP3)
Mouse Roccat Kone XTD
Keyboard Roccat Ryos MK Pro
Software Windows 7 Pro 64
Not at all, really. You can run a 1080ti at 2.0 x16 and not notice a difference at all.

This only really benefits us the other way round: you can run GPU at x8 and save lanes for other stuff such as storage, which is a real thing these days for us lowly consumers.


Considering it seems like it is mainly Network controllers and storage devices that need all the throughput because graphics cards just plain jane don't utilize it.
 
Joined
Apr 6, 2015
Messages
250 (0.07/day)
Location
Japan
System Name ChronicleScienceWorkStation
Processor AMD Threadripper 1950X
Motherboard Asrock X399 Taichi
Cooling Noctua U14S-TR4
Memory G.Skill DDR4 3200 C14 16GB*4
Video Card(s) AMD Radeon VII
Storage Samsung 970 Pro*1, Kingston A2000 1TB*2 RAID 0, HGST 8TB*5 RAID 6
Case Lian Li PC-A75X
Power Supply Corsair AX1600i
Software Proxmox 6.2
Considering it seems like it is mainly Network controllers and storage devices that need all the throughput because graphics cards just plain jane don't utilize it.
This is absolutely incorrect.

The motivation behind the sudden improvement of PCI-E (3.0 was back in 2010) is exactly GPU, but not the gaming part; it is the compute part which finds the PCI-E interface a horrible bottleneck.
PS: I am writing GPU code to accelerate data capture, the PCI-E speed is basically what determines the shortest processing time.
 

eidairaman1

The Exiled Airman
Joined
Jul 2, 2007
Messages
42,346 (6.65/day)
Location
Republic of Texas (True Patriot)
System Name PCGOD
Processor AMD FX 8350@ 5.0GHz
Motherboard Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling Scythe Ashura, 2Ă—BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory 16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s) AMD Radeon 290 Sapphire Vapor-X
Storage Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s) NEC Multisync LCD 1700V (Display Port Adapter)
Case AeroCool Xpredator Evil Blue Edition
Audio Device(s) Creative Labs Sound Blaster ZxR
Power Supply Seasonic 1250 XM2 Series (XP3)
Mouse Roccat Kone XTD
Keyboard Roccat Ryos MK Pro
Software Windows 7 Pro 64
PS: I am writing GPU code to accelerate data capture, the PCI-E speed is basically what determines the shortest processing time.

You may want to prove it at this point...
 
Joined
Feb 1, 2017
Messages
39 (0.01/day)
I thought pcie 4 was relegated to servers and 5.0 would be the next major version for desktops...
That makes sense since this article is talking about a GPU that is intended for the data center market and not the consumer space. Of course, that doesn't stop anyone from fantasizing about the implications that this would theoretically have on the gaming market.
 
Joined
Nov 25, 2012
Messages
247 (0.06/day)
So are the cards going to be any good or just a marking managers wet dream?
As the numbers are bigger than the competition.
In the real price performanec and heat are pretty average.
 
Joined
Apr 6, 2015
Messages
250 (0.07/day)
Location
Japan
System Name ChronicleScienceWorkStation
Processor AMD Threadripper 1950X
Motherboard Asrock X399 Taichi
Cooling Noctua U14S-TR4
Memory G.Skill DDR4 3200 C14 16GB*4
Video Card(s) AMD Radeon VII
Storage Samsung 970 Pro*1, Kingston A2000 1TB*2 RAID 0, HGST 8TB*5 RAID 6
Case Lian Li PC-A75X
Power Supply Corsair AX1600i
Software Proxmox 6.2
You may want to prove it at this point...
Just do the math.

How many TFLOPs can even a cheap card handle today, how much bandwidth those cards have, and finally how many GB/s can PCI-E 3.0 pass.
This bandwidth limit does not kill everyone as one could do a lot of operations in GPU, plus for various reason performance does not just scale with TFLOPs, however, in cases like digital down-conversion where vectors are multiplied element-wise, the transfer is the limitation.
 
Last edited:
Joined
Apr 29, 2014
Messages
4,291 (1.11/day)
Location
Texas
System Name SnowFire / The Reinforcer
Processor i7 10700K 5.1ghz (24/7) / 2x Xeon E52650v2
Motherboard Asus Strix Z490 / Dell Dual Socket (R720)
Cooling RX 360mm + 140mm Custom Loop / Dell Stock
Memory Corsair RGB 16gb DDR4 3000 CL 16 / DDR3 128gb 16 x 8gb
Video Card(s) GTX Titan XP (2025mhz) / Asus GTX 950 (No Power Connector)
Storage Samsung 970 1tb NVME and 2tb HDD x4 RAID 5 / 300gb x8 RAID 5
Display(s) Acer XG270HU, Samsung G7 Odyssey (1440p 240hz)
Case Thermaltake Cube / Dell Poweredge R720 Rack Mount Case
Audio Device(s) Realtec ALC1150 (On board)
Power Supply Rosewill Lightning 1300Watt / Dell Stock 750 / Brick
Mouse Logitech G5
Keyboard Logitech G19S
Software Windows 11 Pro / Windows Server 2016
Well would be interesting but still pointless until we get actual desktop support. However, I say we focus on getting something out that can actually best the GTX 1080ti by a decent margin before innovating more. We are still waiting on some competition so new cards come out...
 
Joined
Mar 15, 2018
Messages
34 (0.01/day)
Well would be interesting but still pointless until we get actual desktop support. However, I say we focus on getting something out that can actually best the GTX 1080ti by a decent margin before innovating more. We are still waiting on some competition so new cards come out...

is your mind only gaming ?? pointless ?? there is IBM Power9 that support PCIE 4.0

And vega 7Nm with 32 GB HBM2 is Designing ONLY for Datacenter and HPC,,, AMD making GPU For datacenter not dekstop,, where your point ??
 
Joined
Apr 6, 2015
Messages
250 (0.07/day)
Location
Japan
System Name ChronicleScienceWorkStation
Processor AMD Threadripper 1950X
Motherboard Asrock X399 Taichi
Cooling Noctua U14S-TR4
Memory G.Skill DDR4 3200 C14 16GB*4
Video Card(s) AMD Radeon VII
Storage Samsung 970 Pro*1, Kingston A2000 1TB*2 RAID 0, HGST 8TB*5 RAID 6
Case Lian Li PC-A75X
Power Supply Corsair AX1600i
Software Proxmox 6.2
is your mind only gaming ?? pointless ?? there is IBM Power9 that support PCIE 4.0

And vega 7Nm with 32 GB HBM2 is Designing ONLY for Datacenter and HPC,,, AMD making GPU For datacenter not dekstop,, where your point ??
That's a mindset people have to renew, AI/DSP and a lot more new applications of GPU will make up the majority of the GPU sales in the future, the crytomining said it all.
 

bug

Joined
May 22, 2015
Messages
13,794 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
That's a mindset people have to renew, AI/DSP and a lot more new applications of GPU will make up the majority of the GPU sales in the future, the crytomining said it all.
Actually, I think the sane thing to do would to have a clear delimitation between GPUs (usually associated with desktops) and stuff that's used for compute (rarely associated with desktops). But since SKUs are very closely related, that's not going to happen anytime soon.
In the meantime, I think we should cut some slack for those that see GPU and don't automatically think mining, AI and whatnot. In exchange, they (myself included) should be more careful choosing their words.
 
Joined
Apr 29, 2014
Messages
4,291 (1.11/day)
Location
Texas
System Name SnowFire / The Reinforcer
Processor i7 10700K 5.1ghz (24/7) / 2x Xeon E52650v2
Motherboard Asus Strix Z490 / Dell Dual Socket (R720)
Cooling RX 360mm + 140mm Custom Loop / Dell Stock
Memory Corsair RGB 16gb DDR4 3000 CL 16 / DDR3 128gb 16 x 8gb
Video Card(s) GTX Titan XP (2025mhz) / Asus GTX 950 (No Power Connector)
Storage Samsung 970 1tb NVME and 2tb HDD x4 RAID 5 / 300gb x8 RAID 5
Display(s) Acer XG270HU, Samsung G7 Odyssey (1440p 240hz)
Case Thermaltake Cube / Dell Poweredge R720 Rack Mount Case
Audio Device(s) Realtec ALC1150 (On board)
Power Supply Rosewill Lightning 1300Watt / Dell Stock 750 / Brick
Mouse Logitech G5
Keyboard Logitech G19S
Software Windows 11 Pro / Windows Server 2016
is your mind only gaming ?? pointless ?? there is IBM Power9 that support PCIE 4.0

And vega 7Nm with 32 GB HBM2 is Designing ONLY for Datacenter and HPC,,, AMD making GPU For datacenter not dekstop,, where your point ??
No, my mind is on whats available and what we can do with it. Yes most of my reference was to the mainstream but either way there is not even much professional support for PCIE 4.0. Congrats on the googling to find one of the few things that can support it. Its on the label, and far to early to celebrate support for something that's hard to find support for at the time. If that changes then it will become something to look forward to.

Actually, I think the sane thing to do would to have a clear delimitation between GPUs (usually associated with desktops) and stuff that's used for compute (rarely associated with desktops). But since SKUs are very closely related, that's not going to happen anytime soon.
In the meantime, I think we should cut some slack for those that see GPU and don't automatically think mining, AI and whatnot. In exchange, they (myself included) should be more careful choosing their words.
The problem is more along the lines of what can even support it and when. Having a card even for professional use is nice with the feature but right now there ain't much that can do PCIE 4.0.
 

bug

Joined
May 22, 2015
Messages
13,794 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
The problem is more along the lines of what can even support it and when. Having a card even for professional use is nice with the feature but right now there ain't much that can do PCIE 4.0.
It's been pointed above: only server chips for now. But I wouldn't make a fuss about, it's the chicken and egg problem we get with every new generation. Gotta start somewhere with support ;)
 
Joined
Apr 12, 2013
Messages
7,545 (1.77/day)
This is absolutely incorrect.

The motivation behind the sudden improvement of PCI-E (3.0 was back in 2010) is exactly GPU, but not the gaming part; it is the compute part which finds the PCI-E interface a horrible bottleneck.
PS: I am writing GPU code to accelerate data capture, the PCI-E speed is basically what determines the shortest processing time.
That's why we have Nvlink, IF & IIRC Intel's also working on something.
Can you tell us how much data is transferred between the CPU/GPU through PCIe, by certain applications in your line of work? I'm not talking about theoretical limits, but actual observed data transfers.
 
Joined
Apr 6, 2015
Messages
250 (0.07/day)
Location
Japan
System Name ChronicleScienceWorkStation
Processor AMD Threadripper 1950X
Motherboard Asrock X399 Taichi
Cooling Noctua U14S-TR4
Memory G.Skill DDR4 3200 C14 16GB*4
Video Card(s) AMD Radeon VII
Storage Samsung 970 Pro*1, Kingston A2000 1TB*2 RAID 0, HGST 8TB*5 RAID 6
Case Lian Li PC-A75X
Power Supply Corsair AX1600i
Software Proxmox 6.2
That's why we have Nvlink, IF & IIRC Intel's also working on something.
Can you tell us how much data is transferred between the CPU/GPU through PCIe, by certain applications in your line of work? I'm not talking about theoretical limits, but actual observed data transfers.
I am currently working on a digitizer, it is moving ~ 800 MB channel at a sampling frequency of 200 MHz.
Each input will create two output, so the output size will be 1600 MB.
I am still exploring and optimizing it, the best I got is around a few times above of the theoretical limit; I believe there are some overhead.
We want to utilize all 4 channels, and possibly boost the sampling rate to 400 MHz (or higher), so it will be up to 8 times longer than that. As the operations are relatively simple, the operations are mostly bandwidth limited in our case.
The above is the best case scenario, otherwise we could also be limited by sharing of the PCI-E lanes and stuff alike. The time seems short, but comparing to the computation the transfer dominates things here.

A possible solution is to see if the transfer time (as well as small VRAM size) can be solved by using APU (Raven Ridge) where the RAM is shared between the host and the on-die GPU. If this works, that would be a low cost and efficient processor for our application. Also, by using lower precision (half) and doing decimation in the GPU might also help solve the problem.
 

bug

Joined
May 22, 2015
Messages
13,794 (3.96/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
I am currently working on a digitizer, it is moving ~ 800 MB channel at a sampling frequency of 200 MHz.
Each input will create two output, so the output size will be 1600 MB.
I am still exploring and optimizing it, the best I got is around a few times above of the theoretical limit; I believe there are some overhead.
We want to utilize all 4 channels, and possibly boost the sampling rate to 400 MHz (or higher), so it will be up to 8 times longer than that. As the operations are relatively simple, the operations are mostly bandwidth limited in our case.
The above is the best case scenario, otherwise we could also be limited by sharing of the PCI-E lanes and stuff alike. The time seems short, but comparing to the computation the transfer dominates things here.

A possible solution is to see if the transfer time (as well as small VRAM size) can be solved by using APU (Raven Ridge) where the RAM is shared between the host and the on-die GPU. If this works, that would be a low cost and efficient processor for our application. Also, by using lower precision (half) and doing decimation in the GPU might also help solve the problem.
Can you explain a little?
 
Top