AMD Corporate Fellow Phil Rogers Jumps Ship to NVIDIA

Joined
Sep 1, 2015
Messages
152 (0.04/day)
Hehehe...

8.2 TFLOPs double-precision from 100w is pretty phenomenal. Xeon Phi draws a lot more power for only 1 TFLOP double-precision. Fury X is 8.6 single-precision weighing in at, what, 275w? Granted, these numbers will naturally improve with the move to 14-16nm.
It's like reading about a car running on nuclear energy.
 
Joined
Aug 30, 2015
Messages
166 (0.05/day)
Location
Copenhagen, Denmark
System Name Royal Fortune (Main)/Adventure Galley (NAS)/Little Ranger (HTPC)
Processor Intel i5 4460/AMD C-70/Intel Pentium G3258 Anniversary Ed.
Motherboard Gigabyte ga-z97x-gaming 5/Asrock C-70M1/Asrock Z97 Anniversary
Cooling Phanteks PH-TC12DX/Stock/Raijintek Triton Core
Memory 8GB Team Group Dark 1600 CL9/8GB Team Group Elite 1600 CL9/8GB Avexir Core 1600
Video Card(s) VTX3D R9 280X 3GB/APU/Palit GTX 750 TI StormX Duo
Storage 120GB Team Group Ultra L5 SSD + 1TB WD Black/4 X 2TB WD Blue/120 GB Kingston V300
Display(s) Dell 2310/AOC e2070Swn 19.5"/TV
Case In Win 707/Bitfenix Prodigy M/Dimastech Easy V3
Audio Device(s) N/A
Power Supply EVGA Supernova GS 650W/be quiet! System Power 7 350W/Xigmatek Maverick 400W
Mouse Logitech G303 Daedalus Apex/Razer Abyssus/-
Keyboard Corsair K70 Red/Steelseries Apex Raw/Logitech K400
Software Win10/FreeNAS 9.3/KodiBuntu
I love these threads, they bring out all the strange fanboys...It's like watching pro wrestling, but with a lot less brains.

AMD is in trouble, no doubt about that, but they've been there before. (they've pulled a rabbit out the hat before)
Nvidia has got the better product, at the moment, but that does not make them infallible. (they've screwed up before)

See, this is what a non-fanboy reads from the present situation...but please do keep up the red vs green fight, it's more fun than watching kindergarteners fight :D
 
Joined
Apr 30, 2012
Messages
3,881 (0.84/day)
Hehehe...

8.2 TFLOPs double-precision from 100w is pretty phenomenal. Xeon Phi draws a lot more power for only 1 TFLOP double-precision. Fury X is 8.6 single-precision weighing in at, what, 275w? Granted, these numbers will naturally improve with the move to 14-16nm.

That's a 20 W higher ceiling than their current chip, the PEZY-SC, with 4x the cores (4,096 vs 1,024).

Logic Cores (PE): 1,024
Core Frequency: 733MHz
Peak Performance (Floating Point): Single 3.0TFlops / Double 1.5TFlops
Host Interface: PCI Express GEN3.0 x8Lane x 4Port (x16 bifurcation available), JESD204B Protocol support
DRAM Interface: DDR4, DDR3 combo 64bit x 8Port, Max B/W 153.6GB/s
+ Ultra WIDE IO SDRAM (2,048bit) x 2Port, Max B/W 102.4GB/s

They announced it in Feb 2015 for a release date in 2016.

They also have plans for a PEZY-SC3 & 4

PEZY-SC3
8192 core in 10nm technology 2018

PEZY-SC4
16384 core in 7nm technology 2020
 
Joined
Sep 6, 2013
Messages
3,391 (0.82/day)
Location
Athens, Greece
System Name 3 desktop systems: Gaming / Internet / HTPC
Processor Ryzen 5 7600 / Ryzen 5 4600G / Ryzen 5 5500
Motherboard X670E Gaming Plus WiFi / MSI X470 Gaming Plus Max (1) / MSI X470 Gaming Plus Max (2)
Cooling Aigo ICE 400SE / Segotep T4 / Νoctua U12S
Memory Kingston FURY Beast 32GB DDR5 6000 / 16GB JUHOR / 32GB G.Skill RIPJAWS 3600 + Aegis 3200
Video Card(s) ASRock RX 6600 + GT 710 (PhysX) / Vega 7 integrated / Radeon RX 580
Storage NVMes, ONLY NVMes / NVMes, SATA Storage / NVMe, SATA, external storage
Display(s) Philips 43PUS8857/12 UHD TV (120Hz, HDR, FreeSync Premium) / 19'' HP monitor + BlitzWolf BW-V5
Case Sharkoon Rebel 12 / CoolerMaster Elite 361 / Xigmatek Midguard
Audio Device(s) onboard
Power Supply Chieftec 850W / Silver Power 400W / Sharkoon 650W
Mouse CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Keyboard CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Software Windows 10 / Windows 10&Windows 11 / Windows 10
I love these threads, they bring out all the strange fanboys...It's like watching pro wrestling, but with a lot less brains.

AMD is in trouble, no doubt about that, but they've been there before. (they've pulled a rabbit out the hat before)
Nvidia has got the better product, at the moment, but that does not make them infallible. (they've screwed up before)

See, this is what a non-fanboy reads from the present situation...but please do keep up the red vs green fight, it's more fun than watching kindergarteners fight :D
You are also a fanboy, a fanboy of yourself :p
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
System Name MoneySink
Processor 2600K @ 4.8
Motherboard P8Z77-V
Cooling AC NexXxos XT45 360, RayStorm, D5T+XSPC tank, Tygon R-3603, Bitspower
Memory 16GB Crucial Ballistix DDR3-1600C8
Video Card(s) GTX 780 SLI (EVGA SC ACX + Giga GHz Ed.)
Storage Kingston HyperX SSD (128) OS, WD RE4 (1TB), RE2 (1TB), Cav. Black (2 x 500GB), Red (4TB)
Display(s) Achieva Shimian QH270-IPSMS (2560x1440) S-IPS
Case NZXT Switch 810
Audio Device(s) onboard Realtek yawn edition
Power Supply Seasonic X-1050
Software Win8.1 Pro
Benchmark Scores 3.5 litres of Pale Ale in 18 minutes.
8.2 TFLOPs double-precision from 100w is pretty phenomenal. Xeon Phi draws a lot more power for only 1 TFLOP double-precision. Fury X is 8.6 single-precision weighing in at, what, 275w? Granted, these numbers will naturally improve with the move to 14-16nm.
PEZY-SC isn't really comparable to Fury X or any 3D consumer graphics card. As I've previously noted, PEZY lacks a 3D graphics pipeline (no rasterization, tessellation, geometry, hull, pixel shading etc.). Stripping out 3D functionality allows for a compute-heavy and shorter pipeline. The GK210 evolution of GK110 very likely points to Nvidia bifurcating their future GPU tech: one line pursuing 3D consumer/workstation graphics, one line devoted to math co-processing. As for Xeon Phi, you're still looking at x86 cores, which are pretty damn big and power hungry in comparison to simple shader module blocks, MIPS cores, and ARM.
 

FordGT90Concept

"I go fast!1!11!1!"
Joined
Oct 13, 2008
Messages
26,259 (4.44/day)
Location
IA, USA
System Name BY-2021
Processor AMD Ryzen 7 5800X (65w eco profile)
Motherboard MSI B550 Gaming Plus
Cooling Scythe Mugen (rev 5)
Memory 2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s) AMD Radeon RX 7900 XT
Storage Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s) Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s) Realtek ALC1150, Micca OriGen+
Power Supply Enermax Platimax 850w
Mouse Nixeus REVEL-X
Keyboard Tesoro Excalibur
Software Windows 10 Home 64-bit
Benchmark Scores Faster than the tortoise; slower than the hare.
Yeah, which is why I mentioned Xeon Phi, which is a lot more similar. Its performance is much lower though. Intel might have to go back to the drawing board and trim the fat from x86 to make it competitive with ARM. x86 has always really neglected FLOPs and focused on specialized instructions.
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
Yeah, which is why I mentioned Xeon Phi, which is a lot more similar. Its performance is much lower though. Intel might have to go back to the drawing board and trim the fat from x86 to make it competitive with ARM. x86 has always really neglected FLOPs and focused on specialized instructions.
Yep, that x86 overhead tax is a bitch. Still hard to see Intel deviating too far from their well beaten track even with programmers complaining about complexities in regard to Xeon Phi's coding in relation to other GPGPU ecosystems.
 
Joined
Aug 30, 2015
Messages
166 (0.05/day)
Location
Copenhagen, Denmark
You are also a fanboy, a fanboy of yourself :p
I'm just trying to add a bit of sanity to the rabid barking of crazed red team/green team jingoism, if you felt it was directed at you...well, it's no fault of mine, is it?

(Most posts in these threads are factual and on point; I'm only aiming at the guys who think red vs green is more important than any other aspect of our hobby/living.)
 
Joined
Sep 15, 2011
Messages
6,761 (1.39/day)
Processor Intel® Core™ i7-13700K
Motherboard Gigabyte Z790 Aorus Elite AX
Cooling Noctua NH-D15
Memory 32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s) ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage 2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s) Acer Predator X34 3440x1440@100Hz G-Sync
Case NZXT PHANTOM410-BK
Audio Device(s) Creative X-Fi Titanium PCIe
Power Supply Corsair 850W
Mouse Logitech Hero G502 SE
Software Windows 11 Pro - 64bit
Benchmark Scores 30FPS in NFS:Rivals
The rats are leaving the sinking ship??
 
Joined
Apr 3, 2012
Messages
4,373 (0.94/day)
Location
St. Paul, MN
System Name Bay2- Lowerbay/ HP 3770/T3500-2+T3500-3+T3500-4/ Opti-Con/Orange/White/Grey
Processor i3 2120's/ i7 3770/ x5670's/ i5 2400/Ryzen 2700/Ryzen 2700/R7 3700x
Motherboard HP UltraSlim's/ HP mid size/ Dell T3500 workstation's/ Dell 390/B450 AorusM/B450 AorusM/B550 AorusM
Cooling All stock coolers/Grey has an H-60
Memory 2GB/ 4GB/ 12 GB 3 chan/ 4GB sammy/T-Force 16GB 3200/XPG 16GB 3000/Ballistic 3600 16GB
Video Card(s) HD2000's/ HD 2000/ 1 MSI GT710,2x MSI R7 240's/ HD4000/ Red Dragon 580/Sapphire 580/Sapphire 580
Storage ?HDD's/ 500 GB-er's/ 500 GB/2.5 Samsung 500GB HDD+WD Black 1TB/ WD Black 500GB M.2/Corsair MP600 M.2
Display(s) 1920x1080/ ViewSonic VX24568 between the rest/1080p TV-Grey
Case HP 8200 UltraSlim's/ HP 8200 mid tower/Dell T3500's/ Dell 390/SilverStone Kublai KL06/NZXT H510 W x2
Audio Device(s) Sonic Master/ onboard's/ Beeper's!
Power Supply 19.5 volt bricks/ Dell PSU/ 525W sumptin/ same/Seasonic 750 80+Gold/EVGA 500 80+/Antec 650 80+Gold
Mouse cheap GigaWire930, CMStorm Havoc + Logitech M510 wireless/iGear usb x2/MX 900 wireless kit 4 Grey
Keyboard Dynex, 2 no name, SYX and a Logitech. All full sized and USB. MX900 kit for Grey
Software Mint 18 Sylvia/ Opti-Con Mint KDE/ T3500's on Kubuntu/HP 3770 is Win 10/Win 10 Pro/Win 10 Pro/Win10
Benchmark Scores World Community Grid is my benchmark!!
Joined
Jul 31, 2014
Messages
481 (0.13/day)
System Name Diablo | Baal | Mephisto | Andariel
Processor i5-3570K@4.4GHz | 2x Xeon X5675 | i7-4710MQ | i7-2640M
Motherboard Asus Sabertooth Z77 | HP DL380 G6 | Dell Precision M4800 | Lenovo Thinkpad X220 Tablet
Cooling Swiftech H220-X | Chassis cooled (6 fans + HS) | dual-fanned heatpipes | small-fanned heatpipe
Memory 32GiB DDR3-1600 CL9 | 96GiB DDR3-1333 ECC RDIMM | 32GiB DDR3L-1866 CL11 | 8GiB DDR3L-1600 CL11
Video Card(s) Dual GTX 670 in SLI | Embedded ATi ES1000 | Quadro K2100M | Intel HD 3000
Storage many, many SSDs and HDDs....
Display(s) 1 Dell U3011 + 2x Dell U2410 | HP iLO2 KVMoIP | 3200x1800 Sharp IGZO | 1366x768 IPS with Wacom pen
Case Corsair Obsidian 550D | HP DL380 G6 Chassis | Dell Precision M4800 | Lenovo Thinkpad X220 Tablet
Audio Device(s) Auzentech X-Fi HomeTheater HD | None | On-board | On-board
Power Supply Corsair AX850 | Dual 750W Redundant PSU (Delta) | Dell 330W+240W (Flextronics) | Lenovo 65W (Delta)
Mouse Logitech G502, Logitech G700s, Logitech G500, Dell optical mouse (emergency backup)
Keyboard 1985 IBM Model F 122-key, Ducky YOTT MX Black, Dell AT101W, 1994 IBM Model M, various integrated
Software FAAAR too much to list
Pascal is coming out and it's more in line with what HSA is working towards. Nvidia wants to see if it can benefit from that going forward.

Nvidia doesn't have an x86 license so it will have to fight an ARMs race. Qualcomm just announced its intentions and PEZY-SC2 is coming.

The main specifications of the "PEZY-SC2" currently being planned are as follows.
  • Manufacturing process: 14-16nm FinFET
  • Die size: 400-500mm²
  • Operating frequency: 1.0GHz
  • Number of proprietary cores: 4,096
  • Computing performance: 8.2TFLOPS (double precision) / 16.4TFLOPS (single precision)
  • Built-in CPU: in addition to debugging and management, newly used for general-purpose computing as well
  • Memory interface 1: 500GB/s (proprietary) x 8 ch (in-package connection)
  • Memory interface 2: HMC or HBM (number of channels undecided)
  • External interface: PCIe Gen3/4 x8 x 6 Port
  • Power consumption: 100W (processor alone, not including in-package stacked DRAM etc.)
The pie is shrinking for everyone.

As interesting as PEZY-SC2 is, to me it looks much more like a Xeon Phi/GPGPU competitor than a general-purpose CPU, given the sheer number of cores and low clock speed. Probably also very nice for server loads, since those tend to be quite parallelized.
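
For what it's worth, the quoted peak figures line up with the usual cores x clock x FLOPs-per-cycle estimate. Here is a quick Python sketch, assuming each core retires 2 double-precision and 4 single-precision FLOPs per cycle. That per-cycle figure is my own inference from the numbers quoted in this thread, not something stated on the spec sheets, and `peak_tflops` is just an illustrative helper name.

```python
def peak_tflops(cores, clock_ghz, flops_per_cycle):
    """Theoretical peak in TFLOPS: cores x clock (GHz) x FLOPs per core per cycle."""
    return cores * clock_ghz * flops_per_cycle / 1000.0

# Assumption (inferred, not from the spec sheets): FLOPs per core per cycle.
DP_PER_CYCLE = 2
SP_PER_CYCLE = 4

# PEZY-SC: 1,024 cores at 733 MHz (spec sheet: 1.5 DP / 3.0 SP TFLOPS).
pezy_sc_dp = peak_tflops(1024, 0.733, DP_PER_CYCLE)
pezy_sc_sp = peak_tflops(1024, 0.733, SP_PER_CYCLE)

# PEZY-SC2 (planned): 4,096 cores at 1.0 GHz (spec: 8.2 DP / 16.4 SP TFLOPS).
pezy_sc2_dp = peak_tflops(4096, 1.0, DP_PER_CYCLE)
pezy_sc2_sp = peak_tflops(4096, 1.0, SP_PER_CYCLE)

# Efficiency at the quoted 100 W (processor only, no stacked DRAM).
sc2_dp_gflops_per_watt = pezy_sc2_dp * 1000.0 / 100.0

print(f"PEZY-SC : {pezy_sc_dp:.2f} DP / {pezy_sc_sp:.2f} SP TFLOPS")
print(f"PEZY-SC2: {pezy_sc2_dp:.2f} DP / {pezy_sc2_sp:.2f} SP TFLOPS")
print(f"PEZY-SC2: {sc2_dp_gflops_per_watt:.1f} DP GFLOPS/W at 100 W")
```

Under that assumption both chips come out within rounding of their quoted peaks, so the jump from SC to SC2 is basically 4x the cores times a ~36% clock bump, nothing more exotic.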

nVidia could go back out and sue Intel for x86 rights. Their original plan was for a dual-ISA architecture with Denver, which led to them buying out Transmeta back in the day pretty much solely for the x86 license. They got close to going to court against Intel years ago, but then Intel paid them A Lot of Money to not ship the x86 ISA enabled. If AMD folded, the FTC would probably push, because let's face it, to really compete with Intel at desktop level, there's basically only nVidia and IBM (if they cut up their POWER8 cores down to 4-core and 2-core modules). Most of the ARM vendors (Qcomm, Samsung, Apple) simply can't get single-core performance to match Sandy Bridge yet, let alone Skylake.


Indeed. Why compete in a tough commodity market (micro-servers) when you can get much, much more lucrative in the automotive market?

Yeah, which is why I mentioned Xeon Phi, which is a lot more similar. Its performance is much lower though. Intel might have to go back to the drawing board and trim the fat from x86 to make it competitive with ARM. x86 has always really neglected FLOPs and focused on specialized instructions.

Yep, that x86 overhead tax is a bitch. Still hard to see Intel deviating too far from their well beaten track even with programmers complaining about complexities in regard to Xeon Phi's coding in relation to other GPGPU ecosystems.

x86 vs ARM vs POWER vs MIPS, RISC vs CISC vs VLIW/EPIC is all academic wankery these days thanks to how pretty much every modern high-performance core using more than about 5W is a superscalar, out-of-order core design: the instructions are decoded into internal microcode anyways, which makes the ISA essentially an irrelevant part of the equation. In effect, for all the fat that x86 has, so do high-performance ARM, POWER and MIPS. If anything, right now x86 is arguably the most scalable architecture to ever be built, ranging from milliwatt (Quark) to hundreds of watts (Xeon Phi).

On the Phi specifically, the current Knights Corner design is very competitive to GK110 and GK210, and Knights Landing looks to be competing head to head with Pascal, though I reserve final judgement for that for after both ship.
 
Joined
Sep 1, 2015
Messages
152 (0.04/day)
We all forgot that NVIDIA is one of the big five in the OpenPOWER Foundation. So they have access to POWER CPUs and core designs. So in fact they have the CPU they need in their hands.
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
On the Phi specifically, the current Knights Corner design is very competitive to GK110 and GK210, and Knights Landing looks to be competing head to head with Pascal, though I reserve final judgement for that for after both ship.
HPC code is tuned (often hand tuned) for specific applications and workload, so the hardware often plays second fiddle to coding and wringing out the best practical performance - which is what my point was - the same point you quoted:
Yep, that x86 overhead tax is a bitch. Still hard to see Intel deviating too far from their well beaten track even with programmers complaining about complexities in regard to Xeon Phi's coding in relation to other GPGPU ecosystems.
If KNC is very competitive with GK110 within the same power envelope (and the results on standard benchmarks are mixed on that, to say the least (#1) (#2); even bringing in a 300W 7120P/X doesn't seem to appreciably swing things in KNC's favour), it makes you wonder why Intel gives away Xeon Phi to grab high-profile contracts, why even MSRP for individuals generally doesn't reflect the actual pricing for the most part, and why it needs major input from Intel (cash and incentives) to get clients to use it.
 
Joined
Sep 7, 2011
Messages
598 (0.12/day)
Location
Pacific Rim
Processor Ryzen 3600
Motherboard B450
Cooling Scythe Ashura
Memory Team Dark Z 3200 8GB x2
Video Card(s) MSI 390
Storage WD 2TB + WD Green 640GB
Display(s) Samsung 40JU6600 @ 200% scaling
Case Coolermaster CM 690 II
Audio Device(s) Fiio E10K, Graham Slee Solo II SRG, Sennheiser HD6XX, AKG K7XX, ATH WS1100is
Power Supply Corsair HX650
Mouse Rival 700
Keyboard Corsair K70, Razer Tarantula
AMD fanboys after reading this news. :p


just copying from
http://www.techpowerup.com/forums/t...ery-to-predecessor.216541/page-2#post-3353355
xD

Seriously, it's probably about money. You work to get money, don't you?
 
Joined
Jul 9, 2015
Messages
3,413 (0.99/day)
System Name M3401 notebook
Processor 5600H
Motherboard NA
Memory 16GB
Video Card(s) 3050
Storage 500GB SSD
Display(s) 14" OLED screen of the laptop
Software Windows 10
Benchmark Scores 3050 scores good 15-20% lower than average, despite ASUS's claims that it has uber cooling.
That is a rad smile.

Sooner than later, we'll find out just how good Zen is.

Isn't it planned in second half of 2016?
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
Isn't it planned in second half of 2016?
Delayed. AMD confirmed as much with the Keller announcement. Zen sampling in 2016, shipping for revenue in 2017.
Jim’s departure is not expected to impact our public product or technology roadmaps, and we remain on track for “Zen” sampling in 2016 with first full year of revenue in 2017.
Pretty much everything processor related got put back (or cancelled in SkyBridge's case), as this roadmap from last year shows


Seattle has yet to really show up, and K12 is now also looking at a 2017 launch
 
Joined
Apr 19, 2011
Messages
2,198 (0.44/day)
Location
So. Cal.
We don't know, but by that picture a "smart follow" at his age (he's no spring chicken) should be looking to retire, not off undertaking a new endeavor.

I might think after 21 years and if he had AMD stock he might just not have the nest-egg to retire on. So a quick flip, that negotiates a healthy salary increase and stock options to hold-out for 4-5 more years (ca-ching). Nvidia gets a strong voice to work with the HSA Foundation and develop their program to move it up to the forefront within the organization.

While it appears to be a loss for AMD, they are well entrenched on the HSA front, and at this point their work isn't going to be impacted, unless he's able to recruit more talent from AMD.

Honestly, I kind of feel sorry for him...
 
Joined
Jul 31, 2014
Messages
481 (0.13/day)
HPC code is tuned (often hand tuned) for specific applications and workload, so the hardware often plays second fiddle to coding and wringing out the best practical performance - which is what my point was - the same point you quoted:

If KNC is very competitive with GK110 within the same power envelope (and the results on standard benchmarks are mixed on that, to say the least (#1) (#2); even bringing in a 300W 7120P/X doesn't seem to appreciably swing things in KNC's favour), it makes you wonder why Intel gives away Xeon Phi to grab high-profile contracts, why even MSRP for individuals generally doesn't reflect the actual pricing for the most part, and why it needs major input from Intel (cash and incentives) to get clients to use it.

That's why I separated my comments on HPC vs more "standard" use-cases. Either way, the "x86 fat" was what I was disputing, since all high-performance architectures have about the same design paradigms nowadays. Sure, x86 being CISC has a huge number of instructions, but realistically, only compiler writers need to care about this, and ARM and POWER have both added in more instructions over the years, while MIPS has exited the high-performance space pretty much completely. As for SPARC, you don't hear anything about it either outside of mainframe (Fujitsu) and Oracle.

For KNC vs GK110, I was looking at Tianhe-2 vs TITAN over at top500 (which tests using LINPACK), where Tianhe-2 has about twice the performance (33.86 PFLOPS vs 17.59 PFLOPS), at the cost of nearly thrice the chip count (48,000 Phis vs 18,688 K20X) and twice the power draw (17.6MW vs 8.2MW). In HPC, space isn't generally a major concern, but cooling and power are, which is why in general FLOPS/W is the better metric to use. Of course, the much larger number of Phi chips makes it harder to get good performance out of them, but this is HPC; it's hard enough at 18k. The real reason why nVidia still mostly owns the market is that a lot of HPC programmers are quite familiar with CUDA already, and/or have an existing CUDA-based codebase, and porting to OpenCL or x86 is a non-trivial exercise.
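
To put rough numbers on the FLOPS/W point, here is a small Python sketch using only the LINPACK, power and chip-count figures above. It deliberately credits all the FLOPS to the coprocessors and ignores the host CPUs in both machines, so treat the per-chip numbers as a crude upper bound rather than a measurement; the function names are just for illustration.

```python
def gflops_per_watt(linpack_pflops, power_mw):
    """System-level efficiency: LINPACK GFLOPS per watt."""
    gflops = linpack_pflops * 1e6   # 1 PFLOPS = 1e6 GFLOPS
    watts = power_mw * 1e6          # 1 MW = 1e6 W
    return gflops / watts

def gflops_per_coprocessor(linpack_pflops, coprocessors):
    """Crude per-chip figure; assumes host CPUs contribute nothing."""
    return linpack_pflops * 1e6 / coprocessors

# Figures as quoted in the post above (top500 LINPACK results).
tianhe2_eff = gflops_per_watt(33.86, 17.6)
titan_eff = gflops_per_watt(17.59, 8.2)

tianhe2_per_phi = gflops_per_coprocessor(33.86, 48_000)
titan_per_k20x = gflops_per_coprocessor(17.59, 18_688)

print(f"Tianhe-2: {tianhe2_eff:.2f} GFLOPS/W, ~{tianhe2_per_phi:.0f} GFLOPS per Phi")
print(f"Titan   : {titan_eff:.2f} GFLOPS/W, ~{titan_per_k20x:.0f} GFLOPS per K20X")
```

So on these figures Titan comes out a bit ahead per watt and per coprocessor, but the two are within roughly 10-35% of each other, which is the sense in which I'd call them competitive.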
 
Joined
Apr 2, 2011
Messages
2,849 (0.57/day)
Good god, this is funny.

Green team says that AMD is going to go under next month. Two people leaving, as high up as these people, is a sign of the apocalypse for AMD.

Red team says it's just two people moving on. Zen will fix everything.




Both sides are wrong. Keller leaving was the end of a contractual obligation. His job was done, so he left. Rogers is a problem. He's leaving for a competitor, which is making inroads into new markets. At the same time as it is bad, it's not like AMD is losing everything with one person. Rogers is jumping to where the best money is, and that isn't really bad for a cash strapped AMD.

AMD isn't doing great. Zen is largely a make it or break it situation for them. If they pull the rabbit out of the hat, they can get back to a single company. If they fail there's going to have to be some sacrificial offering. Keller and Rogers aren't the bellwether for AMD, they're very small (if well known) cogs in a larger plan.

It is fun to see how insane the fanboys are on each side though. I'm waiting for the truly irrational ones to come forward and defend Bulldozer. I wish I had some booze.
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
For KNC vs GK110, I was looking at Tianhe-2 vs TITAN over at top500 (which tests using LINPACK), where Tianhe-2 has about twice the performance (33.86 PFLOPS vs 17.59 PFLOPS), at the cost of nearly thrice the chip count (48,000 Phis vs 18,688 K20X) and twice the power draw (17.6MW vs 8.2MW). In HPC, space isn't generally a major concern, but cooling and power are, which is why in general FLOPS/W is the better metric to use.
The comparison is flawed. Tianhe-2 uses 32,000 Intel Xeon E5-2692s in a 2:3 ratio with Xeon Phi. Titan uses Opteron 6274s in a 1:1 ratio with Tesla K20Xs. Not only do GPGPUs offer better FLOPS and FLOPS/watt than CPUs, the Xeons in Tianhe-2 themselves offer greater actual FLOPS per core/processor than Titan's Opterons (and better FLOPS/watt, even though both the E5-2692 and Opteron 6274 are nominally rated at 115W TDP).

AMD Q3 Earnings....not great, but at least they get a short-term cash injection.
 
Joined
Jul 31, 2014
Messages
481 (0.13/day)
The comparison is flawed. Tianhe-2 uses 32,000 Intel Xeon E5-2692s in a 2:3 ratio with Xeon Phi. Titan uses Opteron 6274s in a 1:1 ratio with Tesla K20Xs. Not only do GPGPUs offer better FLOPS and FLOPS/watt than CPUs, the Xeons in Tianhe-2 themselves offer greater actual FLOPS per core/processor than Titan's Opterons (and better FLOPS/watt, even though both the E5-2692 and Opteron 6274 are nominally rated at 115W TDP).

AMD Q3 Earnings....not great, but at least they get a short-term cash injection.

Oops, missed that catch... Still, compared to the co-processors, they're not that big a deal, since the CPUs do in the 100s of GFLOPS, while both coprocessors break the TFLOP barrier on their own. It's not an insignificant difference (if anything, the Titan gains the edge of CPU aid at its 1:1 ratio), but in the end, not that big a difference. Either way, it doesn't detract from my original comment of the two being comparable/competitive. Phi not being competitive would mean being unable to hit half the K20X's performance at LINPACK, for example, which is clearly not the case.

AMD must be really cash strapped to have to spin off even more :/
 
Joined
Sep 7, 2011
Messages
2,785 (0.57/day)
Location
New Zealand
Oops, missed that catch.. Still, compared to the co-processors, they're not that big a deal, since the CPUs do in the 100s of GFLOPS, while both coprocessors break the TFLOP barrier on their own.
Just to wrap this up, those are theoretical numbers. In reality, in tuned HPC workloads, Xeon Phi hits around 70% of theoretical SGEMM, while Kepler is around 83% (Maxwell is hitting 94-95% as an aside). The problem is that as the workload increases in size, Phi starts losing efficiency, as this whitepaper from Sandia Labs shows, so comparable/competitive is a moving target. Broadly comparable in small workloads rapidly turns worse in larger workloads (Density/Pressure/Coordinate calculations in the table are time to completion; lower is better, obviously). There is a reason that Intel subsidizes Xeon Phi. Some of it is to get a footing in the co-processor industry, but most of it comes down to hit-or-miss performance highly dependent on job size, something Nvidia and AMD don't tend to suffer from (although the latter has support issues).
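
As a rough illustration of what those SGEMM percentages mean in practice, here is a minimal Python sketch. The 1.0 TFLOPS peak is a hypothetical normalised value used for every chip (not a real spec), and 0.945 is simply the midpoint of the 94-95% range quoted above.

```python
def sustained_tflops(peak_tflops, efficiency):
    """Sustained throughput = theoretical peak x achieved fraction of peak."""
    return peak_tflops * efficiency

# Hypothetical: same normalised theoretical peak for every chip.
PEAK_TFLOPS = 1.0

# Achieved fraction of theoretical SGEMM, as quoted in the post above.
SGEMM_EFFICIENCY = {
    "Xeon Phi": 0.70,
    "Kepler": 0.83,
    "Maxwell": 0.945,  # midpoint of the quoted 94-95%
}

sustained = {
    name: sustained_tflops(PEAK_TFLOPS, eff)
    for name, eff in SGEMM_EFFICIENCY.items()
}

# How much more Kepler delivers than Phi at equal theoretical peak.
kepler_vs_phi = sustained["Kepler"] / sustained["Xeon Phi"]

for name, tflops in sustained.items():
    print(f"{name:8s}: {tflops:.3f} TFLOPS sustained per 1.0 TFLOPS peak")
print(f"Kepler delivers {100 * (kepler_vs_phi - 1):.0f}% more than Phi at equal peak")
```

In other words, even before the job-size effect kicks in, equal paper specs would already leave Phi behind Kepler by a high-teens percentage on SGEMM.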



AMD must be really cash strapped to have to spin off even more :/
Inventory write downs and a 23% gross margin will do that.
 
Joined
Jul 31, 2014
Messages
481 (0.13/day)
Just to wrap this up, those are theoretical numbers. In reality, in tuned HPC workloads, Xeon Phi hits around 70% of theoretical SGEMM, while Kepler is around 83% (Maxwell is hitting 94-95% as an aside). The problem is that as the workload increases in size, Phi starts losing efficiency, as this whitepaper from Sandia Labs shows, so comparable/competitive is a moving target. Broadly comparable in small workloads rapidly turns worse in larger workloads (Density/Pressure/Coordinate calculations in the table are time to completion; lower is better, obviously). There is a reason that Intel subsidizes Xeon Phi. Some of it is to get a footing in the co-processor industry, but most of it comes down to hit-or-miss performance highly dependent on job size, something Nvidia and AMD don't tend to suffer from (although the latter has support issues).


Very interesting stuff. Where do you find those? I want in!
 