You, guys, discuss the topic from the wrong side.
Try getting the RISC smartphone performance from the x86-64 chips in the very same power envelope.
How about running a 4K 17-inch office laptop with 1-watt power consumption and a battery that can last 48 hours or more? How about that?
How about making AMD's Ryzen 5 U-APUs running at 2 or 3 watts with decent performance on a 4K 15-inch office notebook?
Show me an ARM SoC that performs decently and consumes even 1W under a meaningful load? Also, the LCD panel of that 17" 4K laptop will be consuming
way more than 1 watt on its own.
Phone ARM SoCs peak at 8-10W and run steady-state at around ~3W (depending on the phone design). The same chips in laptops get a bit more thermal and power headroom. ARM chips idle lower than x86 chips, but then you're talking about high vs. low mW figures, which ... well, it's a difference, but it isn't particularly meaningful. ARM chips are still more efficient in the low end and can run in lower power envelopes in total, but that is in part due to them being designed to be smaller and more efficient in the first place. It'll be extremely interesting to see how Intel's E cores and whatever analogue to this AMD comes up with in the coming years stacks up.
99% of the comparison aren't system build with LPDDR*, they are traditional PC or laptop build by third party.
... did I say that? No. I said:
most laptops today use soldered RAM, whether regular DDR4 or LPDDR4X
So at least do me the courtesy of responding to what I actually said. Also, all PCs are built by third parties. AMD and Intel don't make PCs, whether desktops or laptops. None of that stops them from designing an SoC with on-board memory - it would just force OEMs to adjust their designs.
They say that, but in reality it's more, Let's build a server die and sell the best binned CPU to mobile.
Really?
Intel launched Ice Lake for servers this spring.
Ice Lake in mobile launched in mid-2019. While these aren't the same architecture in every way, there is a reason they share a name, and the server chip was literally two years later than the mobile chip. Also, there is zero overlap between Intel's server and laptop chips, as they use completely different die designs.
You have to perform better to ask for more. See Apple with M1 Pro and Max. if it perform better, people will pay more for ARM design.
On the Apple side that is relatively easy, as they control the OS and much of the software stack, and also built an incredibly robust backwards compatibility system through emulation. ARM, QC or anyone else would need close cooperation with MS to do the same. And so far, these efforts have failed. You are right to a certain point, but it's a chicken-and-egg problem: nobody buys ARM PCs because they underperform; nobody builds high-performance ARM PCs because there's no demand for them. The only solution towards fixing this is a significant R&D investment.
The point is the ARM ISA benefits of being "RISC" loose most of its benefits as the cores grow bigger and go more complex. As core grow bigger and bigger, the ISA become a way smaller factor in overall performance. I do not say that it would impact one more than another. I say ISA become less and less relevant the more you push performance.
Less, sure. But Apple's efficiency advantage still shows that there's a lot of room for movement. Sure, they have a huge die and a node advantage, but they're still matching ~5GHz x86 performance at 3.2GHz and half or less than half the power. Regardless of other factors, that speaks to a highly efficient design. Whether that can be matched by an x86 design? We'll have to wait and see.
Everyone could do a high performance architecture using x86(if they have the license indeed), ARM or RISC V. But getting there is hard.
That statement is literally a contradiction. If everyone could make one, then it would by definition not be hard. The difficulty of doing so is precisely why everyone
isn't doing that.
My point there is ARM got it easy over the last decade as AMD was struggling and Intel was milking the market. If both company were still in a fierce competition, ARM would probably got it way harder. That do not means that Apple would not have been able to get an amazing design out.
You might be right - we'll see. IMO, ARM's main advantage has been the growth of the mobile market, not x86 stagnation, as they mostly haven't been competing in the same markets to begin with.
Well an installer could choose to only install the specific binary. Binary are fairly small. i don't see this as a big deal. People forget to realise that x64 code take double the space of 32 bit code and nobody freaked out with that. And it's also how Apple are doing things these days that their M1 laptop are doing very well.
So you want a dual-ISA CPU with applications that selectively install for one set of cores or the other? What would be the point of that? Does the user have to choose which set of cores they want to install an application for? Will it be automatic? And what about the OS? Seriously, this sounds like a nightmare scenario.
That bring back to the main debate. (let exclude apple from this since it's a closed system), If ARM ecosystem doesn't release design that are matching or outperforming x64 offers, they won't penetrate the desktop/mobile market.
That is somewhat true. At least they need to be competitive, and to deliver adequate performance. Zen1 showed us what kind of effect that can have.
They will also have an hard time in server market. They may have their initiative like Amazon CPU and stuff, but in the end, people will prefer what perform the most.
They're doing decently in the server market currently, both in gaining market share and absolute performance. Look at the
Altra Q80 results.
Qualcomm said in the early 2010s that ARM would be the future of servers, and that went precisely nowhere.
That is very much not true. They had a couple of false starts, but ARM servers are doing quite well these days. Look at the link directly above. Their market share is still tiny, but that's expected, as gaining ground in the server market takes a long, long time.
Turns out, if you dont have decades of software to back you up, you at least need jawdropping performance. Which ARM also couldnt deliver, at x86 performance it was drawing as much or more power. We see similar behaviour from apple's M series, despite having 5nm on their side and gigantic caches and all this vertical integration they are only slightly faster and just as power hungry as zen 2 based 4000 series APUs.
While the M1 Pro and Max can scale to ~100W power draws under heavy combined CPU and GPU loads, I've seen nothign to indicate that they aren't more efficient than 4000-series APUs. Got any concrete data to back up that statement?
At least in ST performance they match the fastest x86 cores while consuming ~10-11W (
absolute highest is 14.5W),
while in MT performance they match or (often drastically) beat 35/45W AMD/Intel mobile chips (note three links in this sentence). With MT loads never exceeding 45W, it's pretty safe to say the M1 series is significantly ahead in efficiency.
Pair ARM with FPGA and use the FPGA hardware to emulate the harder x86 stuff that ARM struggles with. Looking at it further FPGA is what will bridge the gap between x86 and ARM and other chips. FPGA is the programmable fabric if you will that can connect and ties different chip hardware together in various manners. I firmly believe in the future of FPGA tech it's just a matter of more of a stronger programmable hardware and software to go alongside it.
Some of the stuff Xilinx is doing now is interesting it'll help making FPGA's more user friendly and access w/o being a coder and engineer at the same time sort of like GameMaker for FPGA's is the best way to describe things they are starting to do with developer boards building blocks and it's not really to that point yet, but give it time and using FPGA's exactly the way you intend to for a given purpose is going to become easier and simpler to do so. It's like lego blocks with circuits so just imagine the fun in the right hands with the right vision.
FPGAs are nowhere near ASICs in power efficiency though, so if you're using some sort of FPGA-based acceleration for your x86 emulator that is likely going to get quite power hungry. Which rather undermines the proposed ARM advantage in the first place. FPGAs have tons and tons of cool use cases, but wide-scale general consumer applications are not among them - ASICs will always be a better fit there. Also, FPGA-accelerated x86 emulation sounds like a recipe for some rather heavy litigation.