Monday, October 19th 2020
Intel "Tiger Lake" Based Pentium and Celeron to Feature AVX2, an Instruction the Entry-Level Brands were Deprived Of
Intel's next-generation Pentium Gold and Celeron entry-level processors based on the "Tiger Lake" microarchitecture could finally receive the AVX2 instruction set. Intel had segmented AVX and AVX2 to be exclusive to the Core and Xeon brands, leaving the Pentium Gold and Celeron products based on the same microarchitectures to artificially lack these instructions.
Intel updated its ARK product information database with entries for "Tiger Lake" based Pentium Gold and Celeron products. The pages for the Pentium Gold 7505 and Celeron 6305 mention support for AVX2 alongside SSE4. Both are mobile chips with a 15 W TDP, and are built on the same 10 nm SuperFin process as the rest of the 11th Gen Core "Tiger Lake" processor family.
39 Comments on Intel "Tiger Lake" Based Pentium and Celeron to Feature AVX2, an Instruction the Entry-Level Brands were Deprived Of
FMA alone can produce some very nice 40% uplifts, for example.
And just for the record, your username made this very hilarious :laugh:
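To give a feel for where that kind of FMA uplift comes from, here is a minimal sketch (my own code, not from the post) of a dot product written with AVX2/FMA intrinsics: a single fused multiply-add replaces a separate multiply and add in the hot loop. Build with something like gcc -O2 -mavx2 -mfma.

```c
#include <immintrin.h>
#include <stdio.h>

/* Dot product of two float arrays; n is assumed to be a multiple of 8. */
static float dot_fma(const float *a, const float *b, int n)
{
    __m256 acc = _mm256_setzero_ps();
    for (int i = 0; i < n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);
        __m256 vb = _mm256_loadu_ps(b + i);
        /* One fused multiply-add instead of a separate multiply and add,
         * roughly halving the arithmetic instruction count of the loop. */
        acc = _mm256_fmadd_ps(va, vb, acc);
    }
    /* Horizontal sum of the 8 partial sums. */
    float tmp[8];
    _mm256_storeu_ps(tmp, acc);
    float sum = 0.0f;
    for (int i = 0; i < 8; i++)
        sum += tmp[i];
    return sum;
}

int main(void)
{
    float a[16], b[16];
    for (int i = 0; i < 16; i++) { a[i] = (float)i; b[i] = 2.0f; }
    printf("%f\n", dot_fma(a, b, 16));  /* expect 240.0 */
    return 0;
}
```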
But at the instruction level it also changes the encoding to allow much more advanced operations on data sets, which is where the true power of AVX-512 lies, beyond just being a "double AVX2". AVX-512 is getting close to being a "sub instruction set" of x86.
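As one concrete illustration of "more than double AVX2" (my own sketch, not from the comment), AVX-512 adds per-lane mask registers, so a conditional update can be expressed as one masked instruction instead of a compare-and-blend sequence. This assumes an AVX-512F capable CPU and a flag like -mavx512f.

```c
#include <immintrin.h>
#include <stdio.h>

int main(void)
{
    float a[16] = { -1, 2, -3, 4, -5, 6, -7, 8, -9, 10, -11, 12, -13, 14, -15, 16 };
    float b[16];
    for (int i = 0; i < 16; i++) b[i] = 100.0f;

    __m512 va = _mm512_loadu_ps(a);
    __m512 vb = _mm512_loadu_ps(b);

    /* Build a 16-bit mask with one bit per lane where a > 0. */
    __mmask16 m = _mm512_cmp_ps_mask(va, _mm512_setzero_ps(), _CMP_GT_OQ);

    /* Masked add: lanes whose mask bit is 0 keep the value from 'va'. */
    __m512 r = _mm512_mask_add_ps(va, m, va, vb);

    float out[16];
    _mm512_storeu_ps(out, r);
    for (int i = 0; i < 16; i++)
        printf("%g ", out[i]);   /* negative lanes unchanged, positive lanes +100 */
    printf("\n");
    return 0;
}
```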
The challenge with all versions of AVX is that they are hard to use well; it takes expert-level programmers to extract substantial performance gains. The good news is that just enabling automatic optimizations usually gives ~10-30% performance "for free" (probably >50% with some minor effort), since the compiler can auto-vectorize and unroll some things, but to get that >10x performance gain it still requires handcrafted low-level code. I believe compilers have some potential to improve here, but ultimately they can only deal with the code written by the programmer. Some of these feature sets are mostly relevant to enterprise users, like those "AI" features.
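As a rough illustration of the "free" auto-vectorization case (a sketch under my own assumptions, not the poster's code), a plain scalar loop like the one below is typically vectorized by GCC or Clang at -O3 with -mavx2; the restrict qualifiers help the compiler prove the arrays don't alias.

```c
#include <stdio.h>
#include <stddef.h>

/* Plain scalar code; no SIMD knowledge needed from the programmer. The
 * 'restrict' qualifiers promise the arrays don't overlap, which is often
 * what unlocks vectorization. */
static void saxpy(size_t n, float a, const float *restrict x, float *restrict y)
{
    for (size_t i = 0; i < n; i++)
        y[i] = a * x[i] + y[i];
}

int main(void)
{
    enum { N = 1024 };
    static float x[N], y[N];
    for (int i = 0; i < N; i++) { x[i] = (float)i; y[i] = 1.0f; }
    saxpy(N, 2.0f, x, y);
    printf("%f\n", y[N - 1]);  /* 2*1023 + 1 = 2047 */
    return 0;
}
```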
The good thing about having feature sets is that it makes it easier for e.g. AMD to implement the relevant features for consumers.
Intrinsics are still useful for a few applications, but it's far harder to use intrinsics than to use a dedicated language like ISPC: ispc.github.io/
If that's still too much to ask for, then "#pragma omp simd" is the next recommendation. Works in C, C++, and Fortran on a variety of compilers (like GCC and LLVM). A shame about Microsoft Visual Studio... you can't win them all.
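A minimal sketch of that suggestion (array names and sizes are mine): the pragma asks the compiler to vectorize the loop that immediately follows it, and compiles with e.g. gcc -O2 -fopenmp-simd.

```c
#include <stdio.h>

#define N 1024

int main(void)
{
    static float x[N];
    for (int i = 0; i < N; i++) x[i] = 1.0f;

    float sum = 0.0f;
    /* The reduction clause tells the compiler it may reorder the
     * additions into SIMD lanes. */
    #pragma omp simd reduction(+:sum)
    for (int i = 0; i < N; i++)
        sum += x[i];

    printf("%f\n", sum);  /* 1024.0 */
    return 0;
}
```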
Skylake-SP, Cascade Lake & Cooper Lake are all niche workstation and 8-way multiprocessing server products. Not really relevant for desktop software.
That leaves Skylake-X, Ice Lake and Tiger Lake. Again, Skylake-X was a niche product.
Whatever's left for actual desktop usage isn't a mess.
Edit:
I forgot to add that, in any case, all AVX-512-capable CPUs support AVX-512F (AVX-512 Foundation). If you program for AVX-512, you can always rely on AVX-512F instructions and check for more (required anyway because of fallback code).
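A minimal sketch of that dispatch idea, assuming GCC or Clang (the kernel/fallback function names are hypothetical): treat AVX-512F as the baseline, probe optional extensions at run time, and fall back otherwise.

```c
#include <stdio.h>

/* Hypothetical code paths; in real code these would be built with the
 * matching -m flags or target attributes. */
void run_avx512_kernel(void)   { puts("using AVX-512F code path"); }
void run_avx2_fallback(void)   { puts("using AVX2 fallback"); }
void run_scalar_fallback(void) { puts("using scalar fallback"); }

int main(void)
{
    if (__builtin_cpu_supports("avx512f")) {
        /* AVX-512F is present on every AVX-512 CPU; the extensions are optional. */
        if (__builtin_cpu_supports("avx512bw"))
            puts("AVX-512BW also available");
        run_avx512_kernel();
    } else if (__builtin_cpu_supports("avx2")) {
        run_avx2_fallback();
    } else {
        run_scalar_fallback();
    }
    return 0;
}
```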