New Intel oneAPI 2023 Tools Maximize Value of Upcoming Intel Hardware

TheLostSwede · Dec 16, 2022

Today, Intel announced the 2023 release of the Intel oneAPI tools - available in the Intel Developer Cloud and rolling out through regular distribution channels. The new oneAPI 2023 tools support the upcoming 4th Gen Intel Xeon Scalable processors, Intel Xeon CPU Max Series and Intel Data Center GPUs, including Flex Series and the new Max Series. The tools deliver performance and productivity enhancements, and also add support for new Codeplay plug-ins that make it easier than ever for developers to write SYCL code for non-Intel GPU architectures. These standards-based tools deliver choice in hardware and ease in developing high-performance applications that run on multiarchitecture systems.

"We're seeing encouraging early application performance results on our development systems using Intel Max Series GPU accelerators - applications built with Intel's oneAPI compilers and libraries. For leadership-class computational science, we value the benefits of code portability from multivendor, multiarchitecture programming standards such as SYCL and Python AI frameworks such as PyTorch, accelerated by Intel libraries. We look forward to the first exascale scientific discoveries from these technologies on the Aurora system next year."
-Timothy Williams, deputy director, Argonne Computational Science Division

What oneAPI Tools Deliver: Intel's 2023 developer tools include a comprehensive set of the latest compilers and libraries, analysis and porting tools, and optimized artificial intelligence (AI) and machine learning frameworks to build high-performance, multiarchitecture applications for CPUs, GPUs and FPGAs, powered by oneAPI. The tools enable developers to quickly meet performance objectives and save time by using a single codebase, allowing more time for innovation.

This new oneAPI tools release helps developers take advantage of the advanced capabilities of Intel hardware:

4th Gen Intel Xeon Scalable and Xeon CPU Max Series processors with Intel Advanced Matrix Extensions (Intel AMX), Intel Quick Assist Technology (Intel QAT), Intel AVX-512, bfloat16 and more.
Intel Data Center GPUs, including Flex Series with hardware-based AV1 encoder, and Max Series GPUs with data type flexibility, Intel Xe Matrix Extensions (Intel XMX), vector engine, Intel Xe Link and other features.

Example benchmarks:

MLPerf DeepCAM deep learning inference and training performance with Xeon Max CPU showed a 3.6x performance gain over Nvidia at 2.4 and AMD as the baseline 1.0 using Intel AMX enabled by the Intel oneAPI Deep Neural Network Library (oneDNN).
LAMMPS (large-scale atomic/molecular massively parallel simulator) workloads running on Xeon Max CPU with kernels offloaded to six Max Series GPUs and optimized by oneAPI tools resulted in an up to 16x performance gain over 3rd Gen Intel Xeon or AMD Milan alone.

Advanced software performance:

Intel Fortran Compiler provides full Fortran language standards support up through Fortran 2018 and expands OpenMP GPU offload support, speeding development of standards-compliant applications.
Intel oneAPI Math Kernel Library (oneMKL) with extended OpenMP offload capability improves portability.
Intel oneAPI Deep Neural Network Library (oneDNN) enables 4th Gen Intel Xeon and Max Series CPU processors' advanced deep learning features including Intel AMX, Intel AVX-512, VNNI and bfloat16.

To boost developer productivity, enriched SYCL support and robust code migration and analysis tools make it easier to develop code for multiarchitecture systems.

The Intel oneAPI DPC++/C++ Compiler adds support for new plug-ins from Codeplay Software for Nvidia and AMD GPUs to simplify writing SYCL code and extend code portability across these processor architectures. This provides a unified build environment with integrated tools for cross-platform productivity. As part of this solution, Intel and Codeplay will offer commercial priority support starting with the oneAPI plug-in for Nvidia GPUs.
CUDA-to-SYCL code migration is now easier with more than 100 CUDA APIs added to the Intel DPC++ Compatibility Tool, which is based on open source SYCLomatic.
Users can identify MPI imbalances at scale with the Intel VTune Profiler.
Intel Advisor adds automated roofline analysis for Intel Data Center GPU Max Series to identify and prioritize memory, cache or compute bottlenecks and causes, with actionable insights for optimizing data-transfer reuse costs of CPU-to-GPU offloading.

Why It Matters: With 48% of developers targeting heterogeneous systems that use more than one kind of processor, more efficient multiarchitecture programming is required to address the increasing scope and scale of real-world workloads. Using oneAPI's open, unified programming model with Intel's standards-based multiarchitecture tools provides freedom of choice in hardware, performance, productivity and code portability for CPUs and accelerators. Code written for proprietary programming models, like CUDA, lacks portability to other hardware, creating a siloed development practice that locks organizations into a closed ecosystem.

About oneAPI Ecosystem Adoption: Continued ecosystem adoption of oneAPI is ongoing with new Centers of Excellence being established. One, the Open Zettascale Lab at the University of Cambridge, is focused on porting significant exascale candidate codes to oneAPI, including CASTEP, FEniCS and AREPO. The center offers courses and workshops with experts teaching oneAPI methodologies and tools for compiling and porting code and optimizing performance. In total, 30 oneAPI Centers of Excellence have been established.

View at TechPowerUp Main Site | Source

eidairaman1 · Dec 16, 2022

Sounds like a Apple

claes · Dec 16, 2022

Sounds like you didn’t read the PR

thestryker6 · Dec 17, 2022

I hope Intel's open approach to this stays, because continued development is in the best interest of everyone doing anything at the datacenter/hpc level. I think we can probably thank nvidia's domination in that sphere for Intel taking this approach so far.

R-T-B · Dec 17, 2022

claes said:
Sounds like you didn’t read the PR

He didn't get past "Intel."

Solaris17 · Dec 17, 2022

Good for them. Looks like their toolkits will be less fragmented which is good for people in the intel ecosystem already. Bonus points for adding the extra AMD nvidia and what appears to be arm or atleast ASIC support.

kapone32 · Dec 18, 2022

This is good. It seems Intel has learned from AMD and now that we have both of them giving software access to their hardware on an Open basis. We should start to see software and Games more take advantage of the hardware available to the consumer today. That is nice to see. The 5950X is a great CPU and is cool when Gaming because you are only using at the most 15% of the CPU. Too bad it is frowned upon to burn DVDs but I am sure that would be killer on these chips. Whether it is Intel 10th Gen to 13th or AMD Ryzen 2nd Gen to 5th Gen.

System Name	Overlord Mk MLI
Processor	AMD Ryzen 7 7800X3D
Motherboard	Gigabyte X670E Aorus Master
Cooling	Noctua NH-D15 SE with offsets
Memory	32GB Team T-Create Expert DDR5 6000 MHz @ CL30-34-34-68
Video Card(s)	Gainward GeForce RTX 4080 Phantom GS
Storage	1TB Solidigm P44 Pro, 2 TB Corsair MP600 Pro, 2TB Kingston KC3000
Display(s)	Acer XV272K LVbmiipruzx 4K@160Hz
Case	Fractal Design Torrent Compact
Audio Device(s)	Corsair Virtuoso SE
Power Supply	be quiet! Pure Power 12 M 850 W
Mouse	Logitech G502 Lightspeed
Keyboard	Corsair K70 Max
Software	Windows 10 Pro
Benchmark Scores	https://valid.x86.fr/yfsd9w

System Name	PCGOD
Processor	AMD FX 8350@ 5.0GHz
Motherboard	Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling	Scythe Ashura, 2×BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory	16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s)	AMD Radeon 290 Sapphire Vapor-X
Storage	Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s)	NEC Multisync LCD 1700V (Display Port Adapter)
Case	AeroCool Xpredator Evil Blue Edition
Audio Device(s)	Creative Labs Sound Blaster ZxR
Power Supply	Seasonic 1250 XM2 Series (XP3)
Mouse	Roccat Kone XTD
Keyboard	Roccat Ryos MK Pro
Software	Windows 7 Pro 64

System Name	boomer--->zoomer not your typical millenial build
Processor	i5-760 @ 3.8ghz + turbo ~goes wayyyyyyyyy fast cuz turboooooz~
Motherboard	P55-GD80 ~best motherboard ever designed~
Cooling	NH-D15 ~double stack thot twerk all day~
Memory	16GB Crucial Ballistix LP ~memory gone AWOL~
Video Card(s)	MSI GTX 970 ~~GOLDEN EDITION~~ RAWRRRRRR
Storage	500GB Samsung 850 Evo (OS X, *nix), 128GB Samsung 840 Pro (W10 Pro), 1TB SpinPoint F3 ~best in class
Display(s)	ASUS VW246H ~best 24" you've seen FULL HD 1O80PP SLAPS~
Case	FT02-W ~the W stands for white but it's brushed aluminum except for the disgusting ODD bays; cries
Audio Device(s)	A LOT
Power Supply	850W EVGA SuperNova G2 ~hot fire like champagne~
Mouse	CM Spawn ~cmcz R c00l seth mcfarlane darawss~
Keyboard	CM QF Rapid - Browns ~fastrrr kees for fstr teens~
Software	integrated into the chassis
Benchmark Scores	9999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999999

Processor	265K (running stock until more Intel updates land)
Motherboard	MPG Z890 Carbon WIFI
Cooling	Peerless Assassin 140
Memory	48GB DDR5-7200 CL34
Video Card(s)	RTX 3080 12GB FTW3 Ultra Hybrid
Storage	1.5TB 905P and 2x 2TB P44 Pro
Display(s)	CU34G2X and Ea244wmi
Case	Dark Base 901
Audio Device(s)	Sound Blaster X4
Power Supply	Toughpower PF3 850
Mouse	G502 HERO/G700s
Keyboard	Ducky One 3 Pro Nazca

System Name	Pioneer
Processor	Ryzen 9 9950X
Motherboard	GIGABYTE Aorus Elite X670 AX
Cooling	Noctua NH-D15 + A whole lotta Sunon, Phanteks and Corsair Maglev blower fans...
Memory	64GB (2x 32GB) G.Skill Flare X5 @ DDR5-6000 CL30
Video Card(s)	XFX RX 7900 XTX Speedster Merc 310
Storage	Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs
Display(s)	55" LG 55" B9 OLED 4K Display
Case	Thermaltake Core X31
Audio Device(s)	TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply	FSP Hydro Ti Pro 850W
Mouse	Logitech G305 Lightspeed Wireless
Keyboard	WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software	Gentoo Linux x64 / Windows 11 Enterprise IoT 2024

New Intel oneAPI 2023 Tools Maximize Value of Upcoming Intel Hardware

TheLostSwede

News Editor

eidairaman1

The Exiled Airman

claes

thestryker6

R-T-B

Solaris17

Super Dainty Moderator

kapone32

System Name	RogueOne
Processor	Xeon W9-3495x
Motherboard	ASUS w790E Sage SE
Cooling	SilverStone XE360-4677
Memory	128gb Gskill Zeta R5 DDR5 RDIMMs
Video Card(s)	MSI SUPRIM Liquid X 4090
Storage	1x 2TB WD SN850X \| 2x 8TB GAMMIX S70
Display(s)	49" Philips Evnia OLED (49M2C8900)
Case	Thermaltake Core P3 Pro Snow
Audio Device(s)	Moondrop S8's on schitt Gunnr
Power Supply	Seasonic Prime TX-1600
Mouse	Razer Viper mini signature edition (mercury white)
Keyboard	Monsgeek M3 Lavender, Moondrop Luna lights
VR HMD	Quest 3
Software	Windows 11 Pro Workstation
Benchmark Scores	I dont have time for that.

System Name	Best AMD Computer
Processor	AMD 7900X3D
Motherboard	Asus X670E E Strix
Cooling	In Win SR36
Memory	GSKILL DDR5 32GB 5200 30
Video Card(s)	Sapphire Pulse 7900XT (Watercooled)
Storage	Corsair MP 700, Seagate 530 2Tb, Adata SX8200 2TBx2, Kingston 2 TBx2, Micron 8 TB, WD AN 1500
Display(s)	GIGABYTE FV43U
Case	Corsair 7000D Airflow
Audio Device(s)	Corsair Void Pro, Logitch Z523 5.1
Power Supply	Deepcool 1000M
Mouse	Logitech g7 gaming mouse
Keyboard	Logitech G510
Software	Windows 11 Pro 64 Steam. GOG, Uplay, Origin
Benchmark Scores	Firestrike: 46183 Time Spy: 25121