Friday, August 18th 2023
Intel Graphics Announces DirectX 11 Performance Uplifts and Frame-time Reductions
Intel Graphics today announced the Q3-2023 major update of its Arc GPU Graphics drivers, which will be released shortly. The latest driver promises to be a transformative update recommended for all Intel GPU users. The company says that it has re-architected several under-the-hood components of the drivers to make A-series GPUs significantly faster. The company also put in engineering effort to reduce frame-times, and introduce a new way of measuring the GPU's contribution to it; so users can figure out whether they are in a CPU-limited scenario, or a GPU-limited one. Lastly, the company updated its PresentMonitor utility with a new front-end interface.
Intel Arc "Alchemist" is a ground up discrete GPU graphics architecture that was designed mainly for DirectX 12 and Vulkan, but over time, relied on API translation for DirectX 9 games. With its Spring driver updates, the company had released a major update that uplifted DirectX 9 game performance by 43% on average. This was because even though API translation was being used for DirectX 9 games, there was broad scope for per-game optimization, and DirectX 9 remains a relevant API for several current e-sports titles. With today's release, Intel promises a similar round of performance updates, with as much as 19% performance uplifts to be had in DirectX 11 titles at 1080p, measured with an A750 on a Core i5-13400F based machine. These gains are averaged to +12% on the fastest i9-13900K processor. The logic being that the slower processor benefits greater from the changes Intel made to its DirectX 11 driver.Intel has developed a new performance metric called "GPU Busy." Put simply, this is the time taken by the GPU alone to process an API call from the CPU. Game rendering is a collaborative workload between the CPU and GPU. For the generation of each frame in a game, the CPU has to tally the game-state with what needs to be displayed on the screen; organize this information into an API call, and send it to the GPU, which then interprets the API call and draws a frame.Every time the CPU's end of calculations for a frame is done, it puts out a "present" call to the GPU driver. As the GPU is rendering the frame, the CPU thread responsible for the frame is essentially idle, until the GPU can post a "present return" state back to the CPU, so it can begin work on the next frame. The time difference between two presents is basically frame-time (the time it takes for you machine to generate a frame). After the generation of a frame by the GPU, it is pushed to the frame-buffer, and onward to the display controller and the display. Intel figured out a way to break down frame-time further into the GPU's specific contribution toward it, which the company calls GPU Busy.With the new GPU Busy counter, the company is able to show just how much the new latest drivers contribute to minimizing the frame-time. To do so, the company first showed us a frame-time graph of "Overwatch 2" with the launch driver, showing a wild amount of jitter. It then showed the GPU-Busy contribution to it. Since GPU Busy is a subset of frame-time, on a time-scale, it is a lower value. Every time there is a large gap between the GPU Busy value and the overall frame-time, you experience a CPU-limited scenario, whereas in regions of the graph with finer gaps between the two values, you are either GPU-limited, or balanced.
With that out of the way, the company showed us the frame-time graph of its latest driver, which shows significantly lower frame-times, and much less jitter. When a Core i5-13400F-powered machine is overlaid with an A750 using the latest driver, you notice that not only is the overall frame-time lower and jitter suppressed, but also the GPU Busy time is reduced. There's a greater coherence between the CPU and GPU performance. Overall, Intel's effort isn't directed toward improving frame-rates (performance), but also "smoothness" (reduced frame time jitter).
The GPU Busy metric can be measured using the latest version of PresentMon, which the company is releasing as its own standalone overlay application. You can read all about it here.
The complete slide-deck from Intel follows.
Intel Arc "Alchemist" is a ground up discrete GPU graphics architecture that was designed mainly for DirectX 12 and Vulkan, but over time, relied on API translation for DirectX 9 games. With its Spring driver updates, the company had released a major update that uplifted DirectX 9 game performance by 43% on average. This was because even though API translation was being used for DirectX 9 games, there was broad scope for per-game optimization, and DirectX 9 remains a relevant API for several current e-sports titles. With today's release, Intel promises a similar round of performance updates, with as much as 19% performance uplifts to be had in DirectX 11 titles at 1080p, measured with an A750 on a Core i5-13400F based machine. These gains are averaged to +12% on the fastest i9-13900K processor. The logic being that the slower processor benefits greater from the changes Intel made to its DirectX 11 driver.Intel has developed a new performance metric called "GPU Busy." Put simply, this is the time taken by the GPU alone to process an API call from the CPU. Game rendering is a collaborative workload between the CPU and GPU. For the generation of each frame in a game, the CPU has to tally the game-state with what needs to be displayed on the screen; organize this information into an API call, and send it to the GPU, which then interprets the API call and draws a frame.Every time the CPU's end of calculations for a frame is done, it puts out a "present" call to the GPU driver. As the GPU is rendering the frame, the CPU thread responsible for the frame is essentially idle, until the GPU can post a "present return" state back to the CPU, so it can begin work on the next frame. The time difference between two presents is basically frame-time (the time it takes for you machine to generate a frame). After the generation of a frame by the GPU, it is pushed to the frame-buffer, and onward to the display controller and the display. Intel figured out a way to break down frame-time further into the GPU's specific contribution toward it, which the company calls GPU Busy.With the new GPU Busy counter, the company is able to show just how much the new latest drivers contribute to minimizing the frame-time. To do so, the company first showed us a frame-time graph of "Overwatch 2" with the launch driver, showing a wild amount of jitter. It then showed the GPU-Busy contribution to it. Since GPU Busy is a subset of frame-time, on a time-scale, it is a lower value. Every time there is a large gap between the GPU Busy value and the overall frame-time, you experience a CPU-limited scenario, whereas in regions of the graph with finer gaps between the two values, you are either GPU-limited, or balanced.
With that out of the way, the company showed us the frame-time graph of its latest driver, which shows significantly lower frame-times, and much less jitter. When a Core i5-13400F-powered machine is overlaid with an A750 using the latest driver, you notice that not only is the overall frame-time lower and jitter suppressed, but also the GPU Busy time is reduced. There's a greater coherence between the CPU and GPU performance. Overall, Intel's effort isn't directed toward improving frame-rates (performance), but also "smoothness" (reduced frame time jitter).
The GPU Busy metric can be measured using the latest version of PresentMon, which the company is releasing as its own standalone overlay application. You can read all about it here.
The complete slide-deck from Intel follows.
25 Comments on Intel Graphics Announces DirectX 11 Performance Uplifts and Frame-time Reductions
If you use the new beta DX12 renderer, it can maintain 200+ FPS at 3440x1440p with the same graphical settings (on a A770 that I’ve tested).
EDIT: I tested it on a 5800X/DDR4-3733/B550 system.
I only have older games and the BL3 benchmark was kind of a turn off.
www.techpowerup.com/review/intel-arc-a750/8.html
This is from the launch, not new, these uplifts are old.
GPU Busy metric seems like a good way to see know whether the bottleneck come from GPU or CPU+RAM
most the people don't understand you can't get prefect drivers from a lab.
Nonetheless, well done Intel on improving the drivers.
From what I understand, they are targeting RTX 4070 performance. That would be over 90% of the market. It is a 16gb card, performs about as well as every other card at it's price range, and has a year's worth of driver updates before the next generation hits the shelves.
In all seriousness, what do you think you can get for $350USD