Sparkle Arc B580 Titan OC Review 22

Sparkle Arc B580 Titan OC Review

(22 Comments) »

Introduction

Sparkle Logo

We have with us the Sparkle Arc B580 Titan OC graphics card, powered by the new B580 Battlemage GPU that's been making waves since its reveal earlier this month. Sparkle is a fairly new entrant to the Intel Arc graphics card ecosystem, but has been around for decades, notably as a former NVIDIA GeForce board partner. Now they are owned by TUL—the parent company of PowerColor. The Titan OC is the company's premium custom-design based on the B580, and pairs the GPU with a 31.5 cm-long triple-fan cooling solution that's a little over two slots thick. The Arc B580 marks Intel's second generation of the Xe gaming graphics architecture as discrete GPUs. These are modern, fulfill the DirectX 12 Ultimate API standards, and include a wide range of gaming experience improvements within the XeSS 2 feature suite.



Xe2 Battlemage succeeds the original Xe Alchemist. It debuted earlier this year as the architecture driving the iGPU of Intel's Core Ultra 200V Lunar Lake mobile processors, although that variant was a "lite" implementation of Xe2. On the Arc B580, you can expect a more complete set of the Xe2 Battlemage IP, with the GPU being geared for performance-segment gaming. The B580 logically succeeds the Arc A580, but presents a massive generational performance step-up. Intel claims (and we're confirmed in our review of the reference design), that the B580 outperforms every GPU from the Alchemist generation, including the top A770, and since the A770 was marketed as a 1440p-class GPU, this use case carries on for the B580.

In all, Intel claims a 70% gain in the SIMD performance of its 2nd Gen Xe cores over Alchemist, and a 50% gain in performance-per-watt, due in part to the new TSMC 5 nm EUV foundry node these chips are being built on. This is thanks to the significant IPC gain of the 2nd Gen Xe core, a new Ray Tracing Unit with anywhere between 50% to 100% generational gains in performance, a large 18 MB on-die last-level cache, and a faster memory sub-system than the A580, besides double the memory size, at 12 GB.

The B580 is being pitted by Intel against the NVIDIA GeForce RTX 4060. If you recall, the previous generation flagship parts, the A770 and A750, were compared by Intel to the RTX 3060, and the A580 was awkwardly positioned against the RTX 3050. In this sense, Intel has made a bigger generational leap in performance than NVIDIA, and we can only hope that Intel scales out the Xe2 architecture for even larger GPUs.

The Arc B580 is based on the 5 nm BMG-G21 silicon, and features 20 Xe cores, or 128 execution units, worth 2,560 unified shaders. There are also 20 Ray Tracing Units, and 160 XMX units, which accelerate AI. This is backed by a solid raster graphics backend, consisting of 80 ROPs, and 160 TMUs. The chip gets 12 GB of 19 Gbps GDDR6 memory across a 192-bit wide memory bus, which is both 50% larger and faster than the memory implementation of the RTX 4060 and the AMD Radeon RX 7600.

Perhaps the biggest aspect of the Arc B580 is its starting price of $250, which undercuts the RTX 4060 and the RX 7600 XT by at least $50. The Sparkle Arc B580 Titan OC looks like it's from a segment above, and yet, is being offered at just a $20 premium over this. The card is 31.5 cm in length and comes with a factory overclocked speed of 2740 MHz, compared to the 2670 MHz reference speed. It also comes with a slight increase in power limits to 200 W, from 190 W reference.

Intel Arc B580 Market Segment Analysis
 PriceCoresROPsCore
Clock
Boost
Clock
Memory
Clock
GPUTransistorsMemory
RX 6500 XT$1401024322685 MHz2825 MHz2248 MHzNavi 245400M4 GB, GDDR6, 64-bit
Arc A580$1803072961700 MHzN/A2000 MHzACM-G1021700M8 GB, GDDR6, 256-bit
RTX 3050$1652560321552 MHz1777 MHz1750 MHzGA10612000M8 GB, GDDR6, 128-bit
Arc A750$22035841122050 MHzN/A2000 MHzACM-G1021700M8 GB, GDDR6, 256-bit
RX 6600 XT$2052048642359 MHz2589 MHz2000 MHzNavi 2311060M8 GB, GDDR6, 128-bit
RTX 3060$2203584481320 MHz1777 MHz1875 MHzGA10612000M12 GB, GDDR6, 192-bit
RX 7600$2502048642250 MHz2625 MHz2250 MHzNavi 3313300M8 GB, GDDR6, 128-bit
RX 7600 XT$3102048642470 MHz2755 MHz2250 MHzNavi 3313300M16 GB, GDDR6, 128-bit
RTX 4060$2853072481830 MHz2460 MHz2125 MHzAD10718900M8 GB, GDDR6, 128-bit
Arc A770$25040961282100 MHzN/A2187 MHzACM-G1021700M16 GB, GDDR6, 256-bit
Arc B580$2502560802670 MHzN/A2375 MHzBMG-G2119600M12 GB, GDDR6, 192-bit
Sparkle Arc B580 Titan OC$2702560802740 MHzN/A2375 MHzBMG-G2119600M12 GB, GDDR6, 192-bit
RTX 3060 Ti$3004864801410 MHz1665 MHz1750 MHzGA10417400M8 GB, GDDR6, 256-bit
RTX 4060 Ti$3804352482310 MHz2535 MHz2250 MHzAD10622900M8 GB, GDDR6, 128-bit
RX 6700 XT$350
2560642424 MHz2581 MHz2000 MHzNavi 2217200M12 GB, GDDR6, 192-bit
RTX 3070$3205888961500 MHz1725 MHz1750 MHzGA10417400M8 GB, GDDR6, 256-bit
RTX 3070 Ti$3706144961575 MHz1770 MHz1188 MHzGA10417400M8 GB, GDDR6X, 256-bit
RX 6800$3403840961815 MHz2105 MHz2000 MHzNavi 2126800M16 GB, GDDR6, 256-bit
RX 7700 XT$3703456962171 MHz2544 MHz2250 MHzNavi 3226500M12 GB, GDDR6, 192-bit
RX 6800 XT$40046081282015 MHz2250 MHz2000 MHzNavi 2126800M16 GB, GDDR6, 256-bit

Xe2 Battlemage Architecture


The Arc B580 debuts Intel's second discrete gaming GPU architecture, codenamed Xe2 "Battlemage." A variant of Xe2 is used in the integrated graphics solution of Intel's Core Ultra 200V "Lunar Lake" mobile processors, the one today is its fully-fledged discrete gaming GPU version, with all hardware components enabled. The Arc B580 is a mid-range model based on the BMG-G21 silicon. The B580 likely does not max-out the silicon, since it is meant to succeed the mid-range A580 "Alchemist." We asked, but Intel would not comment on future products. The BMG-G21 is built on the TSMC N5 (5 nm EUV) foundry node, and packs 19.6 billion transistors across a 272 mm² monolithic die. The 5 nm node is contemporary, given that both NVIDIA "Ada" and AMD RDNA 3 gaming GPUs use it.

The BMG-G21 GPU features a PCI-Express 4.0 x8 host interface on the Arc B580 and B570. It is configured with a 192-bit GDDR6 memory bus on the B580, and a 160-bit bus on the B570. The GPU is organized in a very similar manner to modern GPUs from NVIDIA and AMD—a Global Dispatch processor distributes work among the five Render Slices, which talk to each other over the GPU's fabric and memory sub-system. The GPU's internal last-level cache is 18 MB in size. Besides the five Render Slices, there is the Media Engine, consisting of two MFX (multi-format x-coders (encoders/decoders); there are two sets of hardware encoders and decoders. Then there's the GDDR6 memory controller and the Display Engine, with four display interfaces.


Intel is claiming a 70% generational increase in performance per Xe Core, the indivisible number-crunching subunit of the GPU; and a 50% generational performance-per-watt increase. The above graphs illustrate the contribution of the individual sub-systems of the Xe2 Battlemage architecture toward these improvements; and how this plays out in a frametime analysis example of a real-world use case.


The Render Slice diagram (above) highlights the biggest chunk of the generational performance increase by Intel. It's thanks to increased IPC from the Xe Core, a more specialized and capable Ray Tracing Unit, a 300% faster Geometry engine, faster Sampler, 50% increase in HiZ, Z, stencil caches, and increases in performance of the pixel backends. Intel's engineering goal has been to reduce latency wherever it can, and reduce software (CPU) overhead as much as it can. The new 2nd Gen Xe Core features eight 512-bit vector engines, with SIMD16-native ALUs, and many more data formats. Rather than two sets of FP and INT units per vector engine, there is just one set of each per vector engine in Xe2, with larger numbers of ALUs.


Intel introduced its second gen Ray Tracing Unit, with massive generational improvements in performance and capability. It introduces a third Traversal Pipeline, which yields a 50% increase in box intersection performance. A second triangle intersection unit has been added to double the performance of triangle intersections. The BVH cache has doubled in size to 16 KB.

XeSS 2, Frame Generation, and Low Latency


Intel has codified the original XeSS as XeSS Super Resolution (XeSS-SR), as that's what it originally was—a performance enhancement that relies on super-resolution technology. The XeSS-SR SDK gets a new compute dispatcher backend for popular APIs—DirectX 11, DirectX 12, and Vulkan. There are two XeSS-SR models, the regular one, and a XeSS-SR Lite model for GPUs that lack XMX matrix acceleration capability.

XeSS 2 isn't a single technology, or an improvement over XeSS-SR, but a collection of three technologies—the existing XeSS-SR, which deals with performance; the new XeSS Frame Generation (XeSS-FG) technology, which nearly doubles frame rates based on intelligent frame doubling; and the new Xe Low Latency (XeLL) technology, which works to reduce the latency cost of SR and FG, but is something that can be used as a standalone whole-system latency technology, too.


XeSS-FG can either be implemented at native resolution, or in conjunction with XeSS-SR, where it is located right after the XeSS-SR step in the rendering queue. It relies on motion vectors, depth data, temporal frame data, and optical flow reprojection, to create interpolated frames that are then interleaved with the output frames, to effectively double the framerate. The interpolated image is then passed along to the next stage, where the HUD/UI is added at native resolution, and pushed to the frame buffer for output.


The SR + FG passes contribute to frame latency, and so, just as NVIDIA uses Reflex to counteract this latency, Intel innovated XeLL. The technology intelligently compacts the rendering queue to reduce the time it takes for an input to register as motion on-screen. XeLL remains enabled in all workloads the use XeSS-FG, but it can be used as a standalone feature, too. There's also an implicit driver-based low-latency mode that does this without a game having an explicit XeSS 2 or XeLL implementation.


Intel has updated its software package significantly. The new "Intel Graphics Software" replaces the "Arc Control" utility, and gives you a cleaner user interface. There are many new settings related to the display, including display scaling model/method/quantization range; 3D graphics settings, including a driver-based FPS limiter, the driver-based low-latency mode; and the exhaustive new Performance and Overclocking controls, which include the ability to set frequency offsets, tinker with the V/F curve, power limits, and GPU and memory clocks. It also integrates Intel's PresentMon metrics.

Packaging

Package Front
Package Back


The Card

Graphics Card Front
Graphics Card Back
Graphics Card Front Angled

Sparkle's B580 Titan OC comes with a fantastic-looking blue color theme—I really love how the various shades of blue come together. On the back you get a high-quality metal backplate with a cutout for air to flow through.

Graphics Card Dimensions

Dimensions of the card are 31.5 x 12.0 cm, and it weighs 952 g.

Graphics Card Height
Graphics Card Back Angled

Installation requires two slots in your system. We measured the card's width to be 45 mm.

Monitor Outputs, Display Connectors

Display connectivity includes three standard DisplayPort 2.1 and one HDMI 2.1a. On the Intel reference card one port is marked with a black outline, but not on this card. That port supports higher bitrates (UHBR13.5), i.e. 4K up to 360 Hz, while the other two ports support up to 240 Hz, so I guess you have to test them to find the right one (if you have a monitor that requires such high transfer rates).

In terms of codecs, you get full support for H.264, H.265, VP9 and AV1, both encode and decode. Worth highlighting is HEVC 4:2:2 10-bit encoding and decoding, which is a unique capability, and AV1 Screen Content encoding, which improves the quality of text in movies—fantastic for screen recordings or screen sharing.

Graphics Card Power Plugs

The card uses a single 8-pin connector, plus PCIe slot power, allowing a maximum power draw of 225 W. Sparkle has set their power limit to 200 W, 10 W higher than Intel default.


The card comes with an illuminated Intel ARC logo that's glowing blue and can't be controlled.

Teardown

Graphics Card Cooler Front
Graphics Card Cooler Back

The main cooler uses four heatpipes and provides cooling for the GPU, memory and VRM circuitry.


On the back, you'll find a metal backplate with cutouts to improve airflow.
Our Patreon Silver Supporters can read articles in single-page format.
Discuss(22 Comments)
Apr 17th, 2025 17:33 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts