News Posts matching #GPU

Return to Keyword Browsing

Intel Raptor Lake-S CPU-attached NVMe Storage Remains on PCIe Gen4

Intel is preparing to launch its next-generation desktop platform codenamed Rocket Lake-S. According to the presentation held by Intel today in Shenzen, China, we have official information regarding some of the platform features that Raptor Lake is bringing. Starting with memory support, Raptor Lake is still carrying the transitional DDR4 and DDR5 support, as the full swing towards DDR5 is still in progress. Unlike the previous generation Alder Lake, which brought DDR5-4800 support, Raptor Lake's integrated memory controller can drive DDR5 modules with a 5600 MT/s configuration. As DDR4 support remains, it is limited to 3200 MT/s speed.

Interesting information from the leaked slide points out that support for CPU-attached NVMe storage remains PCIe Gen4. While AMD will provide an AM5 socket with CPU-attached NMVe storage on PCIe Gen5 protocol, Intel is taking a step back and holding on to Gen4. The CPU is outputting 16 PCIe Gen5 lanes on its own. Motherboard vendors for the upcoming 700-series boards for Raptor Lake can still provide a PCIe Gen5 NVMe slot; however, it will have to subtract eight Gen5 lanes from the PCI Express Graphics (PEG) slot and route them to NVMe storage. As our testing shows, this will affect GPU's performance by a few percent. AMD's upcoming AM5 platform has no such issues, as the CPU provides both the PEG and CPU-attached NVMe storage with sufficient PCIe Gen5 bandwidth.

AMD WMMA Instruction is Direct Response to NVIDIA Tensor Cores

AMD's RDNA3 graphics IP is just around the corner, and we are hearing more information about the upcoming architecture. Historically, as GPUs advance, it is not unusual for companies to add dedicated hardware blocks to accelerate a specific task. Today, AMD engineers have updated the backend of the LLVM compiler to include a new instruction called Wave Matrix Multiply-Accumulate (WMMA). This instruction will be present on GFX11, which is the RDNA3 GPU architecture. With WMMA, AMD will offer support for processing 16x16x16 size tensors in FP16 and BF16 precision formats. With these instructions, AMD is adding new arrangements to support the processing of matrix multiply-accumulate operations. This is closely mimicking the work NVIDIA is doing with Tensor Cores.

AMD ROCm 5.2 API update lists the use case for this type of instruction, which you can see below:
rocWMMA provides a C++ API to facilitate breaking down matrix multiply accumulate problems into fragments and using them in block-wise operations that are distributed in parallel across GPU wavefronts. The API is a header library of GPU device code, meaning matrix core acceleration may be compiled directly into your kernel device code. This can benefit from compiler optimization in the generation of kernel assembly and does not incur additional overhead costs of linking to external runtime libraries or having to launch separate kernels.

rocWMMA is released as a header library and includes test and sample projects to validate and illustrate example usages of the C++ API. GEMM matrix multiplication is used as primary validation given the heavy precedent for the library. However, the usage portfolio is growing significantly and demonstrates different ways rocWMMA may be consumed.

Arm Announces the Cortex-X3, Cortex-A715 CPU Cores and Immortalis-G715 GPU

This time last year, I wrote about how digital experiences had never been more important, from personal to business devices - they helped us stay connected and entertained at a time when we needed it most. Compute continues to define our experiences in the modern world, and now these experiences are becoming even more visual.

Smartphones are at the center of our connected lives. From gaming to productivity, through video calling, social media or virtual environments, it is the device that provides us the connection to everyone and everything, in real time. For developers, making these immersive real-time 3D experiences even more compelling and engaging requires more performance. Arm sets the standard for performance and efficient compute, and our latest suite of compute solutions for consumer devices will continue to raise the threshold of what's possible in the mobile market, shaping the visual experiences of tomorrow.

AMD Ryzen 7000 Series Dragon Range and Phoenix Mobile Processor Specifications Leak

AMD is preparing to update its mobile sector with the latest IP in the form of Zen4 CPU cores and RDNA3 graphics. According to Red Gaming Tech, we have specifications of upcoming processor families. First, we have AMD Dragon Range mobile processors representing a downsized Raphael design for laptops. Carrying Zen4 CPU cores and RDNA2 integrated graphics, these processors are meant to power high-performance laptops with up to 16 cores and 32 threads. Being a direct competitor to Intel's Alder Lake-HX, these processors also carry an interesting naming convention. The available SKUs include AMD Ryzen 5 7600HX, Ryzen 7 7800HX, Ryzen 9 7900HX, and Ryzen 9 7980HX design with a massive 16-core configuration. These CPUs are envisioned to run along with more powerful dedicated graphics, with clock speeds of 4.8-5.0+ GHz.

Next, we have AMD Phoenix processors, which take Dragon Range's design to a higher level thanks to the newer graphics IP. Having Zen4 cores, Phoenix processors carry upgraded RDNA3 graphics chips to provide a performance level similar to NVIDIA's GeForce RTX 3060 Max-Q SKU, all in one package. These APUs will come in four initial configurations: Ryzen 5 7600HS, Ryzen 7 7800HS, Ryzen 9 7900HS, and Ryzen 9 7980HS. While maxing out at eight cores, these APUs will compensate with additional GPU compute units with a modular chiplet design. AMD Phoenix is set to become AMD's first chiplet design launching for the laptop market, and we can expect more details as we approach the launch date.

Intel Arc A370M Graphics Card Tested in Various Graphics Rendering Scenarios

Intel's Arc Alchemist graphics cards launched in laptop/mobile space, and everyone is wondering just how well the first generation of discrete graphics performs in actual, GPU-accelerated workloads. Tellusim Technologies, a software company located in San Diego, has managed to get ahold of a laptop featuring an Intel Arc A370M mobile graphics card and benchmark it against other competing solutions. Instead of using Vulkan API, the team decided to use D3D12 API for tests, as the Vulkan usually produces lower results on the new 12th generation graphics. With the 30.0.101.1736 driver version, this GPU was mainly tested in the standard GPU working environment like triangles and batches. Meshlet size is set to 69/169, and the job is as big as 262K Meshlets. The total amount of geometry is 20 million vertices and 40 million triangles per frame.

Using the tests such as Single DIP (drawing 81 instances with u32 indices without going to Meshlet level), Mesh Indexing (Mesh Shader emulation), MDI/ICB (Multi-Draw Indirect or Indirect Command Buffer), Mesh Shader (Mesh Shaders rendering mode) and Compute Shader (Compute Shader rasterization), the Arc GPU produced some exciting numbers, measured in millions or billions of triangles. Below, you can see the results of these tests.

AMD GPU Prices Fall Below MSRP in Europe, NVIDIA GPUs Approach the Baseline

Graphics card prices have been on a steady decline in the past few months, following their peak in May of last year when we saw double and triple pricing compared to the baseline MSRP value. According to the 3DCenter.org report, which tracks graphics card prices in Germany and Austria, we have information that AMD GPU prices have dipped below MSRP, while NVIDIA GPUs are very close to baseline listed prices. The report tracks Ethereum mining profitability and displays it in the yellow line. As the line is declining, so are the GPU prices. For AMD, the prices are now 8% below the 100% of MSRP. At 92%, consumers can find AMD GPUs at a slight discount. While AMD cards are slightly cheaper, NVIDIA GPUs are now at 102% of the MSRP, the lowest price point since the launch.

NVIDIA RTX 40 Series Could Reach 800 Watts on Desktop, 175 Watt for Mobile/Laptop

Rumors of NVIDIA's upcoming Ada Lovelace graphics cards keep appearing. With every new update, it seems like the total power consumption is getting bigger, and today we are getting information about different SKUs, including mobile and desktop variants. According to a well-known leaker, kopite7kimi, we have information about the power limits of the upcoming GPUs. The new RTX 40 series GPUs will feature a few initial SKUs: AD102, AD103, AD104, and AD106. Every SKU, except the top AD102, will be available as well. The first in line, AD102, is the most power-hungry SKU with a maximum power limit rating of 800 Watts. This will require multiple power connectors and a very beefy cooling solution to keep it running.

Going down the stack, we have an AD103 SKU limited to 450 Watts on desktop and 175 Watts on mobile. The AD104 chip is limited to 400 Watts on desktop, while the mobile version is still 175 Watts. Additionally, the AD106 SKU is limited to 260 Watts on desktop and 140 Watts on mobile.

Apple M2 CPU & GPU Benchmarks Surface on Geekbench

The recently announced Apple M2 processor which is set to feature in the new MacBook Air and 13-inch MacBook Pro models has been benchmarked. The processor appeared in numerous Geekbench 5 CPU & GPU tests where the chip scored a maximum single-core result of 1919 points and 8928 points in multi-core representing an 11% and 18% CPU performance improvement respectively from the M1. The chip brings significant GPU performance increases achieving a Geekbench Metal score of 30627 points which is a ~42% increase from the M1 partially due to a larger 10-core GPU compared to the 8-core GPU on the M1. These initial numbers largely align with claims from Apple of an 18% CPU and 35% GPU improvement over the original M1.

AMD Plans Late-October or Early-November Debut of RDNA3 with Radeon RX 7000 Series

AMD is planning to debut its next-generation RDNA3 graphics architecture with the Radeon RX 7000 series desktop graphics cards, some time in late-October or early-November, 2022. This, according to Greymon55, a reliable source with AMD and NVIDIA leaks. We had known about a late-2022 debut for AMD's next-gen graphics, but now we have a finer timeline.

AMD claims that RDNA3 will repeat the feat of over 50 percent generational performance/Watt gains that RDNA2 had over RDNA. The next-generation GPUs will be built on the TSMC N5 (5 nm EUV) silicon fabrication process, and debut a multi-chip module design similar to AMD's processors. The logic dies with the GPU's SIMD components will be built on the most advanced node, while the I/O and display/media accelerators will be located in separate dies that can make do on a slightly older node.

Intel Arc Alchemist GPUs Get Vulkan 1.3 Compatibility

A part of the process of building a graphics card is designing compatibility to execute the latest graphics APIs like DirectX, OpenGL, and Vulkan. Today, we have confirmation that Intel's Arc Alchemist discrete graphics cards will be compatible with Vulkan's latest iteration - version 1.3. In January, Khronos, the team behind Vulkan API, released their regular two-year update to the standard. Graphics card vendors like NVIDIA and AMD announced support immediately with their drivers. Today, the Khronos website officially lists Intel Arc Alchemist mobile graphics cards as compatible with Vulkan 1.3 with Intel Arc A770M, A730M, A550M, A370M, and A350M GPUs.

At the time of writing, there is no official announcement for the desktop cards yet. However, given that the mobile SKUs are supporting the latest standard, it is extremely likely that the desktop variants will also carry the same level of support.

AMD Said to Become TSMC's Third Largest Customer in 2023

Based on a report in the Taiwanese media, AMD is quickly becoming a key customer for TSMC and is expected to become its third largest customer in 2023. This is partially due to new orders that AMD has placed with TSMC for its 5 nm node. AMD is said to become TSMC's single largest customer for its 5 nm node in 2023, although it's not clear from the report how large of a share of the 5 nm node AMD will have.

The additional orders are said to be related to AMD's Zen 4 based processors, as well as its upcoming RDNA3 based GPUs. AMD is expected to be reaching a production volume of some 20,000 wafers in the fourth quarter of 2022, although there's no mention of what's expected in 2023. Considering most of AMD's products for the next year or two will all be based on TSMC's 5 nm node, this shouldn't come as a huge surprise though, as AMD has a wide range of new CPU and GPU products coming.

Jon Peddie Research: Q1 of 2022 Saw a Decline in GPU Shipments Quarter-to-Quarter

Jon Peddie Research reports that the global PC-based graphics processor units (GPU) market reached 96 million units in Q1'22 and PC GPUs shipments decreased 6.2% due to disturbances in China, Ukraine, and the pullback from the lockdown elsewhere. However, the fundamentals of the GPU and PC market are solid over the long term, JPR predicts GPUs will have a compound annual growth rate of 6.3% during 2022-2026 and reach an installed base of 3.3 million units at the end of the forecast period. Over the next five years, the penetration of discrete GPUs (dGPU) in the PC market will grow to reach a level of 46%.

AMD's overall market share percentage from last quarter increased 0.7%, Intel's market share decreased by -2.4%, and Nvidia's market share increased 1.69%, as indicated in the following chart.

ORNL Frontier Supercomputer Officially Becomes the First Exascale Machine

Supercomputing game has been chasing various barriers over the years. This has included MegaFLOP, GigaFLOP, TeraFLOP, PetaFLOP, and now ExaFLOP computing. Today, we are witnessing for the first time an introduction of an Exascale-level machine contained at Oak Ridge National Laboratory. Called the Frontier, this system is not really new. We have known about its upcoming features for months now. What is new is the fact that it was completed and is successfully running at ORNL's facilities. Based on the HPE Cray EX235a architecture, the system uses 3rd Gen AMD EPYC 64-core processors with a 2 GHz frequency. In total, the system has 8,730,112 cores that work in conjunction with AMD Instinct MI250X GPUs.

As of today's TOP500 supercomputers list, the system is overtaking Fugaku's spot to become the fastest supercomputer on the planet. Delivering a sustained HPL (High-Performance Linpack) score of 1.102 Exaflop/s, it features a 52.23 GigaFLOPs/watt power efficiency rating. In the HPL-AI metric, dedicated to measuring the system's AI capabilities, the Frontier machine can output 6.86 exaFLOPs at reduced precisions. This alone is, of course, not a capable metric for Exascale machines as AI works with INT8/FP16/FP32 formats, while the official results are measured in FP64 double-precision form. Fugaku, the previous number one, scores about 2 ExaFLOPs in HPL-AI while delivering "only" 442 PetaFlop/s in HPL FP64 benchmarks.

AMD RDNA 3 GPUs to Support DisplayPort 2.0 UHBR 20 Standard

AMD's upcoming Radeon RX 7000 series of graphics cards based on the RDNA 3 architecture are supposed to feature next-generation protocols all over the board. Today, according to a patch committed to the Linux kernel, we have information about display output choices AMD will present to consumers in the upcoming products. According to a Twitter user @Kepler_L2, who discovered this patch, we know that AMD will bundle DisplayPort 2.0 technology with UHBR 20 transmission mode. The UHBR 20 standard can provide a maximum of 80 Gbps bi-directional bandwidth, representing the highest bandwidth in a display output connector currently available. With this technology, a sample RDNA 3 GPU could display 16K resolution with Display Stream Compression, 10K without compression, or two 8K HDR screens running at 120 Hz refresh rate. All of this will be handled by Display Controller Next (DCN) engine for media.

The availability of DisplayPort 2.0 capable monitors is a story of its own. VESA noted that they should come at the end of 2021; however, they got delayed due to the lack of devices supporting this output. Having AMD's RDNA 3 cards as the newest product to support these monitors, we would likely see the market adapt to demand and few available products as the transition to the latest standard is in the process.

Intel to Present Meteor/Arrow Lake with Foveros 3D Packaging at Hot Chips 34

Hot Chips 34, the upcoming semiconductor conference from Sunday, August 21 to Tuesday, August 23, 2022, will feature many significant contributions from folks like Intel, AMD, Tesla, and NVIDIA. Today, thanks to Intel's registration at the event, we discovered that the company would present its work on Meteor Lake and Arrow Lake processors with the novel Foveros 3D packaging. The all-virtual presentation from Intel will include talks about Ponte Vecchio GPU and its architecture, system, and software; Meteorlake and Arrowlake 3D Client Architecture Platform with Foveros; and some Xeon D and FPGA presentations. You can see the official website here for a complete list of upcoming talks.

As a little reminder, Meteor Lake is supposed to arrive next year, replacing the upcoming Raptor Lake design, and it has already ahs been pictured, which you can see below. The presentation will be recorded and all content posted on Hot Chips's website for non-attendees to catch up on.

AMD Robotics Starter Kit Kick-Starts the Intelligent Factory of the Future

Today AMD announced the Kria KR260 Robotics Starter Kit, the latest addition to the Kria portfolio of adaptive system-on-modules (SOMs) and developer kits. A scalable and out-of-the-box development platform for robotics, the Kria KR260 offers a seamless path to production deployment with the existing Kria K26 adaptive SOMs. With native ROS 2 support, the standard framework for robotics application development, and pre-built interfaces for robotics and industrial solutions, the new SOM starter kit enables rapid development of hardware-accelerated applications for robotics, machine vision and industrial communication and control.

"The Kria KR260 Robotics Starter Kits builds on the success of our Kria SOMs and KV260 Vision AI Starter Kit for AI and embedded developers, providing roboticists with a complete, out-of-the-box solution for this rapidly growing application space," said Chetan Khona, senior director of Industrial, Vision, Healthcare and Sciences Markets at AMD. "Roboticists will now be able to work in their standard development environment on a platform that has all the interfaces and capabilities needed to be up and running in less than an hour. The KR260 Starter Kit is an ideal platform to accelerate robotics innovation and easily take ideas to production at scale."

AMD Claims Higher FPS/$ Radeon GPU Value Over NVIDIA Offerings

Frank Azor, Chief Architect of Gaming Solutions & Marketing at AMD, has posted an interesting slide on Twitter, claiming that AMD Radeon products possess higher FPS/$ value than NVIDIA's graphics offerings. According to the slide, AMD Radeon graphics cards are the best solutions for gamers looking at performance per dollar ratings and performance per watt. This means that AMD claims that Radeon products are inherently higher-value products than NVIDIA's offerings while also more efficient. As the chart shows, which you can see below, some AMD Radeon cards are offering up to 89% better FPS/$ value with up to 123% better FPS/Watt metric. This highest rating is dedicated to Radeon RX 6400 GPU; however, there are all GPUs included in comparison with up to the latest Radeon RX 6950 XT SKU.

Compared to TechPowerUp's own testing of AMD's Radeon cards and multiple reviews calculating the performance per dollar metric, we could not see numbers as high as AMD's. This means that AMD's marketing department probably uses a different selection of games that may perform better on AMD Radeon cards than NVIDIA GeForce RTX. Of course, as with any company marketing material, you should take it with a grain of salt, so please check some of our reviews for a non-biased comparison.

NVIDIA Releases Security Update 473.47 WHQL Driver for Kepler GPUs

Ten years ago, in 2012, NVIDIA introduced its Kepler series of graphics cards based on the TSMC 28 nm node. Architecture has been supported for quite a while now by NVIDIA's drivers, and the last series to carry support was the 470 driver class. Today, NVIDIA pushed a security update in the form of a 473.47 WHQL driver that brings fixes to various CVE vulnerabilities that can cause anything from issues that may lead to denial of service, information disclosure, or data tampering. This driver version has no fixed matters and doesn't bring any additional features except the fix for vulnerabilities. With CVEs rated from 4.1 to 8.5, NVIDIA has fixed major issues bugging Kepler GPU users. With a high risk for code execution, denial of service, escalation of privileges, information disclosure, and data tampering, the 473.47 WHQL driver is another step for supporting Kepler architecture until 2024, when NVIDIA plans to drop the support for this architecture. Supported cards are GT 600, GT 700, GTX 600, GTX 700, Titan, Titan Black, and Titan Z.

The updated drivers are available for installation on NVIDIA's website and for users of TechPowerUp's NVCleanstall software.

NVIDIA GeForce RTX 4090 Twice as Fast as RTX 3090, Features 16128 CUDA Cores and 450W TDP

NVIDIA's next-generation GeForce RTX 40 series of graphics cards, codenamed Ada Lovelace, is shaping up to be a powerful graphics card lineup. Allegedly, we can expect to see a mid-July launch of NVIDIA's newest gaming offerings, where customers can expect some impressive performance. According to a reliable hardware leaker, kopite7kimi, NVIDIA GeForce RTX 4090 graphics card will feature AD102-300 GPU SKU. This model is equipped with 126 Streaming Multiprocessors (SMs), which brings the total number of FP32 CUDA cores to 16128. Compared to the full AD102 GPU with 144 SMs, this leads us to think that there will be an RTX 4090 Ti model following up later as well.

Paired with 24 GB of 21 Gbps GDDR6X memory, the RTX 4090 graphics card has a TDP of 450 Watts. While this number may appear as a very power-hungry design, bear in mind that the targeted performance improvement over the previous RTX 3090 model is expected to be a two-fold scale. Paired with TSMC's new N4 node and new architecture design, performance scaling should follow at the cost of higher TDPs. These claims are yet to be validated by real-world benchmarks of independent tech media, so please take all of this information with a grain of salt and wait for TechPowerUp reviews once the card arrives.

Alleged AMD Instinct MI300 Exascale APU Features Zen4 CPU and CDNA3 GPU

Today we got information that AMD's upcoming Instinct MI300 will be allegedly available as an Accelerated Processing Unit (APU). AMD APUs are processors that combine CPU and GPU into a single package. AdoredTV managed to get ahold of a slide that indicates that AMD Instinct MI300 accelerator will also come as an APU option that combines Zen4 CPU cores and CDNA3 GPU accelerator in a single, large package. With technologies like 3D stacking, MCM design, and HBM memory, these Instinct APUs are positioned to be a high-density compute the product. At least six HBM dies are going to be placed in a package, with the APU itself being a socketed design.

The leaked slide from AdoredTV indicates that the first tapeout is complete by the end of the month (presumably this month), with the first silicon hitting AMD's labs in Q3 of 2022. If the silicon turns out functional, we could see these APUs available sometime in the first half of 2023. Below, you can see an illustration of the AMD Instinct MI300 GPU. The APU version will potentially be of the same size with Zen4 and CDNA3 cores spread around the package. As Instinct MI300 accelerator is supposed to use eight compute tiles, we could see different combinations of CPU/GPU tiles offered. As we await the launch of the next-generation accelerators, we are yet to see what SKUs AMD will bring.

AMD's Integrated GPU in Ryzen 7000 Gets Tested in Linux

It appears that one of AMD's partners has a Ryzen 7000 CPU or APU, with integrated graphics up and running in Linux. Based on details leaked, courtesy of the partner testing the chip using the Phoronix Test Suite and submitting the results to the OpenBenchmarking database. The numbers are by no means impressive, suggesting that this engineering sample isn't running at the proper clock speeds. For example, it only scores 63.1 FPS in Enemy Territory: Quake Wars, where a Ryzen 9 6900HX manages 182.1 FPS, where both GPUs have been allocated 512 MB of system memory as the minimum graphics memory allocation.

The integrated GPU goes under the model name of GFX1036, with older integrated RDNA2 GPUs from AMD having been part of the GFX103x series. It's reported to have a clock speed of 2000/1000 MHz, although it's presumably running at the lower of the two clock speeds, if not even slower, as it's only about a third of the speed or slower, than the GPU in the Ryzen 9 6900HX. That said, the GPU in the Ryzen 7000-series is as far as anyone's aware, not really intended for gaming, since it's a very stripped down GPU that is meant to mainly be for desktop use and media usage, so it's possible that it'll never catch up with the current crop of integrated GPUs from AMD. We'll hopefully find out more in less than two weeks time, when AMD has its keynote at Computex.

NVIDIA Releases Open-Source GPU Kernel Modules

NVIDIA is now publishing Linux GPU kernel modules as open source with dual GPL/MIT license, starting with the R515 driver release. You can find the source code for these kernel modules in the NVIDIA Open GPU Kernel Modules repo on GitHub. This release is a significant step toward improving the experience of using NVIDIA GPUs in Linux, for tighter integration with the OS and for developers to debug, integrate, and contribute back. For Linux distribution providers, the open-source modules increase ease of use.

They also improve the out-of-the-box user experience to sign and distribute the NVIDIA GPU driver. Canonical and SUSE are able to immediately package the open kernel modules with Ubuntu and SUSE Linux Enterprise Distributions. Developers can trace into code paths and see how kernel event scheduling is interacting with their workload for faster root cause debugging. In addition, enterprise software developers can now integrate the driver seamlessly into the customized Linux kernel configured for their project.

Tachyum Delivers the Highest AI and HPC Performance with the Launch of the World's First Universal Processor

Tachyum today launched the world's first universal processor, Prodigy, which unifies the functionality of a CPU, GPU and TPU in a single processor, creating a homogeneous architecture, while delivering massive performance improvements at a cost many times less than competing products.

After the company undertook its mission to conquer the processor performance plateau in nanometer-class chips and the systems they power, Tachyum has succeeded by launching its first commercial product. The Prodigy Cloud/AI/HPC supercomputer processor chip offers 4x the performance of the fastest Xeon, has 3x more raw performance than NVIDIA's H100 on HPC and has 6x more raw performance on AI training and inference workloads, and up to 10x performance at the same power. Prodigy is poised to overcome the challenges of increasing data center power consumption, low server utilization and stalled performance scaling.

Supermicro Accelerates AI Workloads, Cloud Gaming, Media Delivery with New Systems Supporting Intel's Arctic Sound-M and Intel Habana Labs Gaudi 2

Super Micro Computer, Inc. (Nasdaq: SMCI), a global leader in enterprise computing, storage, networking, and green computing technology, supports two new Intel-based accelerators for demanding cloud gaming, media delivery, AI and ML workloads, enabling customers to deploy the latest acceleration technology from Intel and Intel Habana. "Supermicro continues to work closely with Intel and Habana Labs to deliver a range of server solutions supporting Arctic Sound-M and Gaudi 2 that address the demanding needs of organizations that require highly efficient media delivery and AI training," said Charles Liang, president and CEO. "We continue to collaborate with leading technology suppliers to deliver application-optimized total system solutions for complex workloads while also increasing system performance."

Supermicro can quickly bring to market new technologies by using a Building Block Solutions approach to designing new systems. This methodology allows new GPUs and acceleration technology to be easily placed into existing designs or, when necessary, quickly adapt an existing design when needed for higher-performing components. "Supermicro helps deliver advanced AI and media processing with systems that leverage our latest Gaudi 2 and Arctic Sound-M accelerators," stated Sandra Rivera, executive vice president and general manager of the Datacenter and AI Group at Intel. "Supermicro's Gaudi AI Training Server will accelerate deep learning training in some of the fastest growing workloads in the datacenter."

NVIDIA H100 SXM Hopper GPU Pictured Up Close

ServeTheHome, a tech media outlet focused on everything server/enterprise, posted an exclusive set of photos of NVIDIA's latest H100 "Hopper" accelerator. Being the fastest GPU NVIDIA ever created, H100 is made on TSMC's 4 nm manufacturing process and features over 80 billion transistors on an 814 mm² CoWoS package designed by TSMC. Complementing the massive die, we have 80 GB of HBM3 memory that sits close to the die. Pictured below, we have an SXM5 H100 module packed with VRM and power regulation. Given that the rated TDP for this GPU is 700 Watts, power regulation is a serious concern and NVIDIA managed to keep it in check.

On the back of the card, we see one short and one longer mezzanine connector that acts as a power delivery connector, different from the previous A100 GPU layout. This board model is labeled PG520 and is very close to the official renders that NVIDIA supplied us with on launch day.
Return to Keyword Browsing
Nov 23rd, 2024 20:14 EST change timezone

New Forum Posts

Popular Reviews

Controversial News Posts