News Posts matching #Ampere

Return to Keyword Browsing

Galax Reportedly Preparing GeForce RTX GPU Price Cuts

A recent report published by BoardChannels points to Galax possibly implementing a broad set of price cuts across its range of NVIDIA GeForce RTX 40 and 30-series custom graphics card models. Insider information originating from sources within a pool of NVIDIA and AMD board partners suggests that Galax could be shaving off up to 1000 RMB (around $140) from certain Ampere and Ada Lovelace products - effective later this month in its native Hong Kong market as well as mainland China.

The article posits that GeForce RTX 4080 cards could end up becoming 1000 RMB cheaper, and popular RTX 3060 models receiving cuts of around 250 RMB (≈$35). Galax is reported to have already offered entry-level desktop GeForce RTX 3050 cards at lower prices in the latter half of May - with 140 RMB (≈$19.50) reductions. The RTX 4070 series is supposedly set to receive a measly discount of around 150 to 200 RMB (≈$21 to $28), which is likely not doing it many favors given slow worldwide uptake since the product range's launch in mid-April. Galax could be making adjustments to fall in line with rivals (in the region) who have already reduced asking prices for NVIDIA gaming hardware.

Gigabyte Shows AI/HPC and Data Center Servers at Computex

GIGABYTE is exhibiting cutting-edge technologies and solutions at COMPUTEX 2023, presenting the theme "Future of COMPUTING". From May 30th to June 2nd, GIGABYTE is showcasing over 110 products that are driving future industry transformation, demonstrating the emerging trends of AI technology and sustainability, on the 1st floor, Taipei Nangang Exhibition Center, Hall 1.

GIGABYTE and its subsidiary, Giga Computing, are introducing unparalleled AI/HPC server lineups, leading the era of exascale supercomputing. One of the stars is the industry's first NVIDIA-certified HGX H100 8-GPU SXM5 server, G593-SD0. Equipped with the 4th Gen Intel Xeon Scalable Processors and GIGABYTE's industry-leading thermal design, G593-SD0 can perform extremely intensive workloads from generative AI and deep learning model training within a density-optimized 5U server chassis, making it a top choice for data centers aimed for AI breakthroughs. In addition, GIGABYTE is debuting AI computing servers supporting NVIDIA Grace CPU and Grace Hopper Superchips. The high-density servers are accelerated with NVLink-C2C technology under the ARM Neoverse V2 platform, setting a new standard for AI/HPC computing efficiency and bandwidth.

Ampere Computing Unveils New AmpereOne Processor Family with 192 Custom Cores

Ampere Computing today announced a new AmpereOne Family of processors with up to 192 single threaded Ampere cores - the highest core count in the industry. This is the first product from Ampere based on the company's new custom core, built from the ground up and leveraging the company's internal IP. CEO Renée James, who founded Ampere Computing to offer a modern alternative to the industry with processors designed specifically for both efficiency and performance in the Cloud, said there was a fundamental shift happening that required a new approach.

"Every few decades of compute there has emerged a driving application or use of performance that sets a new bar of what is required of performance," James said. "The current driving uses are AI and connected everything combined with our continued use and desire for streaming media. We cannot continue to use power as a proxy for performance in the data center. At Ampere, we design our products to maximize performance at a sustainable power, so we can continue to drive the future of the industry."

NVIDIA Has Stopped Making GeForce RTX 3060 Ti GPUs, According to Industry Leaks

A Taiwanese PC hardware news outlet, Benchlife, has been talking to insider sources positioned within several of NVIDIA's add-in-board (AIB) partners - the author reports that these organizations are experiencing significant changeovers. The AIB informants indicate that production of GeForce RTX 4060 and 4060 Ti models is accelerating, following rumors of the older Ampere-based RTX 3060 Ti card being discontinued. The article's author was seeking further clarification and confirmation from industry insiders, given that most of the recent leaks have emerged from Chinese technology discussion boards. Forumites have posited that NVIDIA has stopped supplying its AIB partners with RTX 3060 Ti silicon. It is difficult to tell whether (via translation) the AIB tipsters have concluded that the older card is totally done for, but NVIDIA is prioritizing the launch of new products.

It would make sense for Team Green to clear the way for the much newer Ada Lovelace-based lineups, but their entry level RTX 3060 cards have remained firm favorites with PC hardware buyers, so it could be quite tricky to play catch up with succeeding product lines. NVIDIA's component suppliers have stated (back in mid-April) that RTX 4000-series GPU production was not ramping up, due to a possible slow uptake of existing cards - in particular the recently released RTX 4070. Given the vast popularity of budget graphics card models, it seems that NVIDIA is preparing to embrace that market segment once again with its latest offerings - due for launch at the end of this month.

NVIDIA GeForce RTX 3070 GPU Available for Below MSRP in Germany

Two years and a half into its storied career, the NVIDIA GeForce RTX 3070 GPU has finally dropped below MSRP in one European territory. German customers will be stoked to jump on e-tailer CaseKing's new offer - ZOTAC's Gaming GeForce RTX 3070 Twin Edge LHR graphics card is currently available for 449 EUR (not counting additional fees), so a saving of 50 Euros from the recommended retail price (499 EUR/$499). 3DCenter seems to be the first hardware news outlet to report on an RTX 3070 GPU dropping under RRP. The RTX 3070 and 3060 Ti models have been best sellers for NVIDIA (and board partners) since late 2020, yet buyers have long complained about unreasonable asking prices, and semi-generous discounts have been very late in arriving - just in time for the succeeding model.

3DCenter has created an overview of the graphics card market in Germany and Austria, and its findings for May 2023 indicate a trend where: "GPU prices in Euros have consistently dropped by ~10% since the end of January, in single cases up to 20%." The overview places the RTX 3070 8 GB in a price bracket position between AMD's Radeon RX 6750XT 12 GB and RX 6800 16 GB (non-XT) SKUs, which brings recent marketing strategies to mind - Team Red thinks that their cards offer the buyer more VRAM for their money when cross examined with the competition.

Intel Sapphire Rapids Sales Forecasted to Slow Down, Microsoft Cuts Orders

According to Ming-Chi Kuo, an industry analyst known for making accurate predictions about Apple, we have some new information regarding Intel's Sapphire Rapids Xeon processors. As Kuo notes, Intel's major Cloud Service Provider (CSP) client, Microsoft, has notified the supply chain that the company is cutting orders of Sapphire Rapids Xeons by 50-70% in the second half of 2023. Interestingly, Intel's supply chain has notified the company to cut chip orders by around 50% amidst weak server demand. This comes straight after Intel's plans to start shipping Sapphire Rapids processors in the second quarter of 2023 and deliver the highly anticipated lineup to customers.

Additionally, Kuo has stated that Intel isn't only competing for clients with AMD but also with Arm-based CPUs. Microsoft also plans to start buying Arm-based server processors made by Ampere Computing in the first half of 2024. This will reduce Microsoft's dependence on x86 architecture and induce higher competition in the market, especially if other CSPs follow.

MSI Releases GeForce RTX 3060 Ti SUPER 3X with SUPRIM Cooler

MSI released the GeForce RTX 3060 Ti SUPER 3X graphics card. This is basically the company using up left over inventory of its TriFrozr SUPRIM cooling solutions with an SKU that's likely to sell in higher volumes. Underneath the cooler is a GeForce RTX 3060 Ti "Ampere" GPU with GDDR6X memory type, instead of the GDDR6 that's standard to the RTX 3060 Ti. The board design of the SUPER 3X is identical to that of the MSI RTX 3070 SUPRIM, which means the company is reusing both the cooler and PCB to create this product. The backplate of the card even features leftover SUPRIM branding.

The MSI RTX 3060 Ti SUPER 3X offers identical clock speeds to the company's RTX 3060 Ti G6X Gaming X Trio graphics card, with 1845 MHz boost frequencies, compared to 1665 MHz reference. The GDDR6X memory ticks at 19 Gbps, which over the 256-bit memory interface yields an impressive 608 GB/s memory bandwidth that's on par with the RTX 3070 Ti. From the looks of it, the MSI RTX 3060 Ti SUPER 3X is a China-exclusive product.

Component Suppliers Suggest That NVIDIA is Taking a Relaxed Approach with RTX 40-Series Production

Two of NVIDIA's providers of Outsourced Semiconductor Assembly and Test (OSAT) services are of the opinion that Team Green is happy to stay the course with its Ada Lovelace GPU production schedule. The backend providers Siliconware Precision Industries (SPIL) and King Yuan Electronics (KYEC) have not been given any new instructions with regard to shifts (up or down) in component assembly output. It is theorized that NVIDIA is aiming to clear any stock backlogs of graphics card models featuring previous generation architecture - namely the second gen GeForce RTX 30-series, built on Ampere.

The retail demand for the newly released GeForce RTX 4070 cards has been mild, to say the least - with plenty of inventory remaining on the shelves in the States. Critical reception of the midweight GeForce RTX GPU has also been middling - many have advised that budget conscience buyers should potentially look elsewhere. The market for discrete graphics card is in a fairly healthy state at the moment, with major production issues and fractured supply chains becoming lesser concerns for electronics manufacturers. NVIDIA has the advantage of being a market leader, and seems to be quite content with proceedings - but their analysts are very likely keeping an eye on the RTX 4070 sales figures. Its products are out and readily available - no need to change direction too sharply.

Chinese GPU Maker Biren Technology Loses its Co-Founder, Only Months After Revealing New GPUs

Golf Jiao, a co-founder and general manager of Biren Technology, has left the company late last month according to insider sources in China. No official statement has been issued by the executive team at Biren Tech, and Jiao has not provided any details regarding his departure from the fabless semiconductor design company. The Shanghai-based firm is a relatively new startup - it was founded in 2019 by several former NVIDIA, Qualcomm and Alibaba veterans. Biren Tech received $726.6 million in funding for its debut range of general-purpose graphics processing units (GPGPUs), also defined as high-performance computing graphics processing units (HPC GPUs).

The company revealed its ambitions to take on NVIDIA's Ampere A100 and Hopper H100 compute platforms, and last August announced two HPC GPUs in the form of the BR100 and BR104. The specifications and performance charts demonstrated impressive figures, but Biren Tech had to roll back its numbers when it was hit by U.S Government enforced sanctions in October 2022. The fabless company had contracted with TSMC to produce its Biren range, and the new set of rules resulted in shipments from the Taiwanese foundry being halted. Biren Tech cut its work force by a third soon after losing its supply chain with TSMC, and the engineering team had to reassess how the BR100 and BR104 would perform on a process node larger than the original 7 nm design. It was decided that a downgrade in transfer rates would appease the legal teams, and get newly redesigned Biren silicon back onto the assembly line.

NVIDIA Ramps Up Battle Against Makers of Unlicensed GeForce Cards

NVIDIA is stepping up to manufacturers of counterfeit graphics card in China according to an article published by MyDrivers - the hardware giant is partnering up with a number of the nation's major e-commerce companies in order to eliminate inventories of bogus GPUs. It is claimed that these online retail platforms, including JD.com and Douyin, are partway into removing a swathe of dodgy stock from their listings. NVIDIA is seeking to disassociate itself from the pool of unlicensed hardware and the brands responsible for flooding the domestic and foreign markets with so-called fake graphics cards. The company is reputed to be puzzled about the murky origins of this bootlegging of their patented designs.

The market became saturated with fake hardware during the Ethereum mining boom - little known cottage companies such as 51RSIC, Corn, Bingying and JieShuoMllse were pushing rebadged cheap OEM cards to domestic e-tail sites. The knock-off GPUs also crept outside of that sector, and import listings started to appear on international platforms including Ebay, AliExpress, Amazon and Newegg. NVIDIA is also fighting to stop the sale of refurbished cards - these are very likely to have been utilized in intensive cryptocurrency mining activities. A flood of these hit the market following an extreme downturn in crypto mining efforts, and many enthusiast communities have warned against acquiring pre-owned cards due to the high risk of component failure.

NVIDIA Enables More Encoding Streams on GeForce Consumer GPUs

NVIDIA has quietly removed some video encoding limitations on its consumer GeForce graphics processing units (GPUs), allowing encoding of up to five simultaneous streams. Previously, NVIDIA's consumer GeForce GPUs were limited to three simultaneous NVENC encodes. The same limitation did not apply to professional GPUs.

According to NVIDIA's own Video Encode and Decode GPU Support Matrix document, the number of concurrent NVENC encodes on consumer GPUs have been increased from three to five. This includes certain GeForce GPUs based on Maxwell 2nd Gen, Pascal, Turing, Ampere, and Ada Lovelace GPU architectures. While the number of concurrent NVDEC decodes were never limited, there is a limitation on how many streams you can encode by certain GPU, depending on the resolution of the stream and the codec.

ASUS IoT Presents Complete Embedded Portfolio to Accelerate Digital Transformation at Embedded World 2023

ASUS IoT, the global AIoT solution provider, today announced that it is delighted to be attending the Embedded World 2023 from March 14-16 in Nuremburg, Germany. On show will be the comprehensive ASUS IoT lineup of edge AI computers, industrial motherboards, COM Express (COMe) modules, Tinker-series single-board computers (SBCs), Mini PCs, market-ready solutions and AI solutions. This impressive portfolio of embedded and IoT offerings delivers ASUS IoT's leading hardware and software computing capabilities and services across diverse vertical markets.

NVIDIA pulls RTX 30-series Founders Edition Cards From its Webstore

NVIDIA has removed all its GeForce RTX 30-series Founders Edition graphics cards from its webstore. There are still plenty of RTX 30-series graphics cards available everywhere, including the NVIDIA webstore, as NVIDIA is happily selling custom versions from its AIC partners. This does not mean that you can't find the RTX 30-series Founders Edition graphics cards at other shops, as some might be available from other retailers/e-tailers like Amazon, Best Buy, Newegg, Mindfactory, and others. It is now obvious that NVIDIA is slowly phasing out the Ampere-based graphics cards and that it will focus on GeForce RTX 4090, 4080, and 4070 Ti, as well as the GeForce RTX 4070 that should come soon.

NVIDIA to Clear Out GA104 Inventory by Carving GeForce RTX 3060 Out of Them

NVIDIA is preparing yet another variant of the GeForce RTX 3060 "Ampere" graphics card, by carving it out of the much larger "GA104" silicon. This SKU will feature 12 GB of faster 19 Gbps GDDR6X memory. Across a 192-bit memory bus, this yields an impressive 456 GB/s of memory bandwidth that's higher than the bandwidth of the original RTX 3070 with 14 Gbps GDDR6 memory across a 256-bit memory bus (448 GB/s). From what we can tell, the core-configuration of the card remains the same, with 3,584 CUDA cores, 112 Tensor cores, 28 RT cores, 112 TMUs, and 48 ROPs. This SKU is carved out of the GA104 silicon by enabling 28 out of 48 SM (that's 58% of the available number-crunching machinery); and slashing down the memory interface by 25%.

Data Center CPU Landscape Allows Ampere Computing to Gain Traction

Once upon a time, the data center market represented a duopoly of x86-64 makers AMD and Intel. However, in recent years companies started developing custom Arm-based processors to handle workloads as complex within smaller power envelopes and doing it more efficiently. According to Counterpoint Research firm, we have the latest data highlighting a significant new player called Ampere Computing in the data center world. With the latest data center revenue share report, we get to see Intel/AMD x86-64 and AWS/Ampere Arm CPU revenue. For the first time, we see that a 3rd party company, Ampere Computing, managed to capture as much as 1.54% market revenue share of the entire data center market in 2022. Thanks to having CPUs in off-the-shelf servers from OEMs, enterprises and cloud providers are able to easily integrate Ampere Altra processors.

Intel, still the most significant player, saw a 70.77% share of the overall revenue; however, that comes as a drop from 2021 data which stated an 80.71% revenue share in the data center market. This represents a 16% year-over-year decline. This reduction is not due to the low demand for server processors, as the global data center CPU market's revenue registered only a 4.4% YoY decline in 2022, but due to the high demand for AMD EPYC solutions, where team red managed to grab 19.84% of the revenue from 2022. This is a 62% YoY growth from last year's 11.74% revenue share. Slowly but surely, AMD is eating Intel's lunch. Another revenue source comes from Amazon Web Services (AWS), which the company filled with its Graviton CPU offerings based on Arm ISA. AWS Graviton CPUs accounted for 3.16% of the market revenue, up 74% from 1.82% in 2021.

AMD Expected to Occupy Over 20% of Server CPU Market and Arm 8% in 2023

AMD and Arm have been gaining up on Intel in the server CPU market in the past few years, and the margins of the share that AMD had won over were especially large in 2022 as datacenter operators and server brands began finding that solutions from the number-2 maker growing superior to those of the long-time leader, according to Frank Kung, DIGITIMES Research analyst focusing primarily on the server industry, who anticipates that AMD's share will well stand above 20% in 2023, while Arm will get 8%.

Prices are one of the three major drivers that resulted in datacenter operators and server brands switching to AMD. Comparing server CPUs from AMD and Intel with similar numbers of cores, clockspeed, and hardware specifications, the price tags of most of the former's products are at least 30% cheaper than the latter's, and the differences could go as high as over 40%, Kung said.

Projected YoY Growth Rate of Global Server Shipments for 2023 Has Been Lowered to 1.87% Due to North American Cloud Service Providers Cutting Demand

Facing global economic headwinds, the four major North American cloud service providers (CSPs) have scaled back their server procurement quantities for 2023 and could make further downward corrections in the future. Meta is the leader among the four in terms of server demand reduction, followed by Microsoft, Google, and AWS. TrendForce has lowered the YoY growth rate of their total server procurement quantity for this year from the original projection of 6.9% to the latest projection of 4.4%. With CSPs cutting demand, global server shipments are now estimated to grow by just 1.87% YoY for 2023. Regarding the server DRAM market, prices there are estimated to drop by around 20~25% QoQ for 1Q23 as CSPs' downward corrections exacerbate the oversupply situation.

Looking at the four CSPs individually, the YoY decline of Meta's server procurement quantity has been widened to 3.0% and could get larger. The instability of the global economy remains the largest variable for all CSPs. Besides this, Meta has also encountered a notable obstacle in expanding its operation in Europe. Specifically, its data center in Denmark has not met the regional standard for emissions. This issue is expected to hinder its progress in setting up additional data centers across the EU. Moreover, businesses related to e-commerce account for about 98% of Meta's revenue. Therefore, the decline in e-commerce activities amidst the recent easing of the COVID-19 pandemic has impacted Meta's growth momentum. Additionally, Meta's server demand has been affected by the high level of component inventory held by server ODMs.

ASUS IoT Announces the PE3000G with Discrete GPU Support via MXM Module

SUS IoT, the global AIoT solution provider, today announced PE3000G—the industry's first edge AI system to support Mobile PCI Express Module (MXM) GPUs from both NVIDIA and Intel. Specifically, the all-new industrial PC works seamlessly with NVIDIA Ampere/Turing or Intel Arc A-series MXM GPUs. Powered by a 12th Gen Intel Core processor and up to 64 GB of DDR5 4800 MHz memory, and combining a proven power design, guaranteed and fanless thermal performance, and superior physical and mechanical ruggedness, PE3000G brings unprecedented longevity, computing power, flexibility and reliability to AI computing at the edge—making it an ideal option for scenarios where resilience, longevity and both CPU and GPU scalability are paramount.

"PE3000G is ASUS IoT's response to the burgeoning demand for accelerating AI inference and extreme deployment in industrial settings," commented KuoWei Chao, General Manager of the ASUS IoT business unit. "With robust power, thermal and mechanical design, it pushes versatile edge-AI-inference applications to business-critical applications. PE3000G is an ideal fit to accelerate edge AI inference in SWaP-constrained applications, such as machine vision in factory automation, outdoor surveillance system and AI-inference systems for autonomous vehicles."

BIOSTAR Expands Graphics Card Lineup with RTX 30-series and GTX 16-series Graphics Card SKUs

BIOSTAR sneakily expanded its graphics card lineup with GeForce RTX 30-series "Ampere" and GTX 16-series "Turing" custom-design graphics cards. The company didn't have these at launch, and for the past couple of years, remained as an AMD-exclusive board partner. The lineup includes a custom-design GeForce RTX 3080 10 GB graphics card with a triple-slot, triple-fan cooling solution; an RTX 3070 with a dual-fan, 2-slot cooling; and cards based on the GTX 1650, GTX 1660, and GTX 1660 Super. The cards come surprisingly late to the party—2 years late for the RTX 30-series, and almost 4 years late for the GTX 16-series. BIOSTAR probably landed itself a good deal with NVIDIA on supply of these GPUs as the company works to clear inventory and pave the way for its 40-series "Ada" GPUs across a wider price-range.

Supermicro Adds ARM-based Servers using Ampere Altra and Ampere Altra Max Processors targeting Cloud-Native Applications

Supermicro, a Total IT Solution Provider for Cloud, AI/ML, Storage, and 5G/Edge, is announcing an expanded product line with exciting new ARM-based series of servers as part of the MegaDC family. Using Ampere Altra and Ampere Altra Max processors, the Mt. Hamilton platform leverages a single unified motherboard design, targeting cloud-native applications, such as Cloud Gaming, Video-on-Demand, CDN, IaaS, Database, Object-Storage, dense VDI, and Telco Edge (Distributed Unit and Centralized Unit) solutions. In addition, the new servers address several objectives for cloud-native workloads, specifically delivering high performance per watt while executing scalable workloads and those that require very low latency responses.

"Supermicro continues to bolster our product line by introducing ARM-based servers, using the Ampere Altra and Altra Max CPUs," said Ivan Tay, SVP of Product Management, Supermicro. "Expanding our already broad server product line gives customers even more choices for their specific workloads. We can quickly offer optimized application servers for customers worldwide using our Building Block Solutions approach."

TechPowerUp GPU-Z v2.52.0 Released

TechPowerUp today released the latest version of TechPowerUp GPU-Z, the popular PC graphics information, monitoring, and diagnostics utility. Version 2.52.0 adds support for AMD Radeon RX 7900 XTX, RX 7900 XT, RX 6300 OEM; NVIDIA GeForce RTX 4070 Ti, and a few rare "Ampere" based GPUs in circulation these days, including the RTX 3080 Ti 20 GB, RTX 3070 Ti based on GA102 silicon, RTX 3050 based on GA107, and the PCIe AIC version of the A800 80 GB accelerator. Detection is improved for the Xe LP-based iGPU of Intel Core "Raptor Lake" processors. NVIDIA GPUs with ECC memory now have ECC status reported in the Advanced panel. On GPUs where the boost frequency can't be read, the base frequency will be used to calculate fillrates. Clock speed detection for Intel Arc "Alchemist" GPUs has been improved. Vendor detection has been added for several new graphics card brands such as Corsair (gaming notebooks), Maxsun, and Wingtech.

DOWNLOAD: TechPowerUp GPU-Z v2.52.0

NVIDIA Partners Beginning to Carve Out RTX 3070 Ti From Larger GA102 Dies

NVIDIA manufactured a heap of large "GA102" Ampere silicon to cater to demand from the crypto-mining boom; only to see that demand vanish. With next-gen RTX 40-series awaiting ramp; the company has to digest these GA102 chips somehow, and is apparently letting its partners use them on performance-segment SKUs such as the GeForce RTX 3070 Ti. The RTX 3070 Ti is normally based on the GA104 silicon, which it maxes out, enabling all 6,144 CUDA cores, 48 RT cores, 192 Tensor cores, 192 TMUs, and 96 ROPs, besides the chip's full 256-bit wide GDDR6X memory interface. This SKU is now being carved out on the larger GA102, by enabling 48 out of 84 streaming multiprocessors (just 57% of the CUDA cores enabled); and narrowing the memory bus from its normal 384-bit, down to 256-bit.

The memory size remains at 8 GB, memory type at GDDR6X, and memory speed at 19 Gbps, working out to 608 GB/s of bandwidth. The most interesting aspect of carving the RTX 3070 Ti out of the GA102 has to be board power; with a ZOTAC-branded card listing it at 320 W, higher than the 290 W of GA104-based cards from the company. Sadly, this is a China-only SKU. Every custom-design graphics card, especially from a reputed AIC such as ZOTAC, has to go through qualification with NVIDIA; which means NVIDIA is not only aware of GA102-based RTX 3070 Ti cards, but is behind fusing the SMs to carve out the SKU, and developing the video BIOS and driver support. ZOTAC is kind enough to list the ASIC code on its website, and for this SKU it is "GA102-150-xx."

NVIDIA Partners Quietly Launch GeForce RTX 3060 with 8GB (128-bit) Memory

NVIDIA's add-in board partners today began quietly launching the GeForce RTX 3060 8 GB, a variant of the RTX 3060 with a third of its memory size and memory bus-width sawed off. The RTX 3060, NVIDIA's best-selling desktop graphics SKU from the RTX 30-series "Ampere," originally launched with 12 GB of GDDR6 memory across a 192-bit wide memory bus, which at its reference speed of 15 Gbps (GDDR6-effective), makes 360 GB/s of memory bandwidth. The new variant comes with 8 GB of GDDR6 memory across a narrower 128-bit memory interface, with the same 15 Gbps data-rate, which works out to 240 GB/s memory bandwidth.

Besides memory size, bus-width, and bandwidth; NVIDIA hasn't tinkered with the core-configuration with the RTX 3060 8 GB. It still comes with 3,584 CUDA cores across 28 SM, which work out to 112 Tensor cores, 28 RT cores, 112 TMUs, and 48 ROPs. The GPU's base frequency is set at 1320 MHz, and boost frequency at 1777 MHz—same as the original RTX 3060. Even the typical graphics power is unchanged, at 170 W. The new 8 GB variant doesn't replace the original, but is being positioned a notch below it, possibly to compete against the likes of the Radeon RX 6600 (non-XT), and perhaps even the Arc A750.

NVIDIA Ada's 4th Gen Tensor Core, 3rd Gen RT Core, and Latest CUDA Core at a Glance

Yesterday, NVIDIA launched its GeForce RTX 40-series, based on the "Ada" graphics architecture. We're yet to receive a technical briefing about the architecture itself, and the various hardware components that make up the silicon; but NVIDIA on its website gave us a first look at what's in store with the key number-crunching components of "Ada," namely the Ada CUDA core, 4th generation Tensor core, and 3rd generation RT core. Besides generational IPC and clock speed improvements, the latest CUDA core benefits from SER (shader execution reordering), an SM or GPC-level feature that reorders execution waves/threads to optimally load each CUDA core and improve parallelism.

Despite using specialized hardware such as the RT cores, the ray tracing pipeline still relies on CUDA cores and the CPU for a handful tasks, and here NVIDIA claims that SER contributes to a 3X ray tracing performance uplift (the performance contribution of CUDA cores). With traditional raster graphics, SER contributes a meaty 25% performance uplift. With Ada, NVIDIA is introducing its 4th generation of Tensor core (after Volta, Turing, and Ampere). The Tensor cores deployed on Ada are functionally identical to the ones on the Hopper H100 Tensor Core HPC processor, featuring the new FP8 Transformer Engine, which delivers up to 5X the AI inference performance over the previous generation Ampere Tensor Core (which itself delivered a similar leap by leveraging sparsity).

AAEON Unveils BOXER-8641AI One of the First Azure-Certified NVIDIA Jetson AGX Orin Devices

AAEON, an industry leading developer of edge computing platforms, is happy to announce that it has joined the Azure Certified Device program, ensuring customers get IoT solutions up and running quickly with hardware and software that has been pre-tested and verified to work with Azure IoT. This certification assures that AAEON's new NVIDIA Jetson AGX Orin -powered BOXER-8641AI has been tested for functionality and interoperability, ensuring that it has used the Microsoft reference configuration to become one of the market's first Azure-certified NVIDIA Jetson Orin devices.

This announcement illustrates AAEON's continued innovation in the edge AI space, as the BOXER-8641AI has been validated to work with Azure and the latest versions of IoT Edge. Such a certification gives AAEON customers confidence that when choosing the BOXER-8641AI for their application, it is fully IoT Edge compatible.
Return to Keyword Browsing
May 15th, 2024 16:28 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts