News Posts matching #Meta

Return to Keyword Browsing

Meta's Llama 4 Can Process 10 Million Tokens as Input, Lives in Native Multimodality

by

Apr 7th, 2025 10:07 Discuss (3 Comments)

Meta has prepared a leap-forward update for its Llama model series with the v4 release, entering an era of native multimodality within the company's AI models. At the forefront is Llama 4 Scout, a model boasting 17 billion active parameters distributed across 16 experts in a mixture-of-experts (MoE) configuration. With FP4 precision, this model is engineered to run entirely on a single NVIDIA H100 GPU. Scout now supports an industry-leading input context window of up to 10 million tokens, a substantial leap from previous limits like Google's old Gemini 1.5 Pro, which came with 2 million token input content. Llama 4 Scout is built using a hybrid dense and MoE architecture, which selectively activates only a subset of each token's total parameters, optimizing training and inference efficiency. This architecture not only accelerates computation but also reduces associated costs.

Meanwhile, Llama 4 Maverick, another model in the series, also features 17 billion active parameters but incorporates 128 experts, scaling to 400 billion total parameters. Maverick has demonstrated superior performance in coding, image understanding, multilingual processing, and logical reasoning, even outperforming several leading models in its class. Both models embrace native multimodality by integrating text and image data early in the processing pipeline. Utilizing a custom MetaCLIP-based vision encoder, these models can simultaneously process multiple images and text, combining tokens into a single backend processor. This ensures robust visual comprehension and precise object anchoring, powering applications such as detailed image description, visual question-answering, and analysis of temporal image sequences.

Read full story

Sci-fi Shooter/Smasher "Harpagun" Slams onto PS VR2 & Meta Quest VR Platforms on April 10

Press Release by

Apr 4th, 2025 11:49 Discuss (5 Comments)

What is it? What's the formula, the secret ingredient to make a VR game bring pure excitement and adrenaline to the players? How can we make them smile and go "wow" after they take off the headset? For us devs at Something Random working on Harpagun, it all comes down to a few simple elements. Let's take it from the top.

Movement that matters: Speed, control, and immersion
Action games are all about movement. An issue mostly already solved in flat screen games but still problematic in VR. Some forms of locomotion allow for precision but are pretty slow and unresponsive. Others let you zoom around the locations, but can be uncomfortable. Those that are, are limiting or take you out of the illusion of "being there." In Harpagun we needed a system with a clear set of goals: comfort, simplicity, responsiveness, speed and immersion. In a proper arcade game players have to be able to react in a blink of an eye, immediately focus on what's the most important, change their position to avoid danger or get a better shot at an enemy. We managed to achieve that with our "pylon and anchor" system. Players move between sets of points with their eyes anchored to points of interest such as a path forward or center of the combat area. The movement is smooth but fast enough to be comfortable while allowing for total control of the battlefield.

Read full story

Meta Reportedly Reaches Test Phase with First In-house AI Training Chip

by

Mar 11th, 2025 11:56 Discuss (0 Comments)

According to a Reuters technology report, Meta's engineering department is engaged in the testing of their "first in-house chip for training artificial intelligence systems." Two inside sources have declared this significant development milestone; involving a small-scale deployment of early samples. The owner of Facebook could ramp up production, upon initial batches passing muster. Despite a recent-ish showcasing of an open-architecture NVIDIA "Blackwell" GB200 system for enterprise, Meta leadership is reported to be pursuing proprietary solutions. Multiple big players—in the field of artificial intelligence—are attempting to breakaway from a total reliance on Team Green. Last month, press outlets concentrated on OpenAI's alleged finalization of an in-house design, with rumored involvement coming from Broadcom and TSMC.

One of the Reuters industry moles believes that Meta has signed up with TSMC—supposedly, the Taiwanese foundry was responsible for the production of test batches. Tom's Hardware reckons that Meta and Broadcom were working together with the tape out of the social media giant's "first AI training accelerator." Development of the company's "Meta Training and Inference Accelerator" (MTIA) series has stretched back a couple of years—according to Reuters, this multi-part project: "had a wobbly start for years, and at one point scrapped a chip at a similar phase of development...Meta last year, started using an MTIA chip to perform inference, or the process involved in running an AI system as users interact with it, for the recommendation systems that determine which content shows up on Facebook and Instagram news feeds." Leadership is reportedly aiming to get custom silicon solutions up and running for AI training by next year. Past examples of MTIA hardware were deployed with open-source RISC-V cores (for inference tasks), but is not clear whether this architecture will form the basis of Meta's latest AI chip design.

Arm to Develop In-House Server CPUs, Signs Meta as First Customer

by

Feb 14th, 2025 10:23 Discuss (5 Comments)

Reports from Financial Times suggest Arm has plans to create its own CPU, set to hit the market in 2025 with Meta Platforms said to be one of the first customers. The chip is said to be a CPU for data center servers, with TSMC handling the manufacturing. However, when the Financial Times asked about this, SoftBank (the majority owner of Arm) and Meta stayed quiet, while Arm didn't give a statement. A Nikkei report from May 2024 suggested that a prototype AI processor chip would be completed by spring 2025 and available for sale by fall 2025, so the latest information from the Financial Times report feels like a confirmation of previous rumors.

Right now, Arm makes money by letting others use its instruction set and core designs to make their own chips. This new move could mean Arm will compete with its current customers. Sources in the industry say Arm is trying to win business from Qualcomm, with rumors that Arm has been bringing in executives from companies it works with to help develop this chip. While Qualcomm had talked in the past about giving Meta a data center CPU using Arm's design, it looks like Arm has won at least some of that deal. However, no technical or specification details are available currently for Arm's 1st in-house server CPU.

Read full story

AMD Faces Investor Skepticism as AI Market Moves Toward Custom Chips

by

Feb 3rd, 2025 12:18 Discuss (21 Comments)

AMD is set to share its fourth-quarter results on Tuesday, Feb. 4 facing opportunities and problems in the fast-changing AI chip market as investors are expected to look closely at AMD's AI strategy. Reuters reports that experts think AMD's revenue will increase by over 22% to $7.53 billion. They expect its data center part to make up more than half of total sales at $4.15 billion. Yet, investors still worry about how AMD stands in the AI race. TD Cowen experts and Omdia believe AMD could sell $10 billion worth of AI chips this year, this is twice what AMD itself thinks it will sell, which is $5 billion. However, the scene is getting more complex with Big Tech firms like Microsoft, Amazon, and Meta making their own special chips for AI work. This move to custom chips, along with NVIDIA's strong market position and its popular CUDA software, makes things tough for AMD. The high costs of switching chipmakers also make it hard for AMD to grow its share of the market, however, the ongoing increase in AI spending by tech giants could help balance out these problems. Investors see "customer silicon and NVIDIA as the AI chip market going forward," said Ryuta Makino, analyst at AMD investor Gabelli Funds.

Supply chain issues make AMD's position more difficult as TSMC is boosting its advanced packaging ability to fix bottlenecks, while NVIDIA's production increase of its new "Blackwell" AI chips might restrict AMD's access to manufacturing resources. Yet, AMD's business has some good news, its personal computer unit should grow by almost 33% to $1.94 billion catching up to Intel.

Read full story

Comcast Introduces Nation's First Ultra-Low Lag Xfinity Internet Experience With Meta, NVIDIA, and Valve

Press Release by

Jan 29th, 2025 12:02 Discuss (20 Comments)

Comcast is introducing the first customers in the world to a pioneering new, ultra-low lag connectivity experience when they use interactive applications like gaming, videoconferencing, and virtual reality. With the launch, Xfinity Internet latency will be dramatically reduced to faster than the blink of an eye, currently when using FaceTime on iPhone, iPad, Mac, Apple TV, and Apple Vision Pro, apps on Meta's mixed reality headsets that will support this technology, NVIDIA's GeForce NOW, many games on Valve's Steam games platform, and in the future on other applications that choose to leverage this open standard technology.

"Our connectivity is the key to unlocking a world of entertainment, sports, news and information and we're constantly pushing the limits of network innovation to create an experience that exceeds the expanding demands of our customers," said Emily Waldorf, Senior Vice President, Consumer Products, Comcast Connectivity and Platforms. "Modern applications are real-time and interactive and require more than just fast speeds. Xfinity Internet's lower lag times will be a differentiator for Comcast."

Read full story

PC Gaming in the Cloud Goes Everywhere With New Devices and AAA Games on GeForce NOW

CES Press Release by

Jan 7th, 2025 01:28 Discuss (0 Comments)

GeForce NOW turns any device into a GeForce RTX gaming PC, and is bringing cloud gaming and AAA titles to more devices and regions. Announced today at the CES trade show, gamers will soon be able to play titles from their Steam library at GeForce RTX quality with the launch of a native GeForce NOW app for the Steam Deck. NVIDIA is working to bring cloud gaming to the popular PC gaming handheld device later this year.

In collaboration with Apple, Meta and ByteDance, NVIDIA is expanding GeForce NOW cloud gaming to Apple Vision Pro spatial computers, Meta Quest 3 and 3S and Pico virtual- and mixed-reality devices - with all the bells and whistles of NVIDIA technologies, including ray tracing and NVIDIA DLSS. In addition, NVIDIA is launching the first GeForce RTX-powered data center in India, making gaming more accessible around the world. Plus, GeForce NOW's extensive library of over 2,100 supported titles is expanding with highly anticipated AAA titles. DOOM: The Dark Ages and Avowed will join the cloud when they launch on PC this year.

Read full story

Ultra Accelerator Link Consortium Plans Year-End Launch of UALink v1.0

Press Release by

Oct 29th, 2024 14:11 Discuss (2 Comments)

Ultra Accelerator Link (UALink ) Consortium, led by Board Members from AMD, Amazon Web Services (AWS), Astera Labs, Cisco, Google, Hewlett Packard Enterprise (HPE), Intel, Meta and Microsoft, have announced the incorporation of the Consortium and are extending an invitation for membership to the community. The UALink Promoter Group was founded in May 2024 to define a high-speed, low-latency interconnect for scale-up communications between accelerators and switches in AI pods & clusters. "The UALink standard defines high-speed and low latency communication for scale-up AI systems in data centers"

Read full story

Meta Shows Open-Architecture NVIDIA "Blackwell" GB200 System for Data Center

by

Oct 18th, 2024 09:52 Discuss (11 Comments)

During the Open Compute Project (OCP) Summit 2024, Meta, one of the prime members of the OCP project, showed its NVIDIA "Blackwell" GB200 systems for its massive data centers. We previously covered Microsoft's Azure server rack with GB200 GPUs featuring one-third of the rack space for computing and two-thirds for cooling. A few days later, Google showed off its smaller GB200 system, and today, Meta is showing off its GB200 system—the smallest of the bunch. To train a dense transformer large language model with 405B parameters and a context window of up to 128k tokens, like the Llama 3.1 405B, Meta must redesign its data center infrastructure to run a distributed training job on two 24,000 GPU clusters. That is 48,000 GPUs used for training a single AI model.

Called "Catalina," it is built on the NVIDIA Blackwell platform, emphasizing modularity and adaptability while incorporating the latest NVIDIA GB200 Grace Blackwell Superchip. To address the escalating power requirements of GPUs, Catalina introduces the Orv3, a high-power rack capable of delivering up to 140kW. The comprehensive liquid-cooled setup encompasses a power shelf supporting various components, including a compute tray, switch tray, the Orv3 HPR, Wedge 400 fabric switch with 12.8 Tbps switching capacity, management switch, battery backup, and a rack management controller. Interestingly, Meta also upgraded its "Grand Teton" system for internal usage, such as deep learning recommendation models (DLRMs) and content understanding with AMD Instinct MI300X. Those are used to inference internal models, and MI300X appears to provide the best performance per Dollar for inference. According to Meta, the computational demand stemming from AI will continue to increase exponentially, so more NVIDIA and AMD GPUs is needed, and we can't wait to see what the company builds.

Marvell Collaborates with Meta for Custom Ethernet Network Interface Controller Solution

Press Release by

Oct 15th, 2024 01:06 Discuss (0 Comments)

Marvell Technology, Inc. (NASDAQ: MRVL), a leader in data infrastructure semiconductor solutions, today announced the development of FBNIC, a custom 5 nm network interface controller (NIC) ASIC in collaboration with Meta to meet the company's infrastructure and use case requirements. The FBNIC board design will also be contributed by Marvell to the Open Compute Project (OCP) community. FBNIC combines a customized network controller designed by Marvell and Meta, a co-designed board, and Meta's ASIC, firmware and software. This custom design delivers innovative capabilities, optimizes performance, increases efficiencies, and reduces the average time needed to resolve potential network and server issues.

"The future of large-scale, data center computing will increasingly revolve around optimizing semiconductors and other components for specific applications and cloud infrastructure architectures," said Raghib Hussain, President of Products and Technologies at Marvell. "It's been exciting to partner with Meta on developing their custom FBNIC on our industry-leading 5 nm accelerated infrastructure silicon platform. We look forward to the OCP community leveraging the board design for future innovations."

Read full story

Microsoft Discontinues HoloLens 2, Shifts Mixed-Reality Strategy

by

Oct 3rd, 2024 07:12 Discuss (17 Comments)

Microsoft has officially ended production of its HoloLens 2 mixed-reality headset, according to sources confirmed by The Register. The tech giant recently notified its partners that the HoloLens 2, introduced in 2019 as an enterprise-focused augmented reality device, is no longer available for purchase. This marks a significant shift in Microsoft's AR strategy, with the company stating, "Support for HoloLens 2, including security updates, will end on December 31, 2027." Despite aggressive marketing efforts, the HoloLens 2 struggled to gain widespread adoption, reflecting broader challenges in the AR/VR market where high-end headsets like HoloLens 2 and Apple Vision Pro retail for around $3,500, limiting their appeal. Some Microsoft employees reportedly expressed surprise that the project continued as long as it did, suggesting internal doubts about its viability.

Rather than continuing as a hardware provider, Microsoft plans to pivot its role in the mixed reality space, focusing on "first-party software solutions and services, partnering with the broader mobile phone and mixed reality hardware ecosystem." This decision aligns with the current state of the AR/VR industry, where the ecosystem is still in its early stages, and companies like Meta are heavily investing in its development. Microsoft's shift from hardware production to ecosystem investment mirrors trends in the broader tech industry and could position the company for future opportunities as the mixed-reality market matures. As the ecosystem develops and more use cases emerge, Microsoft's investment in software and services could prove valuable despite the current challenges in justifying investments in a field that's still searching for compelling widespread applications.

Logitech Releases MX Ink Mixed Reality Stylus for Meta Quest

Press Release by

Sep 25th, 2024 13:57 Discuss (1 Comment)

Logitech announced the availability of MX Ink, the first Mixed Reality (MR) stylus specifically designed for Meta Quest. A precision tool with a familiar pen-like feel, MX Ink allows users to navigate, annotate and create freely across 2D spaces like papers, desks, or whiteboards, as well as immersive 3D environments. The pressure-sensitive tip of MX Ink enables natural writing and gaming motions, merging the tactile sensation of a physical tool with the limitless possibilities of the virtual creative space.

MX ink is currently supported by a wide range of applications across the creativity and productivity landscape, as well as in industries such as medicine, architecture, and education, with new applications being added regularly.

Read full story

Meta Announces the Quest 3S, its Most Affordable Mixed Reality Headset to Date Starting at US$300

Press Release by

Sep 25th, 2024 13:41 Discuss (2 Comments)

Today at Connect, we unveiled Meta Quest 3S, a headset with the same mixed reality capabilities and fast performance as Meta Quest 3, but at a lower price point. Starting at just $299.99 USD, Quest 3S is the best headset for those new to mixed reality and immersive experiences, or who might have been waiting for a low-cost upgrade from Quest and Quest 2.

From watching your favorite TV shows on a cinema-sized screen to your own personal trainer that you can take with you anywhere you go, plus multitasking capabilities, gaming and more, there's no better mixed reality device on the market at this price.

Read full story

VR/MR Device Shipments to Reach 37 Million Units by 2030, with OLEDoS and LCD Dominating High-End and Mainstream Markets

Press Release by

Aug 5th, 2024 03:57 Discuss (0 Comments)

TrendForce's latest report reveals that shipments of near-eye displays are expected to increase year-by-year over the next few years following inventory clearance. It is anticipated that OLEDoS will dominate the high-end VR/MR market, with its technological share rising to 23% by 2030, while LCD will continue to occupy the mainstream market, holding a 63% share in near-eye displays.

TrendForce defines VR/MR devices as near-eye displays that achieve an immersive experience through a single display. Devices emphasizing transparency and the integration of virtual and real-world applications are classified as AR devices.

Read full story

Global AI Server Demand Surge Expected to Drive 2024 Market Value to US$187 Billion; Represents 65% of Server Market

Press Release by

Jul 17th, 2024 05:45 Discuss (7 Comments)

TrendForce's latest industry report on AI servers reveals that high demand for advanced AI servers from major CSPs and brand clients is expected to continue in 2024. Meanwhile, TSMC, SK hynix, Samsung, and Micron's gradual production expansion has significantly eased shortages in 2Q24. Consequently, the lead time for NVIDIA's flagship H100 solution has decreased from the previous 40-50 weeks to less than 16 weeks.

TrendForce estimates that AI server shipments in the second quarter will increase by nearly 20% QoQ, and has revised the annual shipment forecast up to 1.67 million units—marking a 41.5% YoY growth.

Read full story

AI Startup Etched Unveils Transformer ASIC Claiming 20x Speed-up Over NVIDIA H100

by

Jun 25th, 2024 12:30 Discuss (37 Comments)

A new startup emerged out of stealth mode today to power the next generation of generative AI. Etched is a company that makes an application-specific integrated circuit (ASIC) to process "Transformers." The transformer is an architecture for designing deep learning models developed by Google and is now the powerhouse behind models like OpenAI's GPT-4o in ChatGPT, Anthropic Claude, Google Gemini, and Meta's Llama family. Etched wanted to create an ASIC for processing only the transformer models, making a chip called Sohu. The claim is Sohu outperforms NVIDIA's latest and greatest by an entire order of magnitude. Where a server configuration with eight NVIDIA H100 GPU clusters pushes Llama-3 70B models at 25,000 tokens per second, and the latest eight B200 "Blackwell" GPU cluster pushes 43,000 tokens/s, the eight Sohu clusters manage to output 500,000 tokens per second.

Why is this important? Not only does the ASIC outperform Hopper by 20x and Blackwell by 10x, but it also serves so many tokens per second that it enables an entirely new fleet of AI applications requiring real-time output. The Sohu architecture is so efficient that 90% of the FLOPS can be used, while traditional GPUs boast a 30-40% FLOP utilization rate. This translates into inefficiency and waste of power, which Etched hopes to solve by building an accelerator dedicated to power transformers (the "T" in GPT) at massive scales. Given that the frontier model development costs more than one billion US dollars, and hardware costs are measured in tens of billions of US Dollars, having an accelerator dedicated to powering a specific application can help advance AI faster. AI researchers often say that "scale is all you need" (resembling the legendary "attention is all you need" paper), and Etched wants to build on that.

Read full story

CSPs to Expand into Edge AI, Driving Average NB DRAM Capacity Growth by at Least 7% in 2025

Press Release by

Jun 25th, 2024 03:46 Discuss (0 Comments)

TrendForce has observed that in 2024, major CSPs such as Microsoft, Google, Meta, and AWS will continue to be the primary buyers of high-end AI servers, which are crucial for LLM and AI modeling. Following establishing a significant AI training server infrastructure in 2024, these CSPs are expected to actively expand into edge AI in 2025. This expansion will include the development of smaller LLM models and setting up edge AI servers to facilitate AI applications across various sectors, such as manufacturing, finance, healthcare, and business.

Moreover, AI PCs or notebooks share a similar architecture to AI servers, offering substantial computational power and the ability to run smaller LLM and generative AI applications. These devices are anticipated to serve as the final bridge between cloud AI infrastructure and edge AI for small-scale training or inference applications.

Read full story

AMD, Broadcom, Cisco, Google, HPE, Intel, Meta and Microsoft Form Ultra Accelerator Link (UALink) Promoter Group to Combat NVIDIA NVLink

Press Release by

May 30th, 2024 11:51 Discuss (17 Comments)

AMD, Broadcom, Cisco, Google, Hewlett Packard Enterprise (HPE), Intel, Meta and Microsoft today announced they have aligned to develop a new industry standard dedicated to advancing high-speed and low latency communication for scale-up AI systems linking in Data Centers.

Called the Ultra Accelerator Link (UALink), this initial group will define and establish an open industry standard that will enable AI accelerators to communicate more effectively. By creating an interconnect based upon open standards, UALink will enable system OEMs, IT professionals and system integrators to create a pathway for easier integration, greater flexibility and scalability of their AI-connected data centers.

Read full story

RISC-V Adoption to Grow 50% Yearly Due to AI Processor Demand

by

May 21st, 2024 02:13 Discuss (7 Comments)

The open-source RISC-V instruction set architecture is shaping up for explosive growth over the next several years, primarily fueled by the increasing demand for artificial intelligence (AI) across industries. A new forecast from tech research firm Omdia predicts that shipments of RISC-V-based chips will skyrocket at an astonishing 50% annual growth rate between 2024 and 2030, sitting at a staggering 17 billion RISC-V units in 2030. The automotive sector is expected to see the most significant growth in RISC-V adoption, with a forecasted annual increase of 66%. This growth is largely attributed to the unique benefits RISC-V offers in this industry, including its flexibility and customizability.

The rise of AI in the automotive sector, particularly in applications such as autonomous driving and advanced driver assistance systems (ADAS), is also expected to contribute to RISC-V's success. Industrial applications will continue to be the largest domain for RISC-V, accounting for approximately 46% of sales. However, the growth in the automotive sector is expected to outpace other industries, driven by the increasing demand for AI-enabled technologies in this sector. The forecast from Omdia is based on current trends and the growing adoption of RISC-V by major players in the tech industry, including Google and Meta, which are investing in RISC-V to power their custom solutions. Additionally, chip producers like Qualcomm are creating their RISC-V chips for consumer use, further solidifying the technology's future position in the market.

Microsoft Prepares MAI-1 In-House AI Model with 500B Parameters

by

May 7th, 2024 01:29 Discuss (1 Comment)

According to The Information, Microsoft is developing a new AI model, internally named MAI-1, designed to compete with the leading models from Google, Anthropic, and OpenAI. This significant step forward in the tech giant's AI capabilities is boosted by Mustafa Suleyman, the former Google AI leader who previously served as CEO of Inflection AI before Microsoft acquired the majority of its staff and intellectual property for $650 million in March. MAI-1 is a custom Microsoft creation that utilizes training data and technology from Inflection but is not a transferred model. It is also distinct from Inflection's previously released Pi models, as confirmed by two Microsoft insiders familiar with the project. With approximately 500 billion parameters, MAI-1 will be significantly larger than its predecessors, surpassing the capabilities of Microsoft's smaller, open-source models.

For comparison, OpenAI's GPT-4 boasts 1.8 trillion parameters in a Mixture of Experts sparse design, while open-source models from Meta and Mistral feature 70 billion parameters dense. Microsoft's investment in MAI-1 highlights its commitment to staying competitive in the rapidly evolving AI landscape. The development of this large-scale model represents a significant step forward for the tech giant, as it seeks to challenge industry leaders in the field. The increased computing power, training data, and financial resources required for MAI-1 demonstrate Microsoft's dedication to pushing the boundaries of AI capabilities and intention to compete on its own. With the involvement of Mustafa Suleyman, a renowned expert in AI, the company is well-positioned to make significant strides in this field.

Razer Introduces New Meta Quest 3 Accessories

Press Release by

May 6th, 2024 15:01 Discuss (2 Comments)

As the Product Evangelist for Razer's VR accessories, it's my absolute pleasure to introduce an exciting leap forward in virtual reality gaming: the launch of our new Razer Facial Interface and Razer Adjustable Head Strap System for Meta Quest 3. In developing these products, we aimed to merge Razer's cutting-edge technology with the needs of the modern VR gamer, creating a truly immersive and comfortable experience. These join our current line of products for Meta Quest, including the Razer Hammerhead HyperSpeed for Meta Quest 3.

Crafted for Comfort, Designed for Gamers
Our journey began with a vision to redefine what gamers can expect from their VR equipment. Partnering with ResMed, human factor experts with over 30 years of experience, we drew upon over three decades of expertise to ensure our accessories not only push the envelope in terms of design but also set a new standard for comfort. Our previous generation of VR accessories for the Meta Quest platform was recognized with the Australian Good Design Award, a testament to our commitment to innovation.

Read full story

Meta Opens OS Powering Meta Quest Devices to Third-Party Hardware Makers, ASUS ROG Gaming Headset Incoming

Press Release by

Apr 24th, 2024 02:42 Discuss (5 Comments)

Today we're taking the next step toward our vision for a more open computing platform for the metaverse. We're opening up the operating system powering our Meta Quest devices to third-party hardware makers, giving more choice to consumers and a larger ecosystem for developers to build for. We're working with leading global technology companies to bring this new ecosystem to life and making it even easier for developers to build apps and reach their audiences on the platform.

Introducing Meta Horizon OS
This new hardware ecosystem will run on Meta Horizon OS, the mixed reality operating system that powers our Meta Quest headsets. We chose this name to reflect our vision of a computing platform built around people and connection—and the shared social fabric that makes this possible. Meta Horizon OS combines the core technologies powering today's mixed reality experiences with a suite of features that put social presence at the center of the platform.

Read full story

Meta Announces New MTIA AI Accelerator with Improved Performance to Ease NVIDIA's Grip

by

Apr 11th, 2024 04:12 Discuss (19 Comments)

Meta has announced the next generation of its Meta Training and Inference Accelerator (MTIA) chip, which is designed to train and infer AI models at scale. The newest MTIA chip is a second-generation design of Meta's custom silicon for AI, and it is being built on TSMC's 5 nm technology. Running at the frequency of 1.35 GHz, the new chip is getting a boost to 90 Watts of TDP per package compared to just 25 Watts for the first-generation design. Basic Linear Algebra Subprograms (BLAS) processing is where the chip shines, and it includes matrix multiplication and vector/SIMD processing. At GEMM matrix processing, each chip can process 708 TeraFLOPS at INT8 (presumably meant FP8 in the spec) with sparsity, 354 TeraFLOPS without, 354 TeraFLOPS at FP16/BF16 with sparsity, and 177 TeraFLOPS without.

Classical vector and processing is a bit slower at 11.06 TeraFLOPS at INT8 (FP8), 5.53 TeraFLOPS at FP16/BF16, and 2.76 TFLOPS single-precision FP32. The MTIA chip is specifically designed to run AI training and inference on Meta's PyTorch AI framework, with an open-source Triton backend that produces compiler code for optimal performance. Meta uses this for all its Llama models, and with Llama3 just around the corner, it could be trained on these chips. To package it into a system, Meta puts two of these chips onto a board and pairs them with 128 GB of LPDDR5 memory. The board is connected via PCIe Gen 5 to a system where 12 boards are stacked densely. This process is repeated six times in a single rack for 72 boards and 144 chips in a single rack for a total of 101.95 PetaFLOPS, assuming linear scaling at INT8 (FP8) precision. Of course, linear scaling is not quite possible in scale-out systems, which could bring it down to under 100 PetaFLOPS per rack.

Below, you can see images of the chip floorplan, specifications compared to the prior version, as well as the system.

Read full story

Homeworld Franchise Comes to Virtual Reality for the First Time With 'Homeworld: Vast Reaches', a New Game Arriving in 2024

Press Release by

Apr 3rd, 2024 12:35 Discuss (4 Comments)

FarBridge, Inc., a leading game development studio, in partnership with Gearbox Entertainment, is excited to announce Homeworld: Vast Reaches, a bold new story in the beloved Homeworld saga that reimagines strategic space battles for Virtual Reality and Mixed Reality. This new game in the Homeworld universe is launching on the Meta Quest 2 and Meta Quest 3 headsets later this year. Players can now wishlist the game at HomeworldVastReaches.com.

In the award-winning Homeworld games for PC, you play as Fleet Command, a human commander who controls a fleet of spaceships. Players will take on the same role in Homeworld: Vast Reaches in vicious combat against a mysterious new foe.

Read full story

Jensen Huang Will Discuss AI's Future at NVIDIA GTC 2024

Press Release by

Mar 14th, 2024 12:45 Discuss (12 Comments)

NVIDIA's GTC 2024 AI conference will set the stage for another leap forward in AI. At the heart of this highly anticipated event: the opening keynote by Jensen Huang, NVIDIA's visionary founder and CEO, who speaks on Monday, March 18, at 1 p.m. Pacific, at the SAP Center in San Jose, California.

Planning Your GTC Experience
There are two ways to watch. Register to attend GTC in person to secure a spot for an immersive experience at the SAP Center. The center is a short walk from the San Jose Convention Center, where the rest of the conference takes place. Doors open at 11 a.m., and badge pickup starts at 10:30 a.m. The keynote will also be livestreamed at www.nvidia.com/gtc/keynote/.

Read full story

Return to Keyword Browsing

Jun 30th, 2025 20:47 CDT change timezone

Latest GPU Drivers

New Forum Posts

20:33 by Logic
Laptop overclocking adventures (1238)
19:49 by iameatingjam
[INTEL]-How To Update Your Microcode for Intel HX 13/14th Gen. CPUs Laptops/Mobile Easily. (172)
19:40 by Tyler-98-W68
Will you buy a RTX 5090? (584)
19:24 by AusWolf
The TPU UK Clubhouse (26530)
19:24 by Logic
Optane and "enable write caching " (27)
18:51 by A Computer Guy
Question about Intel Optane SSDs (87)
18:45 by lexluthermiester
Do you use Linux? (664)
18:45 by Gold Leader
Remember Fermi? Well here's my EVGA GTX 480 that I picked up for just 19 Euros! (9)
17:56 by HermannSW
Vega owners club (587)
17:38 by mclaren85
Can you guess Which game it is? (194)

Popular Reviews

Jun 30th, 2025 ASUS ROG Crosshair X870E Extreme Review
Jun 20th, 2025 Sapphire Radeon RX 9060 XT Pulse OC 16 GB Review - Samsung Memory Tested
Jun 27th, 2025 AVerMedia CamStream 4K Review
Jun 26th, 2025 Lexar NQ780 4 TB Review
Nov 6th, 2024 AMD Ryzen 7 9800X3D Review - The Best Gaming Processor
May 13th, 2025 Upcoming Hardware Launches 2025 (Updated May 2025)
Mar 5th, 2025 Sapphire Radeon RX 9070 XT Nitro+ Review - Beating NVIDIA
Mar 11th, 2025 AMD Ryzen 9 9950X3D Review - Great for Gaming and Productivity
Jun 25th, 2025 ASRock Phantom Gaming Z890 Riptide Wi-Fi Review
May 27th, 2025 NVIDIA GeForce RTX 5060 8 GB Review

TPU on YouTube

Controversial News Posts