News Posts matching #Generative AI


NVIDIA Hopper Leaps Ahead in Generative AI at MLPerf

It's official: NVIDIA delivered the world's fastest platform in industry-standard tests for inference on generative AI. In the latest MLPerf benchmarks, NVIDIA TensorRT-LLM—software that speeds and simplifies the complex job of inference on large language models—boosted the performance of NVIDIA Hopper architecture GPUs on the GPT-J LLM nearly 3x over their results just six months ago. The dramatic speedup demonstrates the power of NVIDIA's full-stack platform of chips, systems and software to handle the demanding requirements of running generative AI. Leading companies are using TensorRT-LLM to optimize their models. And NVIDIA NIM—a set of inference microservices that includes inferencing engines like TensorRT-LLM—makes it easier than ever for businesses to deploy NVIDIA's inference platform.

Raising the Bar in Generative AI
TensorRT-LLM running on NVIDIA H200 Tensor Core GPUs—the latest, memory-enhanced Hopper GPUs—delivered the fastest performance running inference in MLPerf's biggest test of generative AI to date. The new benchmark uses the largest version of Llama 2, a state-of-the-art large language model packing 70 billion parameters. The model is more than 10x larger than the GPT-J LLM first used in the September benchmarks. The memory-enhanced H200 GPUs, in their MLPerf debut, used TensorRT-LLM to produce up to 31,000 tokens/second, a record on MLPerf's Llama 2 benchmark. The H200 GPU results include up to 14% gains from a custom thermal solution. It's one example of innovations beyond standard air cooling that systems builders are applying to their NVIDIA MGX designs to take the performance of Hopper GPUs to new heights.
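A quick back-of-the-envelope calculation, assuming the 31,000 tokens/second record already includes the full 14% thermal uplift quoted above, shows what the air-cooled baseline would imply:

```python
# Rough check (assumption: the 31,000 tokens/s record already includes
# the up-to-14% gain attributed to the custom thermal solution).
record_tps = 31_000          # H200 tokens/second on MLPerf's Llama 2 70B test
thermal_gain = 0.14          # up to 14% from cooling beyond standard air

# Implied throughput without the custom-cooling uplift
baseline_tps = record_tps / (1 + thermal_gain)
print(f"Implied air-cooled baseline: ~{baseline_tps:,.0f} tokens/s")
```

Treating the 14% as an upper bound, the standard air-cooled figure would sit somewhere between roughly 27,000 and 31,000 tokens/second.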

Intel Gaudi 2 Remains Only Benchmarked Alternative to NV H100 for Generative AI Performance

Today, MLCommons published results of the industry-standard MLPerf v4.0 benchmark for inference. Intel's results for Intel Gaudi 2 accelerators and 5th Gen Intel Xeon Scalable processors with Intel Advanced Matrix Extensions (Intel AMX) reinforce the company's commitment to bring "AI Everywhere" with a broad portfolio of competitive solutions. The Intel Gaudi 2 AI accelerator remains the only benchmarked alternative to the NVIDIA H100 for generative AI (GenAI) performance and provides strong performance-per-dollar. Further, Intel remains the only server CPU vendor to submit MLPerf results. Intel's 5th Gen Xeon results improved by an average of 1.42x compared with 4th Gen Intel Xeon processors' results in MLPerf Inference v3.1.

"We continue to improve AI performance on industry-standard benchmarks across our portfolio of accelerators and CPUs. Today's results demonstrate that we are delivering AI solutions that meet our customers' dynamic and wide-ranging AI requirements. Both Intel Gaudi and Xeon products provide our customers with options that are ready to deploy and offer strong price-to-performance advantages," said Zane Ball, Intel corporate vice president and general manager, DCAI Product Management.

Phison Collaborates with MediaTek to Propel Generative AI Computing and Services

Phison Electronics, a leading provider of NAND controllers and NAND storage solutions, today announced a pivotal strategic collaboration with industry giant MediaTek to push forward innovations in generative artificial intelligence (Generative AI) computing and services, and meet demand for fine-tuning AI model computations across industries. Under the collaboration, Phison's cutting-edge AI computing service, aiDAPTIV+, will pair with MediaTek's premier generative AI service platform, MediaTek DaVinci, heralding a new epoch for AI computing and application services and accelerating the adoption of generative AI in everyday life.

MediaTek DaVinci is an advanced, open platform for generative AI services, built on the Generative AI Service Framework (GAISF). MediaTek DaVinci enables developers to build a variety of plugins for enterprise applications, fostering a vibrant ecosystem and enhancing the user experience. Phison's aiDAPTIV+ features a pioneering SSD-integrated AI computing architecture that breaks down large AI models for concurrent operation with SSDs. This approach significantly reduces infrastructure costs and boosts computational efficiency, enabling the training of substantial AI models with limited GPU and DRAM resources. aiDAPTIV+ has already demonstrated its effectiveness in the Industry 4.0 sector and is poised to accelerate AI transformation across various sectors, bolstering business competitiveness. Additionally, aiDAPTIV+ consumes less power than traditional AI server setups for the same AI model fine-tuning tasks and this aligns with the current trend of minimizing energy consumption and carbon footprint.

MediaTek Launches Next-gen ASIC Design Platform with Co-packaged Optics Solutions

Ahead of the 2024 Optical Fiber Communication Conference (OFC), MediaTek (last week) announced it is launching a next-generation custom ASIC design platform that includes the heterogeneous integration of both high-speed electrical and optical I/Os in the same ASIC implementation. MediaTek will be demonstrating a serviceable socketed implementation that combines 8x800G electrical links and 8x800G optical links for a more flexible deployment. It integrates both MediaTek's in-house SerDes for electrical I/O as well as co-packaged Odin optical engines from Ranovus for optical I/O. Leveraging the heterogeneous solution that includes both 112G LR SerDes and optical modules, this CPO demonstration delivers reduced board space and device costs, boosts bandwidth density, and lowers system power by up to 50% compared to existing solutions.
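The aggregate I/O of the demonstrated package follows directly from the link counts in the announcement; a minimal sketch of the arithmetic:

```python
# Aggregate bandwidth of the demonstrated CPO package, per the announcement
electrical_gbps = 8 * 800    # 8x800G electrical links
optical_gbps    = 8 * 800    # 8x800G optical links

total_tbps = (electrical_gbps + optical_gbps) / 1000
print(f"Electrical: {electrical_gbps / 1000} Tb/s, "
      f"optical: {optical_gbps / 1000} Tb/s, total: {total_tbps} Tb/s")
```

That is 6.4 Tb/s on each side, or 12.8 Tb/s of combined electrical and optical I/O from a single socketed implementation.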

Additionally, Ranovus' Odin optical engine has the option to provide either internal or external laser optical modules to better align with practical usage scenarios. MediaTek's ASIC experience and capabilities in the 3 nm advanced process, 2.5D and 3D advanced packaging, thermal management, and reliability, combined with optical experience, makes it possible for customers to access the latest technology for high-performance computing (HPC), AI/ML and data center networking.

Dell Expands Generative AI Solutions Portfolio, Selects NVIDIA Blackwell GPUs

Dell Technologies is strengthening its collaboration with NVIDIA to help enterprises adopt AI technologies. By expanding the Dell Generative AI Solutions portfolio, including with the new Dell AI Factory with NVIDIA, organizations can accelerate integration of their data, AI tools and on-premises infrastructure to maximize their generative AI (GenAI) investments. "Our enterprise customers are looking for an easy way to implement AI solutions—that is exactly what Dell Technologies and NVIDIA are delivering," said Michael Dell, founder and CEO, Dell Technologies. "Through our combined efforts, organizations can seamlessly integrate data with their own use cases and streamline the development of customized GenAI models."

"AI factories are central to creating intelligence on an industrial scale," said Jensen Huang, founder and CEO, NVIDIA. "Together, NVIDIA and Dell are helping enterprises create AI factories to turn their proprietary data into powerful insights."

Ubisoft Exploring Generative AI, Could Revolutionize NPC Narratives

Have you ever dreamed of having a real conversation with an NPC in a video game? Not just one gated within a dialogue tree of pre-determined answers, but an actual conversation, conducted through spontaneous action and reaction? Lately, a small R&D team at Ubisoft's Paris studio, working with NVIDIA's Audio2Face application and Inworld's large language model (LLM), has been experimenting with generative AI in an attempt to turn this dream into a reality. Their project, NEO NPC, uses GenAI to prod at the limits of how a player can interact with an NPC without breaking the authenticity of the situation they are in, or the character of the NPC itself.

Considering that word—authenticity—the project has had to be a hugely collaborative effort across artistic and scientific disciplines. Generative AI is a hot topic of conversation in the video game industry, and Senior Vice President of Production Technology Guillemette Picard is keen to stress that the goal behind all GenAI projects at Ubisoft is to bring value to the player; and that means continuing to focus on human creativity behind the scenes. "The way we worked on this project is always with our players and our developers in mind," says Picard. "With the player in mind, we know that developers and their creativity must still drive our projects. Generative AI is only of value if it has value for them."

Supermicro Launches Three NVIDIA-Based, Full-Stack, Ready-to-Deploy Generative AI SuperClusters

Supermicro, Inc., a Total IT Solution Provider for AI, Cloud, Storage, and 5G/Edge, is announcing its latest portfolio to accelerate the deployment of generative AI. The Supermicro SuperCluster solutions provide foundational building blocks for the present and the future of large language model (LLM) infrastructure. The three powerful Supermicro SuperCluster solutions are now available for generative AI workloads. The 4U liquid-cooled systems or 8U air-cooled systems are purpose-built and designed for powerful LLM training performance, as well as large batch size and high-volume LLM inference. A third SuperCluster, with 1U air-cooled Supermicro NVIDIA MGX systems, is optimized for cloud-scale inference.

"In the era of AI, the unit of compute is now measured by clusters, not just the number of servers, and with our expanded global manufacturing capacity of 5,000 racks/month, we can deliver complete generative AI clusters to our customers faster than ever before," said Charles Liang, president and CEO of Supermicro. "A 64-node cluster enables 512 NVIDIA HGX H200 GPUs with 72 TB of HBM3e through a couple of our scalable cluster building blocks with 400 Gb/s NVIDIA Quantum-2 InfiniBand and Spectrum-X Ethernet networking. Supermicro's SuperCluster solutions combined with NVIDIA AI Enterprise software are ideal for enterprise and cloud infrastructures to train today's LLMs with up to trillions of parameters. The interconnected GPUs, CPUs, memory, storage, and networking, when deployed across multiple nodes in racks, construct the foundation of today's AI. Supermicro's SuperCluster solutions provide foundational building blocks for rapidly evolving generative AI and LLMs."
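The quoted cluster figures decompose cleanly; a short sanity check, assuming the 141 GB HBM3e capacity from NVIDIA's published H200 specifications:

```python
# Sanity-checking the quoted 64-node / 512-GPU / 72 TB cluster numbers
# (141 GB of HBM3e per H200 GPU is assumed from NVIDIA's published specs).
nodes = 64
gpus = 512
print(gpus // nodes)               # GPUs per node -> 8, one HGX H200 board each

hbm_per_gpu_gb = 141
total_hbm_tb = gpus * hbm_per_gpu_gb / 1000
print(f"{total_hbm_tb:.1f} TB")    # ~72.2 TB, matching the quoted 72 TB
```

So the 64-node SuperCluster works out to eight GPUs per node, and the per-GPU memory implied by 72 TB across 512 GPUs is consistent with the H200's 141 GB.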

NVIDIA Launches Blackwell-Powered DGX SuperPOD for Generative AI Supercomputing at Trillion-Parameter Scale

NVIDIA today announced its next-generation AI supercomputer—the NVIDIA DGX SuperPOD powered by NVIDIA GB200 Grace Blackwell Superchips—for processing trillion-parameter models with constant uptime for superscale generative AI training and inference workloads.

Featuring a new, highly efficient, liquid-cooled rack-scale architecture, the new DGX SuperPOD is built with NVIDIA DGX GB200 systems and provides 11.5 exaflops of AI supercomputing at FP4 precision and 240 terabytes of fast memory—scaling to more with additional racks.
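The headline numbers decompose if the base SuperPOD comprises eight DGX GB200 systems; the per-system figures below (roughly 1.44 exaflops FP4 and 30 TB of fast memory per GB200 NVL72 rack) are assumptions drawn from NVIDIA's GB200 NVL72 specifications:

```python
# Decomposing the DGX SuperPOD headline figures (assumed configuration:
# eight DGX GB200 NVL72 systems at ~1.44 exaflops FP4 and 30 TB fast
# memory each, per NVIDIA's published GB200 NVL72 specs).
systems = 8
fp4_exaflops_per_system = 1.44
fast_memory_tb_per_system = 30

print(systems * fp4_exaflops_per_system)    # ~11.5 exaflops FP4
print(systems * fast_memory_tb_per_system)  # 240 TB of fast memory
```

Eight such racks give 11.52 exaflops and exactly 240 TB, matching the announced totals, with "scaling to more" meaning additional racks beyond the base eight.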

NVIDIA Blackwell Platform Arrives to Power a New Era of Computing

Powering a new era of computing, NVIDIA today announced that the NVIDIA Blackwell platform has arrived—enabling organizations everywhere to build and run real-time generative AI on trillion-parameter large language models at up to 25x less cost and energy consumption than its predecessor.

The Blackwell GPU architecture features six transformative technologies for accelerated computing, which will help unlock breakthroughs in data processing, engineering simulation, electronic design automation, computer-aided drug design, quantum computing and generative AI—all emerging industry opportunities for NVIDIA.

TSMC and Synopsys Bring Breakthrough NVIDIA Computational Lithography Platform to Production

NVIDIA today announced that TSMC and Synopsys are going into production with NVIDIA's computational lithography platform to accelerate manufacturing and push the limits of physics for the next generation of advanced semiconductor chips. TSMC, the world's leading foundry, and Synopsys, the leader in silicon to systems design solutions, have integrated NVIDIA cuLitho with their software, manufacturing processes and systems to speed chip fabrication, and in the future support the latest-generation NVIDIA Blackwell architecture GPUs.

"Computational lithography is a cornerstone of chip manufacturing," said Jensen Huang, founder and CEO of NVIDIA. "Our work on cuLitho, in partnership with TSMC and Synopsys, applies accelerated computing and generative AI to open new frontiers for semiconductor scaling." NVIDIA also introduced new generative AI algorithms that enhance cuLitho, a library for GPU-accelerated computational lithography, dramatically improving the semiconductor manufacturing process over current CPU-based methods.

MSI Showcases Liquid Cooled Server Platforms For Data Centers at CloudFest 2024

MSI, a leading global server provider, will showcase its latest liquid-cooled and GPU servers powered by AMD processors and 5th Gen Intel Xeon Scalable processors, optimized to meet the evolving needs of modern data centers, at CloudFest 2024, booth #H02 in Europa-Park from March 19-21. "With an increasing number of data centers leveraging applications like AI to enhance customer experience, the demands for more computing power and higher density deployments have driven significant changes in IT infrastructure, leading to greater use of liquid cooling," said Danny Hsu, General Manager of Enterprise Platform Solutions. "MSI's liquid-cooled server platforms enable data centers to deliver efficiency while deploying more compute-intensive workloads."

The G4101 is a 4U 4GPU server platform designed for AI training workloads. It supports a single AMD EPYC 9004 Series processor equipped with a liquid cooling module, along with twelve DDR5 RDIMM slots. Additionally, it features four PCIe 5.0 x16 slots tailored for triple-slot graphics cards with coolers, ensuring increased airflow and sustained performance. With twelve front 2.5-inch U.2 NVMe/SATA drive bays, it offers high-speed and flexible storage options, catering to the diverse needs of AI workloads. The G4101 combines airflow spacing and closed-loop liquid cooling, making it the optimal thermal management solution for even the most demanding tasks.

Extropic Intends to Accelerate AI through Thermodynamic Computing

Extropic, a pioneer in physics-based computing, this week emerged from stealth mode and announced the release of its Litepaper, which outlines the company's revolutionary approach to AI acceleration through thermodynamic computing. Founded in 2022 by Guillaume Verdon, Extropic has been developing novel chips and algorithms that leverage the natural properties of out-of-equilibrium thermodynamic systems to perform probabilistic computations for generative AI applications in a highly efficient manner. The Litepaper delves into Extropic's groundbreaking computational paradigm, which aims to address the limitations of current digital hardware in handling the complex probability distributions required for generative AI.

Today's algorithms spend around 25% of their time moving numbers around in memory, limiting the speedup achievable by accelerating specific operations. In contrast, Extropic's chips natively accelerate a broad class of probabilistic algorithms by running them physically as a rapid and energy-efficient, physics-based process in their entirety, unlocking a new regime of AI acceleration well beyond what was previously thought achievable. In coming out of stealth, the company has announced the fabrication of a superconducting prototype processor and developments surrounding room-temperature semiconductor-based devices for the broader market, with the goal of revolutionizing the field of AI acceleration and enabling new possibilities in generative AI.
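The claim about time spent moving data is an instance of Amdahl's law: if roughly 25% of runtime is memory movement that an accelerator does not touch, overall speedup is capped at 1 / 0.25 = 4x, no matter how fast the remaining compute becomes. A minimal sketch:

```python
# Amdahl's law applied to the claim above: a serial (unaccelerated)
# fraction of ~25% caps overall speedup at 1 / 0.25 = 4x regardless of
# how much the remaining 75% of the work is accelerated.
def amdahl_speedup(serial_fraction: float, accel: float) -> float:
    """Overall speedup when (1 - serial_fraction) of the work runs accel times faster."""
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / accel)

for accel in (10, 100, 1_000_000):
    print(accel, round(amdahl_speedup(0.25, accel), 2))
```

This is why Extropic argues for running entire probabilistic algorithms physically: eliminating the unaccelerated fraction, rather than speeding up individual operations, is what lifts the ceiling.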

Acer Reports FY2023 Net Income of NT$4.93 Billion and Announces NT$1.6 Cash Dividend Per Share

Acer Inc. (TWSE: 2353) announced today its financial results for the fourth quarter of 2023 and fiscal 2023 ended December 31. In the fourth quarter, Acer reported consolidated revenues of NT$63.15 billion, gross profits of NT$6.91 billion with a 10.9% margin, operating income of NT$1.39 billion with a 2.2% margin, and net income of NT$1.02 billion with earnings per share (EPS) of NT$0.34.

For the full year of 2023, consolidated revenues reached NT$241.31 billion, gross profits were NT$25.82 billion with a 10.7% margin, operating income was NT$4.23 billion with a 1.8% margin, and net income was NT$4.93 billion with earnings per share (EPS) of NT$1.64. Acer's computer and display business has returned to profitability and normal seasonality while inventory is under control. The company is optimistic about the business opportunities that artificial intelligence brings and expects generative AI to become a megatrend in 2024 and beyond.
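The quoted margins can be reproduced directly from the reported figures; a brief check of the arithmetic:

```python
# Reproducing the quoted margins from Acer's reported figures (NT$ billions)
q4_revenue, q4_gross, q4_operating = 63.15, 6.91, 1.39
fy_revenue, fy_gross, fy_operating = 241.31, 25.82, 4.23

def pct(part: float, whole: float) -> float:
    """Margin as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)

print(pct(q4_gross, q4_revenue), pct(q4_operating, q4_revenue))  # 10.9 2.2
print(pct(fy_gross, fy_revenue), pct(fy_operating, fy_revenue))  # 10.7 1.8
```

Both the quarterly and full-year margins match the press-release percentages.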

NVIDIA Introduces Generative AI Professional Certification

NVIDIA is offering a new professional certification in generative AI to enable developers to establish technical credibility in this important domain. Generative AI is revolutionizing industries worldwide, yet there's a critical skills gap and a need to upskill employees to more fully harness the technology. Available for the first time from NVIDIA, this new professional certification enables developers, career professionals, and others to validate and showcase their generative AI skills and expertise. The program introduces two associate-level generative AI certifications, focusing on proficiency in large language models and multimodal workflow skills.

"Generative AI has moved to center stage as governments, industries and organizations everywhere look to harness its transformative capabilities," NVIDIA founder and CEO Jensen Huang recently said. The certification will become available starting at GTC, where in-person attendees can also access recommended training to prepare for a certification exam. "Organizations in every industry need to increase their expertise in this transformative technology," said Greg Estes, VP of developer programs at NVIDIA. "Our goals are to assist in upskilling workforces, sharpen the skills of qualified professionals, and enable individuals to demonstrate their proficiency in order to gain a competitive advantage in the job market."

NVIDIA and HP Supercharge Data Science and Generative AI on Workstations

NVIDIA and HP Inc. today announced that NVIDIA CUDA-X data processing libraries will be integrated with HP AI workstation solutions to turbocharge the data preparation and processing work that forms the foundation of generative AI development.

Built on the NVIDIA CUDA compute platform, CUDA-X libraries speed data processing for a broad range of data types, including tables, text, images and video. They include the NVIDIA RAPIDS cuDF library, which accelerates the work of the nearly 10 million data scientists using pandas software by up to 110x using an NVIDIA RTX 6000 Ada Generation GPU instead of a CPU-only system, without requiring any code changes.

Qualcomm AI Hub Introduced at MWC 2024

Qualcomm Technologies, Inc. unveiled its latest advancements in artificial intelligence (AI) at Mobile World Congress (MWC) Barcelona. From the new Qualcomm AI Hub, to cutting-edge research breakthroughs and a display of commercial AI-enabled devices, Qualcomm Technologies is empowering developers and revolutionizing user experiences across a wide range of devices powered by Snapdragon and Qualcomm platforms.

"With Snapdragon 8 Gen 3 for smartphones and Snapdragon X Elite for PCs, we sparked commercialization of on-device AI at scale. Now with the Qualcomm AI Hub, we will empower developers to fully harness the potential of these cutting-edge technologies and create captivating AI-enabled apps," said Durga Malladi, senior vice president and general manager, technology planning and edge solutions, Qualcomm Technologies, Inc. "The Qualcomm AI Hub provides developers with a comprehensive AI model library to quickly and easily integrate pre-optimized AI models into their applications, leading to faster, more reliable and private user experiences."

Supermicro Accelerates Performance of 5G and Telco Cloud Workloads with New and Expanded Portfolio of Infrastructure Solutions

Supermicro, Inc. (NASDAQ: SMCI), a Total IT Solution Provider for AI, Cloud, Storage, and 5G/Edge, delivers an expanded portfolio of purpose-built infrastructure solutions to accelerate performance and increase efficiency in 5G and telecom workloads. With one of the industry's most diverse offerings, Supermicro enables customers to expand public and private 5G infrastructures with improved performance per watt and support for new and innovative AI applications. As a long-term advocate of open networking platforms and a member of the O-RAN Alliance, Supermicro's portfolio incorporates systems featuring 5th Gen Intel Xeon processors, AMD EPYC 8004 Series processors, and the NVIDIA Grace Hopper Superchip.

"Supermicro is expanding our broad portfolio of sustainable and state-of-the-art servers to address the demanding requirements of 5G and telco markets and Edge AI," said Charles Liang, president and CEO of Supermicro. "Our products are not just about technology, they are about delivering tangible customer benefits. We quickly bring data center AI capabilities to the network's edge using our Building Block architecture. Our products enable operators to offer new capabilities to their customers with improved performance and lower energy consumption. Our edge servers contain up to 2 TB of high-speed DDR5 memory, 6 PCIe slots, and a range of networking options. These systems are designed for increased power efficiency and performance-per-watt, enabling operators to create high-performance, customized solutions for their unique requirements. This reassures our customers that they are investing in reliable and efficient solutions."

Jensen Huang to Unveil Latest AI Breakthroughs at GTC 2024 Conference

NVIDIA today announced it will host its flagship GTC 2024 conference at the San Jose Convention Center from March 18-21. More than 300,000 people are expected to register to attend in person or virtually. NVIDIA founder and CEO Jensen Huang will deliver the keynote from the SAP Center on Monday, March 18, at 1 p.m. Pacific time. It will be livestreamed and available on demand. Registration is not required to view the keynote online. Since Huang first highlighted machine learning in his 2014 GTC keynote, NVIDIA has been at the forefront of the AI revolution. The company's platforms have played a crucial role in enabling AI across numerous domains including large language models, biology, cybersecurity, data center and cloud computing, conversational AI, networking, physics, robotics, and quantum, scientific and edge computing.

The event's 900 sessions and over 300 exhibitors will showcase how organizations are deploying NVIDIA platforms to achieve remarkable breakthroughs across industries, including aerospace, agriculture, automotive and transportation, cloud services, financial services, healthcare and life sciences, manufacturing, retail and telecommunications. "Generative AI has moved to center stage as governments, industries and organizations everywhere look to harness its transformative capabilities," Huang said. "GTC has become the world's most important AI conference because the entire ecosystem is there to share knowledge and advance the state of the art. Come join us."

Samsung Electronics Collaborates with Arm on Optimized Next Gen Cortex-X CPU Using 2nm SF2 GAAFET Process

Samsung Electronics Co., Ltd., a world leader in advanced semiconductor technology, today announced a collaboration to deliver an optimized next-generation Arm Cortex-X CPU developed on Samsung Foundry's latest Gate-All-Around (GAA) process technology. This initiative builds on years of partnership, with millions of devices shipped with Arm CPU intellectual property (IP) on various process nodes offered by Samsung Foundry.

This collaboration sets the stage for a series of announcements and planned innovation between Samsung and Arm. The companies have bold plans to reinvent 2-nanometer (nm) GAA for next-generation data center and infrastructure custom silicon, and a groundbreaking AI chiplet solution that will revolutionize the future generative artificial intelligence (AI) mobile computing market.

NVIDIA Introduces NVIDIA RTX 2000 Ada Generation GPU

Generative AI is driving change across industries—and to take advantage of its benefits, businesses must select the right hardware to power their workflows. The new NVIDIA RTX 2000 Ada Generation GPU delivers the latest AI, graphics and compute technology to compact workstations, offering up to 1.5x the performance of the previous-generation RTX A2000 12 GB in professional workflows. From crafting stunning 3D environments to streamlining complex design reviews to refining industrial designs, the card's capabilities pave the way for an AI-accelerated future, empowering professionals to achieve more without compromising on performance or capabilities. Modern multi-application workflows, such as AI-powered tools, multi-display setups and high-resolution content, put significant demands on GPU memory. With 16 GB of memory in the RTX 2000 Ada, professionals can tap the latest technologies and tools to work faster and better with their data.

Powered by NVIDIA RTX technology, the new GPU delivers impressive realism in graphics with NVIDIA DLSS, delivering ultra-high-quality, photorealistic ray-traced images more than 3x faster than before. In addition, the RTX 2000 Ada enables an immersive experience for enterprise virtual-reality workflows, such as for product design and engineering design reviews. With its blend of performance, versatility and AI capabilities, the RTX 2000 Ada helps professionals across industries achieve efficiencies. Architects and urban planners can use it to accelerate visualization workflows and structural analysis, enhancing design precision. Product designers and engineers using industrial PCs can iterate rapidly on product designs with fast, photorealistic rendering and AI-powered generative design. Content creators can edit high-resolution videos and images seamlessly, and use AI for realistic visual effects and content creation assistance. And in vital embedded applications and edge computing, the RTX 2000 Ada can power real-time data processing for medical devices, optimize manufacturing processes with predictive maintenance and enable AI-driven intelligence in retail environments.

Cisco & NVIDIA Announce Easy to Deploy & Manage Secure AI Solutions for Enterprise

This week, Cisco and NVIDIA have announced plans to deliver AI infrastructure solutions for the data center that are easy to deploy and manage, enabling the massive computing power that enterprises need to succeed in the AI era. "AI is fundamentally changing how we work and live, and history has shown that a shift of this magnitude is going to require enterprises to rethink and re-architect their infrastructures," said Chuck Robbins, Chair and CEO, Cisco. "Strengthening our great partnership with NVIDIA is going to arm enterprises with the technology and the expertise they need to build, deploy, manage, and secure AI solutions at scale." Jensen Huang, founder and CEO of NVIDIA, said: "Companies everywhere are racing to transform their businesses with generative AI. Working closely with Cisco, we're making it easier than ever for enterprises to obtain the infrastructure they need to benefit from AI, the most powerful technology force of our lifetime."

A Powerful Partnership
Cisco, with its industry-leading expertise in Ethernet networking and extensive partner ecosystem, together with NVIDIA, the inventor of the GPU that fueled the AI boom, share a vision and commitment to help customers navigate the transitions for AI with highly secure Ethernet-based infrastructure. Cisco and NVIDIA have offered a broad range of integrated product solutions over the past several years across Webex collaboration devices and data center compute environments to enable hybrid workforces with flexible workspaces, AI-powered meetings and virtual desktop infrastructure.

Huawei Reportedly Prioritizing Ascend AI GPU Production

Huawei's Ascend 910B AI GPU is reportedly in high demand in China—we last learned that NVIDIA's latest US sanction-busting H20 "Hopper" model is lined up as a main competitor, allegedly in terms of both pricing and performance. A recent Reuters report proposes that Huawei is reacting to native enterprise market trends by shifting its production priorities—in favor of Ascend product ranges, while demoting their Kirin smartphone chipset family. Generative AI industry experts believe that the likes of Alibaba and Tencent have rejected Team Green's latest batch of re-jigged AI chips (H20, L20 and L2)—tastes have gradually shifted to locally developed alternatives.

Huawei leadership is seemingly keen to seize these growth opportunities—their Ascend 910B is supposedly ideal for workloads "that require low-to-mid inferencing power." Reuters has spoken to three anonymous sources—all with insider knowledge of goings-on at a single facility that manufactures Ascend AI chips and Kirin smartphone SoCs. Two of the leakers claim that this unnamed fabrication location is facing many "production quality" challenges, namely output being "hamstrung by a low yield rate." The report claims that Huawei has pivoted by deprioritizing Kirin 9000S (7 nm) production, thus creating a knock-on effect for its premium Mate 60 smartphone range.

FTC Launches Inquiry into Generative AI Investments and Partnerships

The Federal Trade Commission announced today that it issued orders to five companies requiring them to provide information regarding recent investments and partnerships involving generative AI companies and major cloud service providers. The agency's 6(b) inquiry will scrutinize corporate partnerships and investments with AI providers to build a better internal understanding of these relationships and their impact on the competitive landscape. The compulsory orders were sent to Alphabet, Inc., Amazon.com, Inc., Anthropic PBC, Microsoft Corp., and OpenAI, Inc.

"History shows that new technologies can create new markets and healthy competition. As companies race to develop and monetize AI, we must guard against tactics that foreclose this opportunity," said FTC Chair Lina M. Khan. "Our study will shed light on whether investments and partnerships pursued by dominant companies risk distorting innovation and undermining fair competition."

Team Group Launches the Industrial P745 Gen 4 SSD

Team Group, a leading industrial memory and storage provider, has employed its advanced R&D capabilities and manufacturing processes to launch the industrial P745 SSD, which combines 112-layer 3D NAND flash memory, PCIe Gen 4x4 speeds, and an 8-channel controller. Emphasizing high transfer speeds, power efficiency, and low latency, the P745 SSD delivers sequential read and write speeds of up to 7,000 MB/s and 6,200 MB/s, respectively, with excellent IOPS performance. To meet the demands of AI applications, temperature control has been enhanced to maintain stable, high-speed performance. In the rapidly developing era of AI and high-performance computing, Team Group continues to provide the best industrial storage solutions.

The P745 SSD is available in both standard temperature (0 to 70°C) and wide temperature (-40 to 85°C) models. It integrates Team Group's cooling technology, the patented graphene and fin heat sinks, resulting in a significant temperature reduction of about 8-15% compared to common products without fin heat sinks. The P745 can be configured to meet the needs of different application environments, enabling the product to maintain stable operation at high temperatures and high performance. The P745 is also equipped with advanced firmware that protects data by automatically adjusting speeds when temperatures exceed the safe range. With a maximum capacity of 4 TB, the P745 is an NVMe 1.4 drive that uses the PCIe Gen 4x4 interface and is backward compatible with PCIe 3.0 platforms. It features a built-in DRAM cache buffer for high-speed AI computing that enhances system loading and data caching, reducing NAND flash wear and increasing product life span. In addition, the P745 is equipped with an LDPC error correction function and AES 256-bit high-level encryption technology to ensure the accuracy and security of data transmission.
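The 7,000 MB/s sequential read figure sits close to the interface's theoretical limit; a short calculation from the PCIe 4.0 lane rate and line coding shows how close:

```python
# How close 7,000 MB/s comes to the PCIe 4.0 x4 ceiling: each Gen4 lane
# runs at 16 GT/s with 128b/130b encoding, so usable bandwidth per lane
# is 16 * (128/130) / 8 GB/s.
lanes = 4
gt_per_s = 16                # PCIe 4.0 transfer rate per lane
encoding = 128 / 130         # 128b/130b line-coding efficiency

ceiling_gb_s = lanes * gt_per_s * encoding / 8
print(f"Interface ceiling: ~{ceiling_gb_s:.2f} GB/s")
print(f"Utilization at 7,000 MB/s: {7.0 / ceiling_gb_s:.0%}")
```

At roughly 7.88 GB/s of usable x4 bandwidth, the drive's headline read speed uses close to 90% of what the interface can carry, before protocol overhead is even counted.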

Intel and DigitalBridge Launch Articul8, an Enterprise Generative AI Company

Intel Corp and DigitalBridge Group, Inc., a global investment firm, today announced the formation of Articul8 AI, Inc. (Articul8), an independent company offering enterprise customers a full-stack, vertically-optimized and secure generative artificial intelligence (GenAI) software platform. The platform delivers AI capabilities that keep customer data, training and inference within the enterprise security perimeter. The platform also provides customers the choice of cloud, on-prem or hybrid deployment.

Articul8 was created with intellectual property (IP) and technology developed at Intel, and the two companies will remain strategically aligned on go-to-market opportunities and collaborate on driving GenAI adoption in the enterprise. Arun Subramaniyan, formerly vice president and general manager in Intel's Data Center and AI Group, has assumed leadership of Articul8 as its CEO.
May 1st, 2024 07:47 EDT