News Posts matching #startup

Return to Keyword Browsing

IBM & Inclusive Brains Announce Collab: Combining AI, Quantum & Neurotechnologies

Press Release by

Jun 4th, 2025 12:00 Discuss (0 Comments)

IBM and Inclusive Brains, a leader in non-invasive neurotechnologies and multimodal artificial intelligence, have entered a joint study agreement to experiment with advanced AI and quantum machine learning techniques. The aim of the joint study is to boost the performance of multi-modal brain-machine interfaces (BMIs).

Innovation with Positive Social Impact
BMIs have the potential to enable individuals with disabilities—particularly those who have lost the ability to use their hands or voice—to leverage connected devices and digital environments to regain control of their surroundings, eliminating the need for vocal commands or physical operation of a keyboard, screen or mouse. Inclusive Brains aims to improve access to education and employment opportunities using the insights generated in the joint study. Beyond better inclusion of people with paralysis, Inclusive Brains aims to broader societal benefits, including improved prevention of both physical and mental health issues among the wider population thanks to enhanced classifications and therefore a better understanding of brain activity patterns.

Read full story

Qualcomm & HUMAIN to Develop State-of-the-Art AI Data Centers

Press Release by

May 14th, 2025 11:37 Discuss (0 Comments)

Qualcomm Technologies, Inc., a technology leader enabling connected intelligent computing, and HUMAIN, a national artificial intelligence (AI) champion advancing Saudi Arabia's goals for AI, today signed a Memorandum of Understanding (MOU) with the intent to enter a strategic collaboration for the development of next-generation AI data centers, infrastructure and cloud-to-edge services to meet the rapidly growing demand for AI across the globe, including the Kingdom of Saudi Arabia. The MOU was signed during the Saudi-US Investment Forum in Riyadh, which was held during the official visit of President of the United States Donald Trump to the Kingdom of Saudi Arabia.

The data centers and supporting ecosystem are intended to provide both government and enterprise organizations with access to high-performance and power efficient CPU and AI cloud infrastructure and cloud-to-edge services. These offerings will enable the deployment of AI solutions that can make real-time predictions and decisions, in addition to significantly increasing the availability and value of advanced AI-enabled applications.

Read full story

NVIDIA Dismisses Anthropic's Report of Ludicrous GPU & CPU Smuggling Methods

by

May 2nd, 2025 10:25 Discuss (16 Comments)

The first couple of paragraphs within Anthropic's "Securing America's Compute Advantage: (Our) Position on the Diffusion Rule" article are standard fare. Roughly half-way through a read of this policy-related piece, the North American (Amazon-backed) AI startup makes some bizarre claims about the smuggling of AI-oriented products into China. Given ongoing global tensions and growing industry demands, these activities are somewhat expected—but Anthropic leadership described very specific methodologies. As stated within their "Chip Smuggling is a Major Threat" passage: "China has established sophisticated smuggling operations, with documented cases involving hundreds of millions of dollars worth of chips. In some cases, smugglers have employed creative methods to circumvent export controls, including hiding processors in prosthetic baby bumps and packing GPUs alongside live lobsters." Specific bits of hardware were not mentioned in this section, but the author later alludes to the frictionless transfer of thousands of "NVIDIA H100 advanced chips" into Chinese territories.

In a statement issued to CNBC, a Team Green spokesperson dismissed Anthropic's fanciful claims: "American firms should focus on innovation and rise to the challenge, rather than tell tall tales that large, heavy, and sensitive electronics are somehow smuggled in 'baby bumps' or 'alongside' live lobsters." This very public spat has received mainstream attention; with further coverage documenting additional "to and fro" barbs. NVIDIA criticized Anthropic's anti-foreign competition stance: "China, with half of the world's AI researchers, has highly capable AI experts at every layer of the AI stack. America cannot manipulate regulators to capture victory in AI." Amusingly, Anthropic's operations rely heavily on Team Green hardware—many online critics reckon that top US AI companies are jostling for priority access to cutting-edge GPUs/accelerators. In reaction to NVIDIA's dismissal of their report, a company spokesperson retorted with: "Anthropic stands by its recently filed public submission in support of strong and balanced export controls that help secure America's lead in infrastructure development and ensure that the values of freedom and democracy shape the future of AI."

Safe Superintelligence Inc. Uses Google TPUs Instead of Regular GPUs for Next-Generation Models

by

Apr 10th, 2025 04:04 Discuss (7 Comments)

It seems like Google aims to grab a bit of the market share from NVIDIA and AMD by offering startups large compute deals and allowing them to train their massive AI models on the Google Cloud Platform (GCP). One such case is the OpenAI co-founder Ilya Sutskever's Safe Superintelligence Inc. (SSI) startup. According to a GCP post, SSI is "partnering with Google Cloud to use TPUs to accelerate its research and development efforts toward building a safe, superintelligent AI." Google's latest TPU v7p, codenamed Ironwood, was released yesterday. Carrying 4,614 TeraFLOPS of FP8 precision and 192 GB of HBM memory, these TPUs are interconnected using Google's custom ICI infrastructure and are scaled to configurations in pods of 9,216 chips, where Ironwood delivers 42.5 ExaFLOPS of total computing power.

For AI training, this massive power will allow AI models to quickly go over training, accelerating research iterations and ultimately accelerating model development. For SSI, the end goal is a simple mission: achieving ASI with safety at the front. "We approach safety and capabilities in tandem, as technical problems to be solved through revolutionary engineering and scientific breakthroughs. We plan to advance capabilities as fast as possible while making sure our safety always remains ahead," notes the SSI website, adding that "Our singular focus means no distraction by management overhead or product cycles, and our business model means safety, security, and progress are all insulated from short-term commercial pressures."

Read full story

OpenAI Reportedly Considering ~$500 Million Takeover of io Products - a Smart AI Device Startup Founded by Sam Altman

by

Apr 8th, 2025 13:01 Discuss (6 Comments)

At some point in 2023, Sam Altman founded a new AI hardware device firm: io Products. Last September, Jony Ive confirmed his involvement in this startup tech company. Industry whispers suggest that Open AI's CEO (Altman) and the former Apple chief designer (Ive) were collaborating on the making of voice-enabled smart AI assistant household products. According to a (paywalled) The Information news article, this fledgling operation attracted an "undisclosed amount of funding" from wealthy benefactors. The "small team" is reportedly working on some type of "revolutionary" screenless AI phone, but this rumored project is likely in a very early stage of development—so moles have not disclosed exact details.

The report posits that Ive has recruited Tang Tan and Evans Hankey—both high-level ex-Apple industrial leads, with long-term iPhone design experience. The Information's anonymous inside sources believe that Ive's LoveFrom boutique agency will determine the visual setup for forthcoming io Products smart devices. Additionally, the report extends to a potential takeover bid—valued at around $500 million (USD)—coming from Altman's main gig. Open AI's leadership is supposedly weighing up its options. The Information disclosed an alternative avenue; a simple partnership—rather than a comprehensive takeover.

Humane AI Pin To Become $699 Paperweight Post HP Acquisition

by

Feb 19th, 2025 10:20 Discuss (8 Comments)

The AI Pin - Humane's first, and last product - received an overwhelmingly negative reception at launch, and that's putting it mildly. The device was hindered by poor usability and an extremely laggy operating system that made even the simplest of tasks an absolute chore. The product, unsurprisingly, was criticized by almost every reviewer who touched it, and rumors regarding a potential acquisition of Humane soon began to circulate. Now, HP has seemingly finalized a $116-million acquisition offer for the startup that once raised over $230 million, and valued itself at $1 billion. It appears that HP wants almost all of Humane's assets - with the notable exception of one key product, the AI Pin.

As such, once the calendar reaches the 28th of February, the AI Pin will cease to function. The servers that the AI Pin utilized for communications will be shut down, rendering the AI Pin unable to access customer's cloud data, make phone calls, send AI queries, or text messages. Customers who bought the Pin in the last 90 days will be eligible for a full refund, whereas those who bought the ill-fated gadget before that will soon have an arguably good-looking fashion accessory that will probably still be able to tell them the time. After all, the product's build quality and design was the lone aspect of the AI Pin that received praise. It remains to be seen how HP puts Humane's R&D to use, although considering the AI hype that is engulfing the consumer tech industry, it wouldn't be surprising to see HP leverage Humane's tech to carve out its own niche in the AI-assisted personal computing space. Of course, that is contingent on HP's implementation being better than what Humane pulled off the last time around.

Quobly Announces Key Milestone for Fault-tolerant Quantum Computing

Press Release by

Dec 10th, 2024 05:19 Discuss (0 Comments)

Quobly, a leading French quantum computing startup, has reported that FD-SOI technology can serve as a scalable platform for commercial quantum computing, leveraging traditional semiconductor manufacturing fabs and CEA-Leti's R&D pilot line.

The semiconductor industry has played a pivotal role in enabling classical computers to scale at cost; it has the same transformative potential for quantum computers, making them commercially scalable and cost competitive. Silicon spin qubits are excellent for achieving fault-tolerant, large-scale quantum computing, registering clock speeds in the µsec range, fidelity above 99% for one and two-qubit gate operations and incomparably small unit cell sizes (in the hundredths of 100 nm²).

Read full story

Ubitium Debuts First Universal RISC-V Processor: CPU, GPU, DSP, FPGA All in One Chip

Press Release by

Nov 21st, 2024 11:44 Discuss (11 Comments)

For over half a century, general-purpose processors have been built on the Tomasulo algorithm, developed by IBM engineer Robert Tomasulo in 1967. It's a $500B industry built on specialized CPU, GPU and other chips for different computing tasks. Hardware startup Ubitium has shattered this paradigm with a breakthrough universal processor that handles all computing workloads on a single, efficient chip - unlocking simpler, smarter, and more cost-effective devices across industries - while revolutionizing a 57-year-old industry standard.

Alongside this, Ubitium is announcing a $3.7 million in seed funding round, co-led by Runa Capital, Inflection, and KBC Focus Fund. The investment will be used to develop the first prototypes and prepare initial development kits for customers, with the first chips planned for 2026.

Read full story

Q.ANT Introduces First Commercial Photonic Processor

Press Release by

Nov 19th, 2024 08:50 Discuss (10 Comments)

Q.ANT, the leading startup for photonic computing, today announced the launch of its first commercial product - a photonics-based Native Processing Unit (NPU) built on the company's compute architecture LENA - Light Empowered Native Arithmetics. The product is fully compatible with today's existing computing ecosystem as it comes on the industry-standard PCI-Express. The Q.ANT NPU executes complex, non-linear mathematics natively using light instead of electrons, promising to deliver at least 30 times greater energy efficiency and significant computational speed improvements over traditional CMOS technology. Designed for compute-intensive applications such as AI Inference, machine learning, and physics simulation, the Q.ANT NPU has been proven to solve real-world challenges, including number recognition for deep neural network inference (see the recent press release regarding Cloud Access to NPU).

"With our photonic chip technology now available on the standard PCIe interface, we're bringing the incredible power of photonics directly into real-world applications. For us, this is not just a processor—it's a statement of intent: Sustainability and performance can go hand in hand," said Dr. Michael Förtsch, CEO of Q.ANT. "For the first time, developers can create AI applications and explore the capabilities of photonic computing, particularly for complex, nonlinear calculations. For example, experts calculated that one GPT-4 query today uses 10 times more electricity than a regular internet search request. Our photonic computing chips offer the potential to reduce the energy consumption for that query by a factor of 30."

Read full story

Mobilint Debuts New AI Chips at Silicon Valley Summit

Press Release by

Sep 9th, 2024 13:16 Discuss (1 Comment)

Mobilint, an edge AI chip company led by CEO Dongjoo Shin, is set to make waves at the upcoming AI Hardware & Edge AI Summit 2024 in Silicon Valley. The three-day event, starting on September 10th, will showcase Mobilint's latest innovations in AI chip technology. The company will demonstrate live demos of its high-efficiency SoC 'REGULUS' for on-device AI and high-performance acceleration chip 'ARIES' for on-premises AI.

The AI Hardware Summit is an annual event where global IT giants such as Microsoft, NVIDIA, Google, Meta, and AMD, along with prominent startups, gather to share their developments in AI and machine learning. This year's summit features world-renowned AI experts as speakers, including Andrew Ng, CEO of Landing AI, and Mark Russinovich, CTO of Microsoft Azure.

Read full story

X-Silicon Startup Wants to Combine RISC-V CPU, GPU, and NPU in a Single Processor

by

Apr 5th, 2024 08:35 Discuss (31 Comments)

While we are all used to having a system with a CPU, GPU, and, recently, NPU—X-Silicon Inc. (XSi), a startup founded by former Silicon Valley veterans—has unveiled an interesting RISC-V processor that can simultaneously handle CPU, GPU, and NPU workloads in a chip. This innovative chip architecture, which will be open-source, aims to provide a flexible and efficient solution for a wide range of applications, including artificial intelligence, virtual reality, automotive systems, and IoT devices. The new microprocessor combines a RISC-V CPU core with vector capabilities and GPU acceleration into a single chip, creating a versatile all-in-one processor. By integrating the functionality of a CPU and GPU into a single core, X-Silicon's design offers several advantages over traditional architectures. The chip utilizes the open-source RISC-V instruction set architecture (ISA) for both CPU and GPU operations, running a single instruction stream. This approach promises lower memory footprint execution and improved efficiency, as there is no need to copy data between separate CPU and GPU memory spaces.

Called the C-GPU architecture, X-Silicon uses RISC-V Vector Core, which has 16 32-bit FPUs and a Scaler ALU for processing regular integers as well as floating point instructions. A unified instruction decoder feeds the cores, which are connected to a thread scheduler, texture unit, rasterizer, clipping engine, neural engine, and pixel processors. All is fed into a frame buffer, which feeds the video engine for video output. The setup of the cores allows the users to program each core individually for HPC, AI, video, or graphics workloads. Without software, there is no usable chip, which prompts X-Silicon to work on OpenGL ES, Vulkan, Mesa, and OpenCL APIs. Additionally, the company plans to release a hardware abstraction layer (HAL) for direct chip programming. According to Jon Peddie Research (JPR), the industry has been seeking an open-standard GPU that is flexible and scalable enough to support various markets. X-Silicon's CPU/GPU hybrid chip aims to address this need by providing manufacturers with a single, open-chip design that can handle any desired workload. The XSi gave no timeline, but it has plans to distribute the IP to OEMs and hyperscalers, so the first silicon is still away.

Tiny Corp. Prepping Separate AMD & NVIDIA GPU-based AI Compute Systems

by

Mar 25th, 2024 14:50 Discuss (17 Comments)

George Hotz and his startup operation (Tiny Corporation) appeared ready to completely abandon AMD Radeon GPUs last week, after experiencing a period of firmware-related headaches. The original plan involved the development of a pre-orderable $15,000 TinyBox AI compute cluster that housed six XFX Speedster MERC310 RX 7900 XTX graphics cards, but software/driver issues prompted experimentation via alternative hardware routes. A lot of media coverage has focused on the unusual adoption of consumer-grade GPUs—Tiny Corp.'s struggles with RDNA 3 (rather than CDNA 3) were maneuvered further into public view, after top AMD brass pitched in.

The startup's social media feed is very transparent about showcasing everyday tasks, problem-solving and important decision-making. Several Acer Predator BiFrost Arc A770 OC cards were purchased and promptly integrated into a colorfully-lit TinyBox prototype, but Hotz & Co. swiftly moved onto Team Green pastures. Tiny Corp. has begrudgingly adopted NVIDIA GeForce RTX 4090 GPUs. Earlier today, it was announced that work on the AMD-based system has resumed—although customers were forewarned about anticipated teething problems. The surprising message arrived in the early hours: "a hard to find 'umr' repo has turned around the feasibility of the AMD TinyBox. It will be a journey, but it gives us an ability to debug. We're going to sell both, red for $15,000 and green for $25,000. When you realize your pre-order you'll choose your color. Website has been updated. If you like to tinker and feel pain, buy red. The driver still crashes the GPU and hangs sometimes, but we can work together to improve it."

Read full story

Tiny Corp. Pauses Development of AMD Radeon GPU-based Tinybox AI Cluster

by

Mar 20th, 2024 12:35 Discuss (36 Comments)

George Hotz and his Tiny Corporation colleagues were pinning their hopes on AMD delivering some good news earlier this month. The development of a "TinyBox" AI compute cluster project hit some major roadblocks a couple of weeks ago—at the time, Radeon RX 7900 XTX GPU firmware was not gelling with Tiny Corp.'s setup. Hotz expressed "70% confidence" in AMD approving open-sourcing certain bits of firmware. At the time of writing this has not transpired—this week the Tiny Corp. social media account has, once again, switched to an "all guns blazing" mode. Hotz and Co. have publicly disclosed that they were dabbling with Intel Arc graphics cards, as of a few weeks ago. NVIDIA hardware is another possible route, according to freshly posted open thoughts.

Yesterday, it was confirmed that the young startup organization had paused its utilization of XFX Speedster MERC310 RX 7900 XTX graphics cards: "the driver is still very unstable, and when it crashes or hangs we have no way of debugging it. We have no way of dumping the state of a GPU. Apparently it isn't just the MES causing these issues, it's also the Command Processor (CP). After seeing how open Tenstorrent is, it's hard to deal with this. With Tenstorrent, I feel confident that if there's an issue, I can debug and fix it. With AMD, I don't." The $15,000 TinyBox system relies on "cheaper" gaming-oriented GPUs, rather than traditional enterprise solutions—this oddball approach has attracted a number of customers, but the latest announcements likely signal another delay. Yesterday's tweet continued to state: "we are exploring Intel, working on adding Level Zero support to tinygrad. We also added a $400 bounty for XMX support. We are also (sadly) exploring a 6x GeForce RTX 4090 GPU box. At least we know the software is good there. We will revisit AMD once we have an open and reproducible build process for the driver and firmware. We are willing to dive really deep into hardware to make it amazing. But without access, we can't."

Read full story

AMD Radeon Graphics

Tiny Corp. CEO Expresses "70% Confidence" in AMD Open-Sourcing Certain GPU Firmware

by

Mar 9th, 2024 09:56 Discuss (12 Comments)

Lately Tiny Corp. CEO—George Hotz—has used his company's social media account to publicly criticize AMD Radeon RX 7900 XTX GPU firmware. The creator of Tinybox, a pre-orderable $15,000 AI compute cluster, has not selected "traditional" hardware for his systems—it is possible that AMD's Instinct MI300X accelerator is quite difficult to acquire, especially for a young startup operation. The decision to utilize gaming-oriented XFX-branded RDNA 3.0 GPUs instead of purpose-built CDNA 3.0 platforms—for local model training and AI inference—is certainly a peculiar one. Hotz and his colleagues have encountered roadblocks in the development of their Tinybox system—recently, public attention was drawn to an "LLVM spilling bug." AMD President/CEO/Chair, Dr. Lisa Su, swiftly stepped in and promised a "good solution." Earlier in the week, Tiny Corp. reported satisfaction with a delivery of fixes—courtesy of Team Red's software engineering department. They also disclosed that they would be discussing matters with AMD directly, regarding the possibility of open-sourcing Radeon GPU MES firmware.

Subsequently, Hotz documented his interactions with Team Red representatives—he expressed 70% confidence in AMD approving open-sourcing certain bits of firmware in a week's time: "Call went pretty well. We are gating the commitment to 6x Radeon RX 7900 XTX on a public release of a roadmap to get the firmware open source. (and obviously the MLPerf training bug being fixed). We aren't open source purists, it doesn't matter to us if the HDCP stuff is open for example. But we need the scheduler and the memory hierarchy management to be open. This is what it takes to push the performance of neural networks. The Groq 500 T/s mixtral demo should be possible on a tinybox, but it requires god tier software and deep integration with the scheduler. We also advised that the build process for amdgpu-dkms should be more open. While the driver itself is open, we haven't found it easy to rebuild and install. Easy REPL cycle is a key driver for community open source. We want the firmware to be easy to rebuild and install also." Prior to this week's co-operations, Tiny Corp. hinted that it could move on from utilizing Radeon RX 7900 XTX, in favor of Intel Alchemist graphics hardware—if AMD's decision making does not favor them, Hotz & Co. could pivot to builds including Acer Predator BiFrost Arc A770 16 GB OC cards.

AMD Radeon Graphics

Dr. Lisa Su Responds to TinyBox's Radeon RX 7900 XTX GPU Firmware Problems

by

Mar 6th, 2024 14:22 Discuss (24 Comments)

The TinyBox AI server system attracted plenty of media attention last week—its creator, George Hotz, decided to build with AMD RDNA 3.0 GPU hardware rather than the expected/traditional choice of CDNA 3.0. Tiny Corp. is a startup firm dealing in neural network frameworks—they currently "write and maintain tinygrad." Hotz & Co. are in the process of assembling rack-mounted 12U TinyBox systems for customers—an individual server houses an AMD EPYC 7532 processor and six XFX Speedster MERC310 Radeon RX 7900 XTX graphics cards. The Tiny Corp. social media account has engaged in numerous NVIDIA vs. AMD AI hardware debates/tirades—Hotz appears to favor the latter, as evidenced in his latest choice of components. ROCm support on Team Red AI Instinct accelerators is fairly mature at this point in time, but a much newer prospect on gaming-oriented graphics cards.

Tiny Corporation's unusual leveraging of Radeon RX 7900 XTX GPUs in a data center configuration has already hit a developmental roadblock. Yesterday, the company's social media account expressed driver-related frustrations in a public forum: "If AMD open sources their firmware, I'll fix their LLVM spilling bug and write a fuzzer for HSA. Otherwise, it's not worth putting tons of effort into fixing bugs on a platform you don't own." Hotz's latest complaint was taken onboard by AMD's top brass—Dr. Lisa Su responded with the following message: "Thanks for the collaboration and feedback. We are all in to get you a good solution. Team is on it." Her software engineers—within a few hours—managed to fling out a set of fixes in Tiny Corporation's direction. Hotz appreciated the quick turnaround, and proceeded to run a model without encountering major stability issues: "AMD sent me an updated set of firmware blobs to try. They are responsive, and there have been big strides in the driver in the last year. It will be good! This training run is almost 5 hours in, hasn't crashed yet." Tiny Corp. drummed up speculation about AMD open sourcing GPU MES firmware—Hotz disclosed that he will be talking (on the phone) to Team Red leadership.

Chinese Governing Bodies Reportedly Offering "Compute Vouchers" to AI Startups

by

Mar 6th, 2024 11:04 Discuss (1 Comment)

Regional Chinese governments are attempting to prop up local AI startup companies with an intriguing "voucher" support system. A Financial Times article outlines "computing" support packages valued between "$140,000 to $280,000" for fledgling organizations involved in LLM training. Widespread shortages of AI chips and rising data center operation costs are cited as the main factors driving a rollout of strategic subsidizations. The big three—Alibaba, Tencent, and ByteDance—are reportedly less willing to rent out their AI-crunching servers, due to internal operations demanding lengthy compute sessions. China's largest technology companies are believed to hording the vast majority of NVIDIA AI hardware, while smaller competitors are believed to fighting over table scraps. US trade restrictions have further escalated supply issues, with lower-performance/China-specific models being rejected—AMD's Instinct MI309 AI accelerator being the latest example.

The "computer voucher" initiative could be the first part of a wider scheme—reports suggest that regional governing bodies (including Shanghai) are devising another subsidy tier for domestic AI chips. Charlie Chai, an 86Research analyst, reckons that the initial support package is only a short-term solution. He shared this observation with FT: "the voucher is helpful to address the cost barrier, but it will not help with the scarcity of the resources." The Chinese government is reportedly looking into the creation of an alternative state-run system, that will become less reliant on a "Big Tech" data center model. A proposed "East Data West Computing" project could produce a more energy-efficient cluster of AI data centers, combined with a centralized management system.

Microsoft Investment in Mistral Attracts Possible Investigation by EU Regulators

by

Feb 27th, 2024 07:12 Discuss (5 Comments)

Tech giant Microsoft and Paris-based startup Mistral AI, an innovator in open-source AI model development, have announced a new multi-year partnership to accelerate AI innovation and expand access to Mistral's state-of-the-art models. The collaboration will leverage Azure's cutting-edge AI infrastructure to propel Mistral's research and bring its innovations to more customers globally. The partnership focuses on three core areas. First, Microsoft will provide Mistral with Azure AI supercomputing infrastructure to power advanced AI training and inference for Mistral's flagship models like Mistral-Large. Second, the companies will collaborate on AI research and development to push AI model's boundaries. And third, Azure's enterprise capabilities will give Mistral additional opportunities to promote, sell, and distribute their models to Microsoft customers worldwide.

However, an investment in a European startup can not go smoothly without the constant eyesight of the European Union authorities and regulators to oversee the deal. According to Bloomberg, an EU spokesperson on Tuesday claimed that the EU regulators will perform an analysis of Microsoft's investment into Mistral after receiving a copy of the agreement between the two parties. While there is no formal investigation yet, if EU regulators continue to probe Microsoft's deal and intentions, they could launch a complete formal investigation that could lead to the termination of Microsoft's plans. Of course, the formal investigation is still on hold, but investing in EU startups might become unfeasible for American tech giants if the EU regulators continue to push the scrutiny of every investment made in companies based on EU soil.

Groq LPU AI Inference Chip is Rivaling Major Players like NVIDIA, AMD, and Intel

by

Feb 20th, 2024 03:35 Discuss (1 Comment)

AI workloads are split into two different categories: training and inference. While training requires large computing and memory capacity, access speeds are not a significant contributor; inference is another story. With inference, the AI model must run extremely fast to serve the end-user with as many tokens (words) as possible, hence giving the user answers to their prompts faster. An AI chip startup, Groq, which was in stealth mode for a long time, has been making major moves in providing ultra-fast inference speeds using its Language Processing Unit (LPU) designed for large language models (LLMs) like GPT, Llama, and Mistral LLMs. The Groq LPU is a single-core unit based on the Tensor-Streaming Processor (TSP) architecture which achieves 750 TOPS at INT8 and 188 TeraFLOPS at FP16, with 320x320 fused dot product matrix multiplication, in addition to 5,120 Vector ALUs.

Having massive concurrency with 80 TB/s of bandwidth, the Groq LPU has 230 MB capacity of local SRAM. All of this is working together to provide Groq with a fantastic performance, making waves over the past few days on the internet. Serving the Mixtral 8x7B model at 480 tokens per second, the Groq LPU is providing one of the leading inference numbers in the industry. In models like Llama 2 70B with 4096 token context length, Groq can serve 300 tokens/s, while in smaller Llama 2 7B with 2048 tokens of context, Groq LPU can output 750 tokens/s. According to the LLMPerf Leaderboard, the Groq LPU is beating the GPU-based cloud providers at inferencing LLMs Llama in configurations of anywhere from 7 to 70 billion parameters. In token throughput (output) and time to first token (latency), Groq is leading the pack, achieving the highest throughput and second lowest latency.

Read full story

SoftBank Founder Wants $100 Billion to Compete with NVIDIA's AI

by

Feb 19th, 2024 01:00 Discuss (13 Comments)

Japanese tech billionaire and founder of the SoftBank Group, Masayoshi Son, is embarking on a hugely ambitious new project to build an AI chip company that aims to rival NVIDIA, the current leader in AI semiconductor solutions. Codenamed "Izanagi" after the Japanese god of creation, Son aims to raise up to $100 billion in funding for the new venture. With his company SoftBank having recently scaled back investments in startups, Son is now setting his sights on the red-hot AI chip sector. Izanagi would leverage SoftBank's existing chip design firm, Arm, to develop advanced semiconductors tailored for artificial intelligence computing. The startup would use Arm's instruction set for the chip's processing elements. This could pit Izanagi directly against NVIDIA's leadership position in AI chips. Son has a chest of $41 billion in cash at SoftBank that he can deploy for Izanagi.

Additionally, he is courting sovereign wealth funds in the Middle East to contribute up to $70 billion in additional capital. In total, Son may be seeking up to $100 billion to bankroll Izanagi into a chip powerhouse. AI chips are seeing surging demand as machine learning and neural networks require specialized semiconductors that can process massive datasets. NVIDIA and other names like Intel, AMD, and select startups have capitalized on this trend. However, Son believes the market has room for another major player. Izanagi would focus squarely on developing bleeding-edge AI chip architectures to power the next generation of artificial intelligence applications. It is still unclear if this would be an AI training or AI inference project, but given that the training market is currently bigger as we are in the early buildout phase of AI infrastructure, the consensus might settle on training. With his track record of bold bets, Son is aiming very high with Izanagi. It's a hugely ambitious goal, but Son has defied expectations before. Project Izanagi will test the limits of even his vision and financial firepower.

AMD to Acquire Open-Source AI Software Expert Nod.ai

Press Release by

Oct 10th, 2023 09:38 Discuss (17 Comments)

AMD today announced the signing of a definitive agreement to acquire Nod.ai to expand the company's open AI software capabilities. The addition of Nod.ai will bring an experienced team that has developed an industry-leading software technology that accelerates the deployment of AI solutions optimized for AMD Instinct data center accelerators, Ryzen AI processors, EPYC processors, Versal SoCs and Radeon GPUs to AMD. The agreement strongly aligns with the AMD AI growth strategy centered on an open software ecosystem that lowers the barriers of entry for customers through developer tools, libraries and models.

"The acquisition of Nod.ai is expected to significantly enhance our ability to provide AI customers with open software that allows them to easily deploy highly performant AI models tuned for AMD hardware," said Vamsi Boppana, senior vice president, Artificial Intelligence Group at AMD. "The addition of the talented Nod.ai team accelerates our ability to advance open-source compiler technology and enable portable, high-performance AI solutions across the AMD product portfolio. Nod.ai's technologies are already widely deployed in the cloud, at the edge and across a broad range of end point devices today."

Read full story

UPMEM Raises €7M to Revolutionize AI and Analytics Processing

Press Release by

Sep 12th, 2023 10:48 Discuss (0 Comments)

UPMEM, a fabless semiconductor startup has raised €4.1 M equity from the European Innovation Council (EIC) Fund and Venture Capitalists (Partech, Western Digital Capital, C4 Ventures…), and a €2.5M grant from the EIC. Founded by Fabrice Devaux and Gilles Hamou, the company is pioneering ultra-efficient Processing In Memory (PIM) accelerators to tackle the significant challenge of compute efficiency for AI and big data applications.

UPMEM's PIM solution, integrating UPMEM's first commercial-grade PIM chip on the market, is now available to cloud markets across the globe (US, Asia...) to provide the most cost-effective and energy-efficient solutions for AI and analytics applications in data centers and at the edge, such as large language models (LLM e.g. GPT), genomics, large analytics.

Read full story

NVIDIA Partners with Reliance to Advance AI in India

Press Release by

Sep 8th, 2023 09:33 Discuss (0 Comments)

In a major step to support India's industrial sector, NVIDIA and Reliance Industries today announced a collaboration to develop India's own foundation large language model trained on the nation's diverse languages and tailored for generative AI applications to serve the world's most populous nation. The companies will work together to build AI infrastructure that is over an order of magnitude more powerful than the fastest supercomputer in India today. NVIDIA will provide access to the most advanced NVIDIA GH200 Grace Hopper Superchip and NVIDIA DGX Cloud, an AI supercomputing service in the cloud. GH200 marks a fundamental shift in computing architecture that provides exceptional performance and massive memory bandwidth.

The NVIDIA-powered AI infrastructure is the foundation of the new frontier into AI for Reliance Jio Infocomm, Reliance Industries' telecom arm. The global AI revolution is transforming industries and daily life. To serve India's vast potential in AI, Reliance will create AI applications and services for their 450 million Jio customers and provide energy-efficient AI infrastructure to scientists, developers and startups across India.

Read full story

Silicon Box Opens US$2 Billion Advanced Semiconductor Assembly Plant in Singapore

by

Jul 21st, 2023 10:17 Discuss (6 Comments)

Somewhat out of the blue, Silicon Box has announced the opening of its US$2 billion semiconductor assembly plant in Singapore. The "startup" is founded by several of Marvell's founders, suggesting the company has the right pedigree to compete in what is sure to be a very competitive market over the next few years. Silicon Box is not a foundry and will at least at this point in time, not be involved in foundry services, but instead the company will focus on advanced chip packaging technologies, focusing on chiplets.

The company is promising "faster time-to-market, lower new device design cost" on its very rudimentary website, something the company has yet to prove to be capable of. However, its new plant in Singapore covers 73,000 square metres and is said to feature state of the art production equipment for turning chiplets into chips. The factory is said to create some 1,200 jobs in Singapore, suggesting that this is a company that means business. According to a comment by company founder and CEO BJ Han to Reuters, "customers had been lining up" since before the completion of the assembly plant. Silicon Box is expecting to have several AI chipset companies as its customers, including Tenstorrent, which so far is the only officially mentioned client. Time will tell if Silicon Box can compete with established chip packaging businesses and if they can deliver on their promise to be faster and cheaper than the competition.

Jensen Huang & Leading EU Generative AI Execs Participated in Fireside Chat

Press Release by

Jul 6th, 2023 09:17 Discuss (5 Comments)

Three leading European generative AI startups joined NVIDIA founder and CEO Jensen Huang this week to talk about the new era of computing. More than 500 developers, researchers, entrepreneurs and executives from across Europe and further afield packed into the Spindler and Klatt, a sleek, riverside gathering spot in Berlin. Huang started the reception by touching on the message he delivered Monday at the Berlin Summit for Earth Virtualization Engines (EVE), an international collaboration focused on climate science. He shared details of NVIDIA's Earth-2 initiative and how accelerated computing, AI-augmented simulation and interactive digital twins drive climate science research.

Before sitting down for a fireside chat with the founders of the three startups, Huang introduced some "special guests" to the audience—four of the world's leading climate modeling scientists, who he called the "unsung heroes" of saving the planet. "These scientists have dedicated their careers to advancing climate science," said Huang. "With the vision of EVE, they are the architects of the new era of climate science."

Read full story

Ethernovia, an Automotive Ethernet Startup, Receives $64 Million Investment

Press Release by

May 15th, 2023 10:46 Discuss (6 Comments)

Ethernovia, Inc., a Silicon Valley-based startup, today announced that it has completed its A-round financing of $64 million. The funding round consists of multiple investors, including Porsche Automobile Holding SE (Porsche SE), Qualcomm Ventures, VentureTech Alliance, AMD Ventures, Western Digital Capital, Fall Line Capital, Taiwania Capital, ENEA Capital and others.

As vehicle architectures evolve from domain-centric controllers, networking solutions must concurrently evolve to support higher data rates of advanced vehicle applications while meeting demand for improved reliability and security. Such advanced applications include Advanced Driver-Assisted Systems (ADAS), autonomous driving (AD) and a rich ecosystem of customer software delivered Over the Air (OTA).

Read full story

Return to Keyword Browsing

Jul 13th, 2025 02:14 CDT change timezone

Latest GPU Drivers

New Forum Posts

02:13 by Dr. Dro
Best motherboards for XP gaming (116)
02:09 by Cowboystrekk
9800x3D - 6400 CL32 1:1 not stable (12)
02:04 by silentbogo
Is there a WIFI chip I should get? (1)
01:53 by cinemaware
What are you playing? (23945)
01:27 by LabRat 891
9060 XT 16GB or 6800 XT/6900XT? (30)
01:01 by sweethoneybee
ASUS ProArt GeForce RTX 4060 Ti OC Edition 16GB GDDR6 Gaming - nvflash64 VBIOS mismatch (5)
00:51 by A Computer Guy
Upgrade from old x58 system (10)
00:28 by lexluthermiester
New ToS of Take Two and 2K (11)
00:20 by Atlasturner
Someone run games on AMD BC-250 under Linux * Cut down PS5 die to 6 CPU cores 24 GPU cores for use in crypto mining (86)
00:05 by eidairaman1
GPU strip blinking,Trixx software not working properly,fan health fail... (1)

Popular Reviews

Jul 9th, 2025 Fractal Design Epoch RGB TG Review
Jul 11th, 2025 Lexar NM1090 Pro 4 TB Review
Jul 8th, 2025 Corsair FRAME 5000D RS Review
Jul 11th, 2025 Our Visit to the Hunter Super Computer
Jul 4th, 2025 NVIDIA GeForce RTX 5050 8 GB Review
Jul 7th, 2025 NZXT N9 X870E Review
Jun 20th, 2025 Sapphire Radeon RX 9060 XT Pulse OC 16 GB Review - An Excellent Choice
Nov 6th, 2024 AMD Ryzen 7 9800X3D Review - The Best Gaming Processor
May 13th, 2025 Upcoming Hardware Launches 2025 (Updated May 2025)
Jul 10th, 2025 Chieftec Iceberg 360 Review

TPU on YouTube

Controversial News Posts