News Posts matching #Language

AMD Recommends EPYC Processors for Everyday AI Server Tasks

Press Release by

Mar 12th, 2025 13:21 Discuss (8 Comments)

Ask a typical IT professional today whether they're leveraging AI, and there's a good chance they'll say yes-after all, they have reputations to protect! Kidding aside, many will report that their teams may use Web-based tools like ChatGPT or even have internal chatbots that serve their employee base on their intranet, but for that not much AI is really being implemented at the infrastructure level. As it turns out, the true answer is a bit different. AI tools and techniques have embedded themselves firmly into standard enterprise workloads and are a more common, everyday phenomena than even many IT people may realize. Assembly line operations now include computer vision-powered inspections. Supply chains use AI for demand forecasting making business move faster and of course, AI note-taking and meeting summary is embedded on virtually all the variants of collaboration and meeting software.

Increasingly, critical enterprise software tools incorporate built-in recommendation systems, virtual agents or some other form of AI-enabled assistance. AI is truly becoming a pervasive, complementary tool for everyday business. At the same time, today's enterprises are navigating a hybrid landscape where traditional, mission-critical workloads coexist with innovative AI-driven tasks. This "mixed enterprise and AI" workload environment calls for infrastructure that can handle both types of processing seamlessly. Robust, general-purpose CPUs like the AMD EPYC processors are designed to be powerful and secure and flexible to address this need. They handle everyday tasks—running databases, web servers, ERP systems—and offer strong security features crucial for enterprise operations augmented with AI workloads. In essence, modern enterprise infrastructure is about creating a balanced ecosystem. AMD EPYC CPUs play a pivotal role in creating this balance, delivering high performance, efficiency, and security features that underpin both traditional enterprise workloads and advanced AI operations.

Read full story

NVIDIA & Partners Will Discuss Supercharging of AI Development at GTC 2025

Press Release by

T0@st

Feb 26th, 2025 10:27 Discuss (1 Comment)

Generative AI is redefining computing, unlocking new ways to build, train and optimize AI models on PCs and workstations. From content creation and large and small language models to software development, AI-powered PCs and workstations are transforming workflows and enhancing productivity. At GTC 2025, running March 17-21 in the San Jose Convention Center, experts from across the AI ecosystem will share insights on deploying AI locally, optimizing models and harnessing cutting-edge hardware and software to enhance AI workloads—highlighting key advancements in RTX AI PCs and workstations.

Develop and Deploy on RTX
RTX GPUs are built with specialized AI hardware called Tensor Cores that provide the compute performance needed to run the latest and most demanding AI models. These high-performance GPUs can help build digital humans, chatbots, AI-generated podcasts and more. With more than 100 million GeForce RTX and NVIDIA RTX GPUs users, developers have a large audience to target when new AI apps and features are deployed. In the session "Build Digital Humans, Chatbots, and AI-Generated Podcasts for RTX PCs and Workstations," Annamalai Chockalingam, senior product manager at NVIDIA, will showcase the end-to-end suite of tools developers can use to streamline development and deploy incredibly fast AI-enabled applications.

Read full story

IBM & Lenovo Expand Strategic AI Technology Partnership in Saudi Arabia

Press Release by

T0@st

Feb 10th, 2025 12:12 Discuss (1 Comment)

IBM and Lenovo today announced at LEAP 2025 a planned expansion of their strategic technology partnership designed to help scale the impact of generative AI for clients in the Kingdom of Saudi Arabia. IDC expects annual worldwide spending on AI-centric systems to surpass $300 billion by 2026, with many leading organizations in Saudi Arabia exploring and investing in generative AI use cases as they prepare for the emergence of an "AI everywhere" world.

Building upon their 20-year partnership, IBM and Lenovo will collaborate to deliver AI solutions comprised of technology from the IBM watsonx portfolio of AI products, including the Saudi Data and Artificial Intelligence Authority (SDAIA) open-source Arabic Large Language Model (ALLaM), and Lenovo infrastructure. These solutions are expected to help government and business clients in the Kingdom to accelerate their use of AI to improve public services and make data-driven decisions in areas such as fraud detection, public safety, customer service, code modernization, and IT operations.

Read full story

Timekettle W4 Pro Earbuds Featuring Babel OS Launches Real-time 2-Way Call Translation, Enabling Natural Cross-lingual Conversations

CES Press Release by

TheLostSwede

Jan 7th, 2025 14:19 Discuss (1 Comment)

Timekettle, a global leader in AI-powered communication technology, is thrilled to introduce the enhanced W4 Pro Earbuds with bidirectional call functionality. Powered by its first proprietary software system, also announced at CES, Babel OS, the new W4 Pro sets a new benchmark for cross-language communication, offering seamless two-way real-time translation during phone and video calls on any communication platform.

The newly launched feature allows two-way translations to facilitate effortless cross-lingual conversations through any telecommunication application or traditional phone system. It provides bidirectional synchronous translations without disrupting the original voice quality.

Read full story

Windows 11 Grows in November Steam Survey Results As Linux Coasts at 2% and English Overtakes Chinese

Cpt.Jank

Dec 2nd, 2024 23:15 Discuss (7 Comments)

Steam's monthly hardware and software surveys provide a decent picture of what hardware and software gamers rely on to play their favorite games—at least those on Valve's game platform. Since the launch of Windows 11, it has been a somewhat reliable way to track the adoption of the new Windows version, and, as the official cut-off for Windows 10 support draws near, one would expect Windows 11 to pick up steam, especially among gamers, where Windows is the dominant OS. The results of the November Steam Survey are in, and while not much has changed on the hardware front, it seems like Microsoft is indeed wearing gamers down when it comes to Windows 11 adoption. Despite seeing a decent uptick in Windows 11 installations, the overall Windows market share dropped, even if almost imperceptibly, while Linux and macOS both saw a slight uptick in adoption among Steam gamers. As expected, Windows remained the dominant platform for gamers, but Windows 11, specifically, saw growth of 4.18%, while Windows 10 lost 4.15%, which is almost an exact 1:1 match, indicating that gamers are largely staying on Windows when they finally decide to move on from Windows 10. Overall, Windows lost 0.05% market share, compared to Linux, which gained 0.03% and macOS, which grew by 0.02%.

Valve's SteamOS Holo was the most popular Linux version in the survey, but it, too, slid by 0.28%. Of course, the hardware split for Linux is representative of the software side of things, which is to say: It's mostly just Steam Decks. As expected, most of the video cards and CPUs in the Linux results were AMD GPUs, with well over 36% of the sampled Linux gamers using AMD GPUs, even disregarding the obvious bias introduced by the AMD-powered Steam Deck hardware. The most popular NVIDIA GPU on Linux systems running Steam is currently the GeForce RTX 3060, at a mere 1.46% of the market share. Meanwhile, on Windows side, 5.03% of gamers are using the GTX 3060, with the next most popular GPU being the NVIDIA GeForce RTX 4060 Laptop GPU, at 4.92%.

Read full story

NVIDIA ACE Brings AI-Powered Interactions To Mecha BREAK

Press Release by

Nomad76

Aug 20th, 2024 08:55 Discuss (4 Comments)

NVIDIA ACE is a revolutionary suite of digital human technologies that brings digital humans to life with generative AI. Since its debut at Computex 2023 in the Ramen Shop tech demo, ACE's capabilities have evolved rapidly.

At Gamescom 2024, we announced our first digital human technology on-device small language model improving the conversation abilities of game characters. We also announced that the first game to showcase these ACE and digital human technologies is Amazing Seasun Game's Mecha BREAK, bringing its characters to life and providing a more dynamic and immersive gameplay experience on GeForce RTX AI PCs.

Read full story

Ubisoft Exploring Generative AI, Could Revolutionize NPC Narratives

Press Release by

T0@st

Mar 19th, 2024 13:45 Discuss (13 Comments)

Have you ever dreamed of having a real conversation with an NPC in a video game? Not just one gated within a dialogue tree of pre-determined answers, but an actual conversation, conducted through spontaneous action and reaction? Lately, a small R&D team at Ubisoft's Paris studio, in collaboration with Nvidia's Audio2Face application and Inworld's Large Language Model (LLM), have been experimenting with generative AI in an attempt to turn this dream into a reality. Their project, NEO NPC, uses GenAI to prod at the limits of how a player can interact with an NPC without breaking the authenticity of the situation they are in, or the character of the NPC itself.

Considering that word—authenticity—the project has had to be a hugely collaborative effort across artistic and scientific disciplines. Generative AI is a hot topic of conversation in the videogame industry, and Senior Vice President of Production Technology Guillemette Picard is keen to stress that the goal behind all genAI projects at Ubisoft is to bring value to the player; and that means continuing to focus on human creativity behind the scenes. "The way we worked on this project, is always with our players and our developers in mind," says Picard. "With the player in mind, we know that developers and their creativity must still drive our projects. Generative AI is only of value if it has value for them."

Read full story

Groq LPU AI Inference Chip is Rivaling Major Players like NVIDIA, AMD, and Intel

AleksandarK

Feb 20th, 2024 03:35 Discuss (1 Comment)

AI workloads are split into two different categories: training and inference. While training requires large computing and memory capacity, access speeds are not a significant contributor; inference is another story. With inference, the AI model must run extremely fast to serve the end-user with as many tokens (words) as possible, hence giving the user answers to their prompts faster. An AI chip startup, Groq, which was in stealth mode for a long time, has been making major moves in providing ultra-fast inference speeds using its Language Processing Unit (LPU) designed for large language models (LLMs) like GPT, Llama, and Mistral LLMs. The Groq LPU is a single-core unit based on the Tensor-Streaming Processor (TSP) architecture which achieves 750 TOPS at INT8 and 188 TeraFLOPS at FP16, with 320x320 fused dot product matrix multiplication, in addition to 5,120 Vector ALUs.

Having massive concurrency with 80 TB/s of bandwidth, the Groq LPU has 230 MB capacity of local SRAM. All of this is working together to provide Groq with a fantastic performance, making waves over the past few days on the internet. Serving the Mixtral 8x7B model at 480 tokens per second, the Groq LPU is providing one of the leading inference numbers in the industry. In models like Llama 2 70B with 4096 token context length, Groq can serve 300 tokens/s, while in smaller Llama 2 7B with 2048 tokens of context, Groq LPU can output 750 tokens/s. According to the LLMPerf Leaderboard, the Groq LPU is beating the GPU-based cloud providers at inferencing LLMs Llama in configurations of anywhere from 7 to 70 billion parameters. In token throughput (output) and time to first token (latency), Groq is leading the pack, achieving the highest throughput and second lowest latency.

Read full story

Apple Wants to Store LLMs on Flash Memory to Bring AI to Smartphones and Laptops

AleksandarK

Dec 21st, 2023 08:47 Discuss (24 Comments)

Apple has been experimenting with Large Language Models (LLMs) that power most of today's AI applications. The company wants these LLMs to serve the users best and deliver them efficiently, which is a difficult task as they require a lot of resources, including compute and memory. Traditionally, LLMs have required AI accelerators in combination with large DRAM capacity to store model weights. However, Apple has published a paper that aims to bring LLMs to devices with limited memory capacity. By storing LLMs on NAND flash memory (regular storage), the method involves constructing an inference cost model that harmonizes with the flash memory behavior, guiding optimization in two critical areas: reducing the volume of data transferred from flash and reading data in larger, more contiguous chunks. Instead of storing the model weights on DRAM, Apple wants to utilize flash memory to store weights and only pull them on-demand to DRAM once it is needed.

Two principal techniques are introduced within this flash memory-informed framework: "windowing" and "row-column bundling." These methods collectively enable running models up to twice the size of the available DRAM, with a 4-5x and 20-25x increase in inference speed compared to native loading approaches on CPU and GPU, respectively. Integrating sparsity awareness, context-adaptive loading, and a hardware-oriented design pave the way for practical inference of LLMs on devices with limited memory, such as SoCs with 8/16/32 GB of available DRAM. Especially with DRAM prices outweighing NAND Flash, setups such as smartphone configurations could easily store and inference LLMs with multi-billion parameters, even if the DRAM available isn't sufficient. For a more technical deep dive, read the paper on arXiv here.

Google Bard Available Across the EU, Updated with 40 Languages & Spoken Response Function

T0@st

Jul 13th, 2023 12:11 Discuss (9 Comments)

Google has notified the world about its AI chatbot, Bard, getting a wider release and new features—with a rollout across Europe (27 territories), plus the addition of Brazil: "Today we're announcing Bard's biggest expansion to date. It's now available in most of the world, and in the most widely spoken languages. And we're launching new features to help you better customize your experience, boost your creativity and get more done." Their updated system is available now, so users "can collaborate with Bard in over 40 languages." A spoken response function has been implemented which is advertised as being very "helpful if you want to hear the correct pronunciation of a word or listen to a poem or script. Simply enter a prompt and select the sound icon to hear Bard's answers."

Jack Krawczyk, Bard Product Lead, and Amarnag Subramanya, Bard's VP of Engineering made sure to mention that Google is covering its bases, since privacy issues have delayed Bard's ability to reach new places (now mostly in the past): "As part of our bold and responsible approach to AI, we've proactively engaged with experts, policymakers and privacy regulators on this expansion. And as we bring Bard to more regions and languages over time, we'll continue to use our AI Principles as a guide, incorporate user feedback, and take steps to protect people's privacy and data." The initial "trial" period was restricted to the USA and UK, when Google launched Bard back in March.

Team Xbox Celebrates Disability Pride Month

Press Release by

T0@st

Jul 6th, 2023 07:49 Discuss (6 Comments)

This July, as part of Disability Pride Month, Team Xbox proudly celebrates players, creators, and community members with disabilities. More than 400 million video game players have disabilities worldwide, and we recognize the incredible contributions the gaming and disability community has made in making Team Xbox, and the broader gaming industry, more inclusive and welcoming for everyone.

Disability Pride holds a special place in my heart, as I am not only a Program Manager on our Gaming Accessibility Team, but also a person with disabilities. Most people wouldn't think of me as having a disability at first glance. In fact, I didn't know I had disabilities until I was in my 20's when I was diagnosed as being neurodiverse. Now I know that I have had Obsessive Compulsive Disorder and Sensory Processing Disorder since I was a young child. And, as the years have gone by, I've acquired new disabilities due to illness, injury, and trauma. Chronic pain is now part of my life, as is hearing loss, and anxiety and depression related to complex post-traumatic stress disorder.

Read full story

Linux Foundation Launches New TLA+ Organization

Press Release by

T0@st

Apr 21st, 2023 10:11 Discuss (3 Comments)

SAN FRANCISCO, April 21, 2023 -- The Linux Foundation, the nonprofit organization enabling mass innovation through open source, today announced the launch of the TLA+ Foundation to promote the adoption and development of the TLA+ programming language and its community of TLA+ practitioners. Inaugural members include Amazon Web Services (AWS), Oracle and Microsoft. TLA+ is a high-level language for modeling programs and systems, especially concurrent and distributed ones. TLA+ has been successfully used by companies to verify complex software systems, reducing errors and improving reliability. The language helps detect design flaws early in the development process, saving time and resources.

TLA+ and its tools are useful for eliminating fundamental design errors, which are hard to find and expensive to correct in code. The language is based on the idea that the best way to describe things precisely is with simple mathematics. The language was invented decades ago by the pioneering computer scientist Leslie Lamport, now a distinguished scientist with Microsoft Research. After years of Lamport's stewardship and Microsoft's support, TLA+ has found a new home at the Linux Foundation.

Read full story

Return to Keyword Browsing

News Posts matching #Language

AMD Recommends EPYC Processors for Everyday AI Server Tasks

NVIDIA & Partners Will Discuss Supercharging of AI Development at GTC 2025

IBM & Lenovo Expand Strategic AI Technology Partnership in Saudi Arabia

Timekettle W4 Pro Earbuds Featuring Babel OS Launches Real-time 2-Way Call Translation, Enabling Natural Cross-lingual Conversations

Windows 11 Grows in November Steam Survey Results As Linux Coasts at 2% and English Overtakes Chinese

NVIDIA ACE Brings AI-Powered Interactions To Mecha BREAK

Ubisoft Exploring Generative AI, Could Revolutionize NPC Narratives

Groq LPU AI Inference Chip is Rivaling Major Players like NVIDIA, AMD, and Intel

Apple Wants to Store LLMs on Flash Memory to Bring AI to Smartphones and Laptops

Google Bard Available Across the EU, Updated with 40 Languages & Spoken Response Function

Team Xbox Celebrates Disability Pride Month

Linux Foundation Launches New TLA+ Organization

Latest GPU Drivers

New Forum Posts

Popular Reviews

TPU on YouTube

Controversial News Posts