• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

KIOXIA Releases AiSAQ as Open-Source Software to Reduce DRAM Needs in AI Systems

Nomad76

News Editor
Staff member
Joined
May 21, 2024
Messages
862 (3.38/day)
Kioxia Corporation, a world leader in memory solutions, today announced the open-source release of its new All-in-Storage ANNS with Product Quantization (AiSAQ) technology. A novel "approximate nearest neighbor" search (ANNS) algorithm optimized for SSDs, KIOXIA AiSAQ software delivers scalable performance for retrieval-augmented generation (RAG) without placing index data in DRAM - and instead searching directly on SSDs.

Generative AI systems demand significant compute, memory and storage resources. While they have the potential to drive transformative breakthroughs across various industries, their deployment often comes with high costs. RAG is a critical phase of AI that refines large language models (LLMs) with data specific to the company or application.



A central component of RAG is a vector database that accumulates and converts specific data into feature vectors in the database. RAG also utilizes an ANNS algorithm, which identifies vectors that improve the model based on similarity between the accumulated and target vectors. For RAG to be effective, it must rapidly retrieve the information most relevant to a query. Traditionally, ANNS algorithms are deployed in DRAM to achieve the high-speed performance required for these searches.

KIOXIA AiSAQ technology provides a scalable and efficient ANNS solution for billion-scale datasets with negligible memory usage and fast index switching capabilities.

Key Benefits of KIOXIA AiSAQ Technology:
  • Allows large-scale databases to operate without relying on limited DRAM resources, enhancing the performance of RAG systems.
  • Eliminates the need to load index data into DRAM, enabling the vector database to launch instantly. This supports seamless switching between user-specific or application-specific databases on the same server for efficient RAG service delivery.
  • Optimized for cloud systems by storing indexes in disaggregated storage for sharing across multiple servers. This approach dynamically adjusts vector database search performance for specific users or applications and facilitates the rapid migration of search instances between physical servers.

Kioxia is demonstrating its commitment to advancing AI by contributing its innovative KIOXIA AiSAQ technology to the community as open-source software.

Please follow the link for the open-source release of KIOXIA AiSAQ: https://github.com/kioxiaamerica/aisaq-diskann

View at TechPowerUp Main Site | Source
 
Joined
Feb 18, 2005
Messages
5,950 (0.82/day)
Location
Ikenai borderline!
System Name Firelance.
Processor Threadripper 3960X
Motherboard ROG Strix TRX40-E Gaming
Cooling IceGem 360 + 6x Arctic Cooling P12
Memory 8x 16GB Patriot Viper DDR4-3200 CL16
Video Card(s) MSI GeForce RTX 4060 Ti Ventus 2X OC
Storage 2TB WD SN850X (boot), 4TB Crucial P3 (data)
Display(s) Dell S3221QS(A) (32" 38x21 60Hz) + 2x AOC Q32E2N (32" 25x14 75Hz)
Case Enthoo Pro II Server Edition (Closed Panel) + 6 fans
Power Supply Fractal Design Ion+ 2 Platinum 760W
Mouse Logitech G604
Keyboard Razer Pro Type Ultra
Software Windows 10 Professional x64
AI sack? Hmmmmmmmmm...
 
Joined
Jan 2, 2019
Messages
170 (0.08/day)
>>...retrieval-augmented generation (RAG) without placing index data in DRAM - and instead searching directly on SSDs...

For many-many-many years that "new technique" is called Memory Mapped Files.
 
Top