Vector Similarity Search Acceleration using DRAM-based Processing-In-Memory (PIM)

Min Jung, Sujin Kim, Haechannuri Noh, Huijin Roh, Nahyeon Kim, Ji Hoon Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a vector similarity search acceleration by leveraging DRAM-based Processing In Memory (PIM), which is a key component in Retrieval-Augmented Generation (RAG) used to address limitations in large language models (LLM). As datasets expand, distance computations in vector similarity searches become increasingly memory-intensive. To tackle this challenge, we developed vector similarity search applications using both brute-force and Hierarchical Navigable Small World (HNSW) algorithms, with the distance computation process accelerated through PIM. The proposed PIM implementation was emulated on an FPGA board, when verification and testing demonstrated significant performance gains. These findings highlight the promising potential for PIM commercialization and its capability to enhance LLM performance.

Original languageEnglish
Title of host publication2025 International Conference on Electronics, Information, and Communication, ICEIC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331510756
DOIs
StatePublished - 2025
Event2025 International Conference on Electronics, Information, and Communication, ICEIC 2025 - Osaka, Japan
Duration: 19 Jan 202522 Jan 2025

Publication series

Name2025 International Conference on Electronics, Information, and Communication, ICEIC 2025

Conference

Conference2025 International Conference on Electronics, Information, and Communication, ICEIC 2025
Country/TerritoryJapan
CityOsaka
Period19/01/2522/01/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • Processing-In-Memory (PIM)
  • Retrieval-Augmented Generation (RAG)
  • in-memory computing
  • large language models (LLMs)
  • vector similarity search

Fingerprint

Dive into the research topics of 'Vector Similarity Search Acceleration using DRAM-based Processing-In-Memory (PIM)'. Together they form a unique fingerprint.

Cite this