CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs

Yunho Oh, Ipoom Jeong, Won Woo Ro, Myung Kuk Yoon

Research output: Contribution to journalArticlepeer-review

4 Scopus citations


Spin-Transfer Torque Magnetic Random-Access Memory (STT-MRAM) is an emerging non-volatile memory technology that has been received significant attention due to its higher density and lower leakage current over SRAM. One compelling use case is to employ STT-MRAM as a GPU Register File (RF) to reduce its massive energy consumption. One critical challenge is that STT-MRAM has longer access latency and higher dynamic power consumption than SRAM, which motivates the hierarchical RF that places a small SRAM-based Register Cache (RC) between functional units and STT-MRAM RF. The RC acts as the write buffer, so all the writes on the RF are first performed on the RC. In the presence of a conflict miss, the RC writes back the corresponding cache line into the RF. In this work, we observe that a large amount of such write-back operations are unnecessary because they include register values that are never used again. Leveraging this observation, we propose a Compiler-ASsisted Hierarchical Register File in GPUs (CASH-RF) that optimizes STT-MRAM accesses by removing dead register values. In CASH-RF, unnecessary write-back operations are detected by the compiler via the last consumer analysis. At runtime, the corresponding RC lines are discarded after the last references without being updated to the RF. Compared to the baseline GPUs, CASH-RF removes 59.5% of write-back operations, which leads to 54.7% lower RF energy consumption with only 2.6% of performance degradation.

Original languageEnglish
JournalIEEE Embedded Systems Letters
StateAccepted/In press - 2022

Bibliographical note

Publisher Copyright:


  • compiler-assisted
  • Graphics processing units
  • Graphics processing units
  • hierarchical register file
  • Indexes
  • Instruction sets
  • Kernel
  • Radio frequency
  • Random access memory
  • Registers
  • SRAM


Dive into the research topics of 'CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs'. Together they form a unique fingerprint.

Cite this