Automatic HBM Management: Models and Algorithms

  • Daniel Delayo
  • , Kenny Zhang
  • , Kunal Agrawal
  • , Michael A. Bender
  • , Jonathan W. Berry
  • , Rathish Das
  • , Benjamin Moseley
  • , Cynthia A. Phillips

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Some past and future supercomputer nodes incorporate High- Bandwidth Memory (HBM). Compared to standard DRAM, HBM has similar latency, higher bandwidth and lower capacity. In this paper, we evaluate algorithms for managing High- Bandwidth Memory automatically. Previous work suggests that, in the worst case, performance is extremely sensitive to the policy for managing the channel to DRAM. Prior theory shows that a priority-based scheme (where there is a static strict priority-order among p threads for channel access) is O(1)-competitive, but FIFO is not, and in the worst case is ?(p) competitive. Following this theoretical guidance would be a disruptive change for vendors, who currently use FIFO variants in their DRAMcontroller hardware. Our goal is to determine theoretically and empirically whether we can justify recommending investment in priority-based DRAM controller hardware. In order to experiment with DRAM channel protocols, we chose a theoretical model, validated it against real hardware, and implemented a basic simulator. We corroborated the previous theoretical results for the model, conducted a parameter sweep while running our simulator on address traces from memory bandwidth-bound codes (GNU sort and TACO sparse matrix-vector product), and designed better channel-access algorithms.

Original languageEnglish
Title of host publicationSPAA 2022 - Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures
PublisherAssociation for Computing Machinery
Pages147-159
Number of pages13
ISBN (Electronic)9781450391467
DOIs
StatePublished - Jul 11 2022
Event34th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2022 - Philadelphia, United States
Duration: Jul 11 2022Jul 14 2022

Publication series

NameAnnual ACM Symposium on Parallelism in Algorithms and Architectures

Conference

Conference34th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2022
Country/TerritoryUnited States
CityPhiladelphia
Period07/11/2207/14/22

Keywords

  • dynamic priority
  • fairness
  • far-channel arbitration
  • fcfs
  • fifo
  • first-come-first-serve
  • first-in-first-out
  • hbm
  • high-bandwidth memory
  • knight's landing
  • knl
  • lru
  • makespan
  • memory hierarchy
  • memory management
  • priority queue
  • queue management
  • sapphire rapids
  • scheduling
  • timeliness
  • xeon phi

Fingerprint

Dive into the research topics of 'Automatic HBM Management: Models and Algorithms'. Together they form a unique fingerprint.

Cite this