• Media type: E-Article
  • Title: GraphScale: Scalable Processing on FPGAs for HBM and Large Graphs
  • Contributor: Dann, Jonas; Ritter, Daniel; Fröning, Holger
  • Published: Association for Computing Machinery (ACM), 2024
  • Published in: ACM Transactions on Reconfigurable Technology and Systems, 17 (2024) 2, Seite 1-23
  • Language: English
  • DOI: 10.1145/3616497
  • ISSN: 1936-7406; 1936-7414
  • Keywords: General Computer Science
  • Origination:
  • Footnote:
  • Description: Recent advances in graph processing on FPGAs promise to alleviate performance bottlenecks with irregular memory access patterns. Such bottlenecks challenge performance for a growing number of important application areas like machine learning and data analytics. While FPGAs denote a promising solution through flexible memory hierarchies and massive parallelism, we argue that current graph processing accelerators either use the off-chip memory bandwidth inefficiently or do not scale well across memory channels. In this work, we propose GraphScale, a scalable graph processing framework for FPGAs. GraphScale combines multi-channel memory with asynchronous graph processing (i.e., for fast convergence on results) and a compressed graph representation (i.e., for efficient usage of memory bandwidth and reduced memory footprint). GraphScale solves common graph problems like breadth-first search, PageRank, and weakly connected components through modular user-defined functions, a novel two-dimensional partitioning scheme, and a high-performance two-level crossbar design. Additionally, we extend GraphScale to scale to modern high-bandwidth memory (HBM) and reduce partitioning overhead of large graphs with binary packing.