FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

Du, Linfeng; Liang, Tingyuan; Sinha, Sharad; Xie, Zhiyao; Zhang, Wei

doi:10.1145/3543622.3573188

Computer Science > Hardware Architecture

arXiv:2212.11582v2 (cs)

[Submitted on 22 Dec 2022 (v1), revised 7 Jan 2023 (this version, v2), latest version 5 Feb 2023 (v3)]

Title:FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

Authors:Linfeng Du, Tingyuan Liang, Sharad Sinha, Zhiyao Xie, Wei Zhang

View PDF

Abstract:Multi-die FPGAs are widely adopted to deploy large hardware accelerators. Two factors impede the performance optimization of HLS designs implemented on multi-die FPGAs. On the one hand, the long net delay due to nets crossing die-boundaries results in an NP-hard problem to properly floorplan and pipeline an application. On the other hand, traditional automated searching flow for HLS directive optimizations targets single-die FPGAs, and hence, it cannot consider the resource constraints on each die and the timing issue incurred by the die-crossings. Further, it leads to an excessively long runtime to legalize the floorplan of HLS designs generated under each group of configurations during directive optimization due to the large design scale.
To co-optimize the directives and floorplan of HLS designs on multi-die FPGAs, we propose the FADO framework, which formulates the directive-floorplan co-search problem based on the multi-choice multi-dimensional bin-packing and solves it using an iterative optimization flow. For each step of directive search, a latency-bottleneck-guided greedy algorithm searches for more efficient directive configurations. For floorplanning, instead of repetitively incurring global floorplanning algorithms, we implement a more efficient incremental floorplan legalization algorithm. It mainly applies the worst-fit online bin-packing algorithm to balance the floorplan, together with an offline best-fit-decreasing re-packing to compact the floorplan, followed by pipelining of long wires crossing die-boundaries.
Through experiments on HLS designs mixing dataflow and non-dataflow kernels, FADO not only well-automates the co-optimization and finishes within 693X~4925X shorter runtime, compared with DSE assisted by global floorplanning, but also yields an improvement of 1.16X~8.78X in overall workflow execution time after implementation on the Xilinx Alveo U250 FPGA.

Comments:	Accepted as a conference paper at FPGA '23. Open source at: this https URL
Subjects:	Hardware Architecture (cs.AR)
Cite as:	arXiv:2212.11582 [cs.AR]
	(or arXiv:2212.11582v2 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2212.11582
Related DOI:	https://doi.org/10.1145/3543622.3573188

Submission history

From: Linfeng Du [view email]
[v1] Thu, 22 Dec 2022 10:18:35 UTC (3,860 KB)
[v2] Sat, 7 Jan 2023 08:16:46 UTC (3,933 KB)
[v3] Sun, 5 Feb 2023 08:02:14 UTC (3,933 KB)

Computer Science > Hardware Architecture

Title:FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:FADO: Floorplan-Aware Directive Optimization for High-Level Synthesis Designs on Multi-Die FPGAs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators