Cross-Modality Program Representation Learning for Electronic Design Automation with High-Level Synthesis

Qin, Zongyue; Bai, Yunsheng; Sohrabizadeh, Atefeh; Ding, Zijian; Hu, Ziniu; Sun, Yizhou; Cong, Jason

arXiv:2406.09606 (cs)

[Submitted on 13 Jun 2024 (v1), last revised 17 Jul 2024 (this version, v3)]

Abstract: In recent years, domain-specific accelerators (DSAs) have gained popularity for applications such as deep learning and autonomous driving. To facilitate DSA designs, programmers use high-level synthesis (HLS) to compile a high-level description written in C/C++ into a design in low-level hardware description languages that is eventually synthesized into DSAs on circuits. However, creating a high-quality HLS design still demands significant domain knowledge, particularly in microarchitecture decisions expressed as pragmas. Thus, it is desirable to automate such decisions with the help of machine learning for predicting the quality of HLS designs, which requires a deeper understanding of the program, consisting of the original code and its pragmas. Naturally, these programs can be considered as sequence data. In addition, these programs can be compiled and converted into a control data flow graph (CDFG). But existing works either fail to leverage both modalities or combine the two in shallow or coarse ways. We propose ProgSG, a model that allows interaction between the source code sequence modality and the graph modality in a deep and fine-grained way. To alleviate the scarcity of labeled designs, a pre-training method is proposed based on a suite of compiler data flow analysis tasks. Experimental results show that ProgSG reduces the RMSE of design performance predictions by up to 22%, and identifies designs with an average of 1.10× and 1.26× (up to 8.17× and 13.31×) performance improvement in the design space exploration (DSE) task compared to HARP and AutoDSE, respectively.

Comments:	14 pages, 8 figures. arXiv admin note: text overlap with arXiv:2305.10838
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Hardware Architecture (cs.AR)
Cite as:	arXiv:2406.09606 [cs.LG]
	(or arXiv:2406.09606v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.09606

Submission history

From: Zongyue Qin
[v1] Thu, 13 Jun 2024 22:34:58 UTC (8,245 KB)
[v2] Thu, 27 Jun 2024 22:06:19 UTC (8,245 KB)
[v3] Wed, 17 Jul 2024 22:08:51 UTC (8,468 KB)
