A 65 nm Bayesian Neural Network Accelerator with 360 fJ/Sample In-Word GRNG for AI Uncertainty Estimation

Enciso, Zephan M.; Cheng, Boyang; Pei, Likai; Liu, Jianbo; Davis, Steven; Niemier, Michael; Cao, Ningyuan

Computer Science > Hardware Architecture

arXiv:2501.04577 (cs)

[Submitted on 8 Jan 2025 (v1), last revised 22 Jan 2025 (this version, v2)]

Title:A 65 nm Bayesian Neural Network Accelerator with 360 fJ/Sample In-Word GRNG for AI Uncertainty Estimation

Authors:Zephan M. Enciso, Boyang Cheng, Likai Pei, Jianbo Liu, Steven Davis, Michael Niemier, Ningyuan Cao

View PDF HTML (experimental)

Abstract:Uncertainty estimation is an indispensable capability for AI-enabled, safety-critical applications, e.g. autonomous vehicles or medical diagnosis. Bayesian neural networks (BNNs) use Bayesian statistics to provide both classification predictions and uncertainty estimation, but they suffer from high computational overhead associated with random number generation and repeated sample iterations. Furthermore, BNNs are not immediately amenable to acceleration through compute-in-memory architectures due to the frequent memory writes necessary after each RNG operation. To address these challenges, we present an ASIC that integrates 360 fJ/Sample Gaussian RNG directly into the SRAM memory words. This integration reduces RNG overhead and enables fully-parallel compute-in-memory operations for BNNs. The prototype chip achieves 5.12 GSa/s RNG throughput and 102 GOp/s neural network throughput while occupying 0.45 mm2, bringing AI uncertainty estimation to edge computation.

Comments:	7 pages, 12 figures
Subjects:	Hardware Architecture (cs.AR); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
ACM classes:	B.7.1; B.3.1; I.2.10; I.2.9
Cite as:	arXiv:2501.04577 [cs.AR]
	(or arXiv:2501.04577v2 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2501.04577

Submission history

From: Zephan Enciso [view email]
[v1] Wed, 8 Jan 2025 15:47:04 UTC (29,845 KB)
[v2] Wed, 22 Jan 2025 19:28:38 UTC (29,845 KB)

Computer Science > Hardware Architecture

Title:A 65 nm Bayesian Neural Network Accelerator with 360 fJ/Sample In-Word GRNG for AI Uncertainty Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:A 65 nm Bayesian Neural Network Accelerator with 360 fJ/Sample In-Word GRNG for AI Uncertainty Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators