Compressed Meta-Optical Encoder for Image Classification

Wirth-Singh, Anna; Xiang, Jinlin; Choi, Minho; Fröch, Johannes E.; Huang, Luocheng; Colburn, Shane; Shlizerman, Eli; Majumdar, Arka

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.06534 (cs)

[Submitted on 23 Apr 2024 (v1), last revised 14 Jun 2024 (this version, v2)]

Title:Compressed Meta-Optical Encoder for Image Classification

Authors:Anna Wirth-Singh, Jinlin Xiang, Minho Choi, Johannes E. Fröch, Luocheng Huang, Shane Colburn, Eli Shlizerman, Arka Majumdar

View PDF HTML (experimental)

Abstract:Optical and hybrid convolutional neural networks (CNNs) recently have become of increasing interest to achieve low-latency, low-power image classification and computer vision tasks. However, implementing optical nonlinearity is challenging, and omitting the nonlinear layers in a standard CNN comes at a significant reduction in accuracy. In this work, we use knowledge distillation to compress modified AlexNet to a single linear convolutional layer and an electronic backend (two fully connected layers). We obtain comparable performance to a purely electronic CNN with five convolutional layers and three fully connected layers. We implement the convolution optically via engineering the point spread function of an inverse-designed meta-optic. Using this hybrid approach, we estimate a reduction in multiply-accumulate operations from 17M in a conventional electronic modified AlexNet to only 86K in the hybrid compressed network enabled by the optical frontend. This constitutes over two orders of magnitude reduction in latency and power consumption. Furthermore, we experimentally demonstrate that the classification accuracy of the system exceeds 93% on the MNIST dataset.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Optics (physics.optics)
Cite as:	arXiv:2406.06534 [cs.CV]
	(or arXiv:2406.06534v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.06534

Submission history

From: Anna Wirth-Singh [view email]
[v1] Tue, 23 Apr 2024 00:54:31 UTC (1,077 KB)
[v2] Fri, 14 Jun 2024 05:43:12 UTC (1,642 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Compressed Meta-Optical Encoder for Image Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Compressed Meta-Optical Encoder for Image Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators