SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models

Cai, Zicheng; Tang, Yaohua; Lai, Yutao; Wang, Hua; Chen, Zhi; Chen, Hao

Computer Science > Computation and Language

arXiv:2502.20422 (cs)

[Submitted on 27 Feb 2025]

Title:SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models

Authors:Zicheng Cai, Yaohua Tang, Yutao Lai, Hua Wang, Zhi Chen, Hao Chen

View PDF HTML (experimental)

Abstract:We introduce SEKI, a novel large language model (LLM)-based neural architecture search (NAS) method. Inspired by the chain-of-thought (CoT) paradigm in modern LLMs, SEKI operates in two key stages: self-evolution and knowledge distillation. In the self-evolution stage, LLMs initially lack sufficient reference examples, so we implement an iterative refinement mechanism that enhances architectures based on performance feedback. Over time, this process accumulates a repository of high-performance architectures. In the knowledge distillation stage, LLMs analyze common patterns among these architectures to generate new, optimized designs. Combining these two stages, SEKI greatly leverages the capacity of LLMs on NAS and without requiring any domain-specific data. Experimental results show that SEKI achieves state-of-the-art (SOTA) performance across various datasets and search spaces while requiring only 0.05 GPU-days, outperforming existing methods in both efficiency and accuracy. Furthermore, SEKI demonstrates strong generalization capabilities, achieving SOTA-competitive results across multiple tasks.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.20422 [cs.CL]
	(or arXiv:2502.20422v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.20422

Submission history

From: Zicheng Cai [view email]
[v1] Thu, 27 Feb 2025 09:17:49 UTC (354 KB)

Computer Science > Computation and Language

Title:SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators