From Selection to Generation: A Survey of LLM-based Active Learning

Xia, Yu; Mukherjee, Subhojyoti; Xie, Zhouhang; Wu, Junda; Li, Xintong; Aponte, Ryan; Lyu, Hanjia; Barrow, Joe; Chen, Hongjie; Dernoncourt, Franck; Kveton, Branislav; Yu, Tong; Zhang, Ruiyi; Gu, Jiuxiang; Ahmed, Nesreen K.; Wang, Yu; Chen, Xiang; Deilamsalehy, Hanieh; Kim, Sungchul; Hu, Zhengmian; Zhao, Yue; Lipka, Nedim; Yoon, Seunghyun; Huang, Ting-Hao Kenneth; Wang, Zichao; Mathur, Puneet; Pal, Soumyabrata; Mukherjee, Koyel; Zhang, Zhehao; Park, Namyong; Nguyen, Thien Huu; Luo, Jiebo; Rossi, Ryan A.; McAuley, Julian

Computer Science > Machine Learning

arXiv:2502.11767 (cs)

[Submitted on 17 Feb 2025]

Title:From Selection to Generation: A Survey of LLM-based Active Learning

Abstract:Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs) have been employed not only for selection but also for generating entirely new data instances and providing more cost-effective annotations. Motivated by the increasing importance of high-quality data and efficient model training in the era of LLMs, we present a comprehensive survey on LLM-based Active Learning. We introduce an intuitive taxonomy that categorizes these techniques and discuss the transformative roles LLMs can play in the active learning loop. We further examine the impact of AL on LLM learning paradigms and its applications across various domains. Finally, we identify open challenges and propose future research directions. This survey aims to serve as an up-to-date resource for researchers and practitioners seeking to gain an intuitive understanding of LLM-based AL techniques and deploy them to new applications.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2502.11767 [cs.LG]
	(or arXiv:2502.11767v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.11767

Submission history

From: Yu Xia [view email]
[v1] Mon, 17 Feb 2025 12:58:17 UTC (462 KB)

Computer Science > Machine Learning

Title:From Selection to Generation: A Survey of LLM-based Active Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:From Selection to Generation: A Survey of LLM-based Active Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators