ImitAL: Learned Active Learning Strategy on Synthetic Data

Gonsior, Julius; Thiele, Maik; Lehner, Wolfgang

doi:10.1007/978-3-031-18840-4_4

Computer Science > Machine Learning

arXiv:2208.11636 (cs)

[Submitted on 24 Aug 2022]

Title:ImitAL: Learned Active Learning Strategy on Synthetic Data

Authors:Julius Gonsior, Maik Thiele, Wolfgang Lehner

View PDF

Abstract:Active Learning (AL) is a well-known standard method for efficiently obtaining annotated data by first labeling the samples that contain the most information based on a query strategy. In the past, a large variety of such query strategies has been proposed, with each generation of new strategies increasing the runtime and adding more complexity. However, to the best of our our knowledge, none of these strategies excels consistently over a large number of datasets from different application domains. Basically, most of the the existing AL strategies are a combination of the two simple heuristics informativeness and representativeness, and the big differences lie in the combination of the often conflicting heuristics. Within this paper, we propose ImitAL, a domain-independent novel query strategy, which encodes AL as a learning-to-rank problem and learns an optimal combination between both heuristics. We train ImitAL on large-scale simulated AL runs on purely synthetic datasets. To show that ImitAL was successfully trained, we perform an extensive evaluation comparing our strategy on 13 different datasets, from a wide range of domains, with 7 other query strategies.

Comments:	arXiv admin note: text overlap with arXiv:2108.07670
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2208.11636 [cs.LG]
	(or arXiv:2208.11636v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2208.11636
Related DOI:	https://doi.org/10.1007/978-3-031-18840-4_4

Submission history

From: Julius Gonsior [view email]
[v1] Wed, 24 Aug 2022 16:17:53 UTC (9,419 KB)

Computer Science > Machine Learning

Title:ImitAL: Learned Active Learning Strategy on Synthetic Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ImitAL: Learned Active Learning Strategy on Synthetic Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators