Text classification based on ensemble extreme learning machine

Li, Ming; Xiao, Peilun; Zhang, Ju

Computer Science > Information Retrieval

arXiv:1805.06525 (cs)

[Submitted on 10 May 2018]

Title:Text classification based on ensemble extreme learning machine

Authors:Ming Li, Peilun Xiao, Ju Zhang

View PDF

Abstract:In this paper, we propose a novel approach based on cost-sensitive ensemble weighted extreme learning machine; we call this approach AE1-WELM. We apply this approach to text classification. AE1-WELM is an algorithm including balanced and imbalanced multiclassification for text classification. Weighted ELM assigning the different weights to the different samples improves the classification accuracy to a certain extent, but weighted ELM considers the differences between samples in the different categories only and ignores the differences between samples within the same categories. We measure the importance of the documents by the sample information entropy, and generate cost-sensitive matrix and factor based on the document importance, then embed the cost-sensitive weighted ELM into the AdaBoost.M1 framework seamlessly. Vector space model(VSM) text representation produces the high dimensions and sparse features which increase the burden of ELM. To overcome this problem, we develop a text classification framework combining the word vector and AE1-WELM. The experimental results show that our method provides an accurate, reliable and effective solution for text classification.

Comments:	10 pages, 9 figures
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1805.06525 [cs.IR]
	(or arXiv:1805.06525v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.1805.06525

Submission history

From: Ming Li [view email]
[v1] Thu, 10 May 2018 06:10:46 UTC (756 KB)

Computer Science > Information Retrieval

Title:Text classification based on ensemble extreme learning machine

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Text classification based on ensemble extreme learning machine

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators