Auto-Detection of Safety Issues in Baby Products

Bleaney, Graham; Kuzyk, Matthew; Man, Julian; Mayanloo, Hossein; Tizhoosh, H. R.


arXiv:1805.09772 (cs)

[Submitted on 27 Apr 2018 (v1), last revised 21 Jul 2018 (this version, v2)]



Abstract: Every year, thousands of people receive consumer-product-related injuries. Research indicates that online customer reviews can be processed to autonomously identify product safety issues. Early identification of safety issues can lead to earlier recalls, and thus fewer injuries and deaths. A dataset of product reviews from this http URL was compiled, along with this http URL complaints and recall descriptions from the Consumer Product Safety Commission (CPSC) and European Commission Rapid Alert system. A system was built to clean the collected text and to extract relevant features. Dimensionality reduction was performed by computing feature relevance through a Random Forest and discarding features with low information gain. Various classifiers were analyzed, including Logistic Regression, SVMs, Naïve Bayes, Random Forests, and an Ensemble classifier. Experimentation with various features and classifier combinations resulted in a logistic regression model with 66% precision in the top 50 reviews surfaced. This classifier outperforms all benchmarks set by related literature and consumer product safety professionals.
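
The headline figure in the abstract is a precision-at-k measure: of the 50 reviews the classifier ranks highest, the fraction that genuinely describe a safety issue (66% corresponds to 33 of 50). The following is a minimal sketch of that metric only, not the authors' code. The function name `precision_at_k` and the toy scores and labels are hypothetical and chosen purely for illustration.

```python
from typing import Sequence


def precision_at_k(scores: Sequence[float], labels: Sequence[int], k: int) -> float:
    """Fraction of the k highest-scored items whose label is 1 (safety issue)."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if k <= 0:
        raise ValueError("k must be positive")
    # Rank review indices by classifier score, highest first.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    top = ranked[:k]
    if not top:
        return 0.0
    # Count how many of the surfaced reviews are true safety issues.
    return sum(labels[i] for i in top) / len(top)


# Hypothetical toy data (not from the paper): six reviews with classifier
# scores, and labels where 1 marks a review that reports a safety issue.
scores = [0.91, 0.15, 0.78, 0.40, 0.66, 0.05]
labels = [1, 0, 0, 0, 1, 0]
print(precision_at_k(scores, labels, k=3))
```

In this toy ranking, two of the three highest-scored reviews are true safety issues. A metric of this kind suits the setting described in the abstract because a human reviewer only reads the reviews that are surfaced at the top of the list.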

Comments:	To appear in proceedings of The 31st IEA-AIE 2018, June 25-28, 2018, Montreal, Canada
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (stat.ML)
Cite as:	arXiv:1805.09772 [cs.LG]
	(or arXiv:1805.09772v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.09772

Submission history

From: Hamid Tizhoosh
[v1] Fri, 27 Apr 2018 15:33:50 UTC (253 KB)
[v2] Sat, 21 Jul 2018 23:43:59 UTC (294 KB)
