Extraction of Pharmacokinetic Evidence of Drug-drug Interactions from the Literature

Kolchinsky, Artemy; Lourenco, Analia; Wu, Heng-Yi; Li, Lang; Rocha, Luis M.

Abstract:Drug-drug interactions (DDIs) are major causes of morbidity and mortality and a subject of intense scientific interest. Biomedical literature mining can aid DDI research by extracting evidence for large numbers of potential interactions from published literature and clinical databases. While evidence for DDI ranges in scale from intracellular biochemistry to human populations, literature mining methods have not been used to extract specific types of experimental evidence which are reported differently for distinct experimental goals. We focus on pharmacokinetic evidence for DDIs ... We used a manually curated corpus of PubMed abstracts and annotated sentences to evaluate the efficacy of literature mining in classifying PubMed abstracts containing pharmacokinetic evidence for DDIs, as well as extracting sentences containing such evidence. We implemented a text mining pipeline using several linear classifiers and a variety of feature transformation methods. The most important textual features in the abstract and sentence classification tasks were analyzed. We also investigated the performance benefits of using features derived from PubMed metadata fields, from various publicly-available named entity recognizers and from pharmacokinetic dictionaries. Several classifiers performed very well in distinguishing relevant and irrelevant abstracts (reaching F1 ~= 0.93, MCC ~= 0.74, iAUC ~= 0.99) and sentences (F1 ~= 0.76, MCC ~= 0.65, iAUC ~= 0.83). We found that word-bigram textual features were important for achieving optimal classifier performance, that features derived from Medical Subject Headings (MeSH) terms significantly improved abstract classification, and that some drug-related entity named recognition tools and dictionaries led to slight but significant improvements, especially in classification of evidence sentences. ...

Subjects:	Machine Learning (stat.ML); Information Retrieval (cs.IR); Quantitative Methods (q-bio.QM)
ACM classes:	H.2.8; H.3.1; J.3
Cite as:	arXiv:1412.0744 [stat.ML]
	(or arXiv:1412.0744v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1412.0744

Statistics > Machine Learning

Title:Extraction of Pharmacokinetic Evidence of Drug-drug Interactions from the Literature

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators