Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval

Mitra, Bhaskar

Computer Science > Information Retrieval

arXiv:2012.11685 (cs)

[Submitted on 21 Dec 2020 (v1), last revised 19 Mar 2021 (this version, v2)]

Title:Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval

Authors:Bhaskar Mitra

View PDF

Abstract:Neural networks with deep architectures have demonstrated significant performance improvements in computer vision, speech recognition, and natural language processing. The challenges in information retrieval (IR), however, are different from these other application areas. A common form of IR involves ranking of documents--or short passages--in response to keyword-based queries. Effective IR systems must deal with query-document vocabulary mismatch problem, by modeling relationships between different query and document terms and how they indicate relevance. Models should also consider lexical matches when the query contains rare terms--such as a person's name or a product model number--not seen during training, and to avoid retrieving semantically related but irrelevant results. In many real-life IR tasks, the retrieval involves extremely large collections--such as the document index of a commercial Web search engine--containing billions of documents. Efficient IR methods should take advantage of specialized IR data structures, such as inverted index, to efficiently retrieve from large collections. Given an information need, the IR system also mediates how much exposure an information artifact receives by deciding whether it should be displayed, and where it should be positioned, among other results. Exposure-aware IR systems may optimize for additional objectives, besides relevance, such as parity of exposure for retrieved items and content publishers. In this thesis, we present novel neural architectures and methods motivated by the specific needs and challenges of IR tasks.

Comments:	PhD thesis, Univ College London (2020)
Subjects:	Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2012.11685 [cs.IR]
	(or arXiv:2012.11685v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2012.11685

Submission history

From: Bhaskar Mitra [view email]
[v1] Mon, 21 Dec 2020 21:20:16 UTC (3,531 KB)
[v2] Fri, 19 Mar 2021 21:47:04 UTC (3,564 KB)

Computer Science > Information Retrieval

Title:Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Neural Methods for Effective, Efficient, and Exposure-Aware Information Retrieval

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators