Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction

Zhang, Xinyu; Li, Minghan; Lin, Jimmy

Computer Science > Information Retrieval

arXiv:2302.06589 (cs)

[Submitted on 13 Feb 2023]

Title:Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction

Authors:Xinyu Zhang, Minghan Li, Jimmy Lin

View PDF

Abstract:Recent progress in information retrieval finds that embedding query and document representation into multi-vector yields a robust bi-encoder retriever on out-of-distribution datasets. In this paper, we explore whether late interaction, the simplest form of multi-vector, is also helpful to neural rerankers that only use the [CLS] vector to compute the similarity score. Although intuitively, the attention mechanism of rerankers at the previous layers already gathers the token-level information, we find adding late interaction still brings an extra 5% improvement in average on out-of-distribution datasets, with little increase in latency and no degradation in in-domain effectiveness. Through extensive experiments and analysis, we show that the finding is consistent across different model sizes and first-stage retrievers of diverse natures and that the improvement is more prominent on longer queries.

Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2302.06589 [cs.IR]
	(or arXiv:2302.06589v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2302.06589

Submission history

From: Xinyu Zhang [view email]
[v1] Mon, 13 Feb 2023 18:42:17 UTC (115 KB)

Computer Science > Information Retrieval

Title:Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators