Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering

Cheng, Hao; Fang, Hao; Liu, Xiaodong; Gao, Jianfeng

arXiv:2210.05156 (cs)

[Submitted on 11 Oct 2022 (v1), last revised 22 May 2023 (this version, v2)]

Abstract: Given their effectiveness on knowledge-intensive natural language processing tasks, dense retrieval models have become increasingly popular. Specifically, the de facto architecture for open-domain question answering uses two isomorphic encoders that are initialized from the same pretrained model but separately parameterized for questions and passages. This bi-encoder architecture is parameter-inefficient in that there is no parameter sharing between the encoders. Further, recent studies show that such dense retrievers underperform BM25 in various settings. We thus propose a new architecture, Task-aware Specialization for dense Retrieval (TASER), which enables parameter sharing by interleaving shared and specialized blocks in a single encoder. Our experiments on five question answering datasets show that TASER can achieve superior accuracy, surpassing BM25, while using about 60% of the parameters of bi-encoder dense retrievers. In out-of-domain evaluations, TASER is also empirically more robust than bi-encoder dense retrievers. Our code is available at this https URL.
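
The idea of interleaving shared and specialized blocks in a single encoder can be illustrated with a toy sketch. This is not the authors' implementation: the names `SharedBlock`, `SpecializedBlock`, `build_encoder` and `encode` are hypothetical, the blocks are simple affine maps standing in for Transformer layers, and the alternating even/odd layout is an assumption, since the abstract does not say which blocks are specialized.

```python
from typing import Callable, Dict, List

Vector = List[float]
Layer = Callable[[Vector], Vector]


def affine(scale: float, shift: float) -> Layer:
    """Toy stand-in for a Transformer block (hypothetical, not the paper's)."""
    return lambda x: [scale * v + shift for v in x]


class SharedBlock:
    """One set of weights used for both questions and passages."""

    num_weight_sets = 1

    def __init__(self, layer: Layer) -> None:
        self.layer = layer

    def __call__(self, x: Vector, task: str) -> Vector:
        return self.layer(x)  # the task label is ignored


class SpecializedBlock:
    """One set of weights per task, selected by the input type."""

    def __init__(self, layers: Dict[str, Layer]) -> None:
        self.layers = layers
        self.num_weight_sets = len(layers)

    def __call__(self, x: Vector, task: str) -> Vector:
        return self.layers[task](x)


def build_encoder(num_blocks: int) -> list:
    """Alternate shared and specialized blocks (assumed layout)."""
    blocks = []
    for i in range(num_blocks):
        if i % 2 == 0:
            blocks.append(SharedBlock(affine(2.0, 0.0)))
        else:
            blocks.append(SpecializedBlock({
                "question": affine(1.0, 1.0),
                "passage": affine(1.0, -1.0),
            }))
    return blocks


def encode(blocks: list, x: Vector, task: str) -> Vector:
    """Run the input through every block, routing by task where needed."""
    for block in blocks:
        x = block(x, task)
    return x


blocks = build_encoder(4)
q = encode(blocks, [1.0, 2.0], "question")
p = encode(blocks, [1.0, 2.0], "passage")

# Count weight sets: a bi-encoder keeps two full copies of every block.
single = sum(b.num_weight_sets for b in blocks)
bi = 2 * len(blocks)
print(q, p, single, bi)
```

In this toy layout the single encoder holds 6 weight sets against 8 for a bi-encoder of the same depth. That ratio is only illustrative; the roughly 60% figure in the abstract depends on details of the real model that the abstract does not give.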

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.05156 [cs.CL]
	(or arXiv:2210.05156v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.05156

Submission history

From: Hao Fang
[v1] Tue, 11 Oct 2022 05:33:25 UTC (315 KB)
[v2] Mon, 22 May 2023 20:38:56 UTC (81 KB)
