Unconfounded Propensity Estimation for Unbiased Ranking

Luo, Dan; Zou, Lixin; Ai, Qingyao; Chen, Zhiyu; Li, Chenliang; Yin, Dawei; Davison, Brian D.

Computer Science > Information Retrieval

arXiv:2305.09918 (cs)

[Submitted on 17 May 2023 (v1), last revised 8 Jul 2023 (this version, v3)]

Title:Unconfounded Propensity Estimation for Unbiased Ranking

Authors:Dan Luo, Lixin Zou, Qingyao Ai, Zhiyu Chen, Chenliang Li, Dawei Yin, Brian D. Davison

View PDF

Abstract:The goal of unbiased learning to rank (ULTR) is to leverage implicit user feedback for optimizing learning-to-rank systems. Among existing solutions, automatic ULTR algorithms that jointly learn user bias models (i.e., propensity models) with unbiased rankers have received a lot of attention due to their superior performance and low deployment cost in practice. Despite their theoretical soundness, the effectiveness is usually justified under a weak logging policy, where the ranking model can barely rank documents according to their relevance to the query. However, when the logging policy is strong, e.g., an industry-deployed ranking policy, the reported effectiveness cannot be reproduced. In this paper, we first investigate ULTR from a causal perspective and uncover a negative result: existing ULTR algorithms fail to address the issue of propensity overestimation caused by the query-document relevance confounder. Then, we propose a new learning objective based on backdoor adjustment and highlight its differences from conventional propensity models, which reveal the prevalence of propensity overestimation. On top of that, we introduce a novel propensity model called Logging-Policy-aware Propensity (LPP) model and its distinctive two-step optimization strategy, which allows for the joint learning of LPP and ranking models within the automatic ULTR framework, and actualize the unconfounded propensity estimation for ULTR. Extensive experiments on two benchmarks demonstrate the effectiveness and generalizability of the proposed method.

Comments:	11 pages, 5 figures
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2305.09918 [cs.IR]
	(or arXiv:2305.09918v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2305.09918

Submission history

From: Dan Luo [view email]
[v1] Wed, 17 May 2023 02:59:13 UTC (2,402 KB)
[v2] Thu, 18 May 2023 03:16:41 UTC (1,897 KB)
[v3] Sat, 8 Jul 2023 23:12:48 UTC (1,892 KB)

Computer Science > Information Retrieval

Title:Unconfounded Propensity Estimation for Unbiased Ranking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Unconfounded Propensity Estimation for Unbiased Ranking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators