An Adversarial Imitation Click Model for Information Retrieval

Dai, Xinyi; Lin, Jianghao; Zhang, Weinan; Li, Shuai; Liu, Weiwen; Tang, Ruiming; He, Xiuqiang; Hao, Jianye; Wang, Jun; Yu, Yong

doi:10.1145/3442381.3449913


arXiv:2104.06077 (cs)

[Submitted on 13 Apr 2021 (v1), last revised 19 Apr 2021 (this version, v2)]

Abstract: Modern information retrieval systems, including web search, ads placement, and recommender systems, typically rely on learning from user feedback. Click models, which study how users interact with a ranked list of items, provide a useful understanding of user feedback for learning ranking models. Constructing the "right" dependencies is the key to any successful click model. However, probabilistic graphical models (PGMs) have to rely on manually assigned dependencies and oversimplify user behaviors. Existing neural network based methods improve on PGMs by enhancing the expressive ability and allowing flexible dependencies, but still suffer from exposure bias and inferior estimation. In this paper, we propose a novel framework, Adversarial Imitation Click Model (AICM), based on imitation learning. Firstly, we explicitly learn the reward function that recovers users' intrinsic utility and underlying intentions. Secondly, we model user interactions with a ranked list as a dynamic system instead of one-step click prediction, alleviating the exposure bias problem. Finally, we minimize the JS divergence through adversarial training and learn a stable distribution of click sequences, which makes AICM generalize well across different distributions of ranked lists. A theoretical analysis indicates that AICM reduces the exposure bias from $O(T^2)$ to $O(T)$. Our studies on a public web search dataset show that AICM not only outperforms state-of-the-art models on traditional click metrics but also achieves superior performance in addressing the exposure bias and recovering the underlying patterns of click sequences.
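
The JS divergence mentioned in the abstract can be illustrated with a small sketch. The snippet below is not the authors' method: AICM minimizes this quantity implicitly through adversarial training, whereas this example simply computes the Jensen-Shannon divergence directly between two empirical distributions of click sequences. The function names and the toy click sequences are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def empirical_distribution(sequences):
    """Map each distinct click sequence (tuple of 0/1) to its relative frequency."""
    counts = Counter(tuple(s) for s in sequences)
    total = sum(counts.values())
    return {seq: n / total for seq, n in counts.items()}


def kl_divergence(p, q):
    """KL(p || q) in nats; q must be positive wherever p is positive."""
    return sum(pv * math.log(pv / q[k]) for k, pv in p.items() if pv > 0)


def js_divergence(p, q):
    """Jensen-Shannon divergence between two discrete distributions given as dicts."""
    support = set(p) | set(q)
    # Mixture distribution m = (p + q) / 2 over the joint support.
    m = {k: 0.5 * (p.get(k, 0.0) + q.get(k, 0.0)) for k in support}
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


# Made-up click sequences over a ranked list of length 3 (1 = click, 0 = skip).
logged = [(1, 0, 0), (1, 0, 0), (0, 1, 0), (1, 1, 0)]
generated = [(1, 0, 0), (0, 0, 1), (0, 1, 0), (0, 0, 0)]

p = empirical_distribution(logged)
q = empirical_distribution(generated)
js = js_divergence(p, q)
print(f"JS divergence between logged and generated sequences: {js:.4f} nats")
```

With natural logarithms the value lies between 0 (identical distributions) and ln 2 (disjoint supports), so a click model whose generated sequences match the logged ones more closely yields a smaller value.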

Comments:	Accepted to WWW 2021
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2104.06077 [cs.IR]
	(or arXiv:2104.06077v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2104.06077
Related DOI:	https://doi.org/10.1145/3442381.3449913

Submission history

From: Xinyi Dai
[v1] Tue, 13 Apr 2021 10:17:55 UTC (4,199 KB)
[v2] Mon, 19 Apr 2021 05:41:07 UTC (4,199 KB)
