UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision

Li, Tianyu; Cevahir, Ali; Cho, Derek; Gong, Hao; Nguyen, DuyKhuong; Stenger, Bjorn

Computer Science > Machine Learning

arXiv:2202.07605 (cs)

[Submitted on 14 Feb 2022]

Title:UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision

Authors:Tianyu Li, Ali Cevahir, Derek Cho, Hao Gong, DuyKhuong Nguyen, Bjorn Stenger

View PDF

Abstract:E-commerce platforms generate vast amounts of customer behavior data, such as clicks and purchases, from millions of unique users every day. However, effectively using this data for behavior understanding tasks is challenging because there are usually not enough labels to learn from all users in a supervised manner. This paper extends the BERT model to e-commerce user data for pre-training representations in a self-supervised manner. By viewing user actions in sequences as analogous to words in sentences, we extend the existing BERT model to user behavior data. Further, our model adopts a unified structure to simultaneously learn from long-term and short-term user behavior, as well as user attributes. We propose methods for the tokenization of different types of user behavior sequences, the generation of input representation vectors, and a novel pretext task to enable the pre-trained model to learn from its own input, eliminating the need for labeled training data. Extensive experiments demonstrate that the learned representations result in significant improvements when transferred to three different real-world tasks, particularly compared to task-specific modeling and multi-task representation learning

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.07605 [cs.LG]
	(or arXiv:2202.07605v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2202.07605

Submission history

From: Bjorn Stenger [view email]
[v1] Mon, 14 Feb 2022 08:31:36 UTC (2,559 KB)

Computer Science > Machine Learning

Title:UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:UserBERT: Modeling Long- and Short-Term User Preferences via Self-Supervision

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators