Efficient On-Device Session-Based Recommendation

Xia, Xin; Yu, Junliang; Wang, Qinyong; Yang, Chaoqun; Nguyen, Quoc Viet Hung; Yin, Hongzhi

arXiv:2209.13422 (cs)

[Submitted on 27 Sep 2022 (v1), last revised 6 Jan 2023 (this version, v4)]

Abstract: On-device session-based recommendation systems have been attracting increasing attention on account of their low energy/resource consumption and privacy protection while providing promising recommendation performance. To fit powerful neural session-based recommendation models onto resource-constrained mobile devices, tensor-train decomposition and its variants have been widely applied to reduce the memory footprint by decomposing the embedding table into smaller tensors, showing great potential in compressing recommendation models. However, these model compression techniques significantly increase the local inference time due to the complex process of generating index lists and the series of tensor multiplications needed to form item embeddings, so the resultant on-device recommender fails to provide real-time responses and recommendations. To improve online recommendation efficiency, we propose to learn compositional encoding-based compact item representations. Specifically, each item is represented by a compositional code that consists of several codewords, and we learn embedding vectors to represent each codeword instead of each item. The composition of the codeword embedding vectors from different embedding matrices (i.e., codebooks) then forms the item embedding. Since the codebooks can be extremely small, the recommender model is able to fit on resource-constrained devices while storing the codebooks for fast local inference. In addition, to prevent the loss of model capacity caused by compression, we propose a bidirectional self-supervised knowledge distillation framework. Extensive experimental results on two benchmark datasets demonstrate that, compared with existing methods, the proposed on-device recommender not only achieves an 8x inference speedup with a large compression ratio but also shows superior recommendation performance.
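
The compositional encoding idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how codeword vectors are composed or how codes are learned, so the sketch assumes element-wise summation and a hand-picked code. All names (`make_codebooks`, `compose_embedding`, `parameter_counts`) and all sizes are hypothetical.

```python
import random


def make_codebooks(num_codebooks, codewords_per_book, dim, seed=0):
    """Build small random codebooks (stand-ins for learned embedding matrices)."""
    rng = random.Random(seed)
    return [
        [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(codewords_per_book)]
        for _ in range(num_codebooks)
    ]


def compose_embedding(code, codebooks):
    """Form an item embedding from its code: one codeword index per codebook.

    Assumption: composition is element-wise summation of the selected
    codeword vectors; the abstract only says the vectors are "composed".
    """
    if len(code) != len(codebooks):
        raise ValueError("code length must equal the number of codebooks")
    dim = len(codebooks[0][0])
    embedding = [0.0] * dim
    for book, index in zip(codebooks, code):
        for j, value in enumerate(book[index]):
            embedding[j] += value
    return embedding


def parameter_counts(num_items, dim, num_codebooks, codewords_per_book):
    """Compare float parameters in a full embedding table versus the codebooks."""
    full_table = num_items * dim
    codebook_params = num_codebooks * codewords_per_book * dim
    return full_table, codebook_params


# Hypothetical sizes chosen only for illustration.
codebooks = make_codebooks(num_codebooks=4, codewords_per_book=32, dim=8)
item_code = (3, 17, 0, 31)  # hand-picked; in the paper the codes are learned
item_embedding = compose_embedding(item_code, codebooks)
full, compact = parameter_counts(
    num_items=40000, dim=8, num_codebooks=4, codewords_per_book=32
)
print(len(item_embedding), full, compact)
```

In this sketch, a lookup costs a few index operations and vector additions rather than a chain of tensor multiplications. Each item still needs its own code stored (here four small integers), which the parameter count above does not include.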

Comments:	Extension of Our SIGIR'22 Paper (On-Device Next-Item Recommendation with Self-Supervised Knowledge Distillation), accepted by TOIS. arXiv admin note: text overlap with arXiv:2204.11091
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2209.13422 [cs.IR]
	(or arXiv:2209.13422v4 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2209.13422

Submission history

From: Xin Xia
[v1] Tue, 27 Sep 2022 14:23:08 UTC (3,537 KB)
[v2] Wed, 28 Sep 2022 09:49:14 UTC (3,524 KB)
[v3] Tue, 27 Dec 2022 04:57:03 UTC (6,767 KB)
[v4] Fri, 6 Jan 2023 12:25:25 UTC (6,753 KB)