Collaborative Cross-modal Fusion with Large Language Model for Recommendation

Liu, Zhongzhou; Zhang, Hao; Dong, Kuicai; Fang, Yuan

doi:10.1145/3627673.3679596

arXiv:2408.08564 (cs)

[Submitted on 16 Aug 2024]

Abstract: Despite the success of conventional collaborative filtering (CF) approaches for recommendation systems, they exhibit limitations in leveraging semantic knowledge within the textual attributes of users and items. Recent focus on the application of large language models for recommendation (LLM4Rec) has highlighted their capability for effective semantic knowledge capture. However, these methods often overlook the collaborative signals in user behaviors. Some simply instruct-tune a language model, while others directly inject the embeddings of a CF-based model, lacking a synergistic fusion of different modalities. To address these issues, we propose a framework of Collaborative Cross-modal Fusion with Large Language Models, termed CCF-LLM, for recommendation. In this framework, we translate the user-item interactions into a hybrid prompt to encode both semantic knowledge and collaborative signals, and then employ an attentive cross-modal fusion strategy to effectively fuse latent embeddings of both modalities. Extensive experiments demonstrate that CCF-LLM outperforms existing methods by effectively utilizing semantic and collaborative signals in the LLM4Rec context.
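The abstract names an attentive fusion of two latent embeddings but does not give the formulation. As a rough illustration of the general idea only, the sketch below weights a semantic embedding and a collaborative embedding with scaled dot-product attention against a query vector. The function names, the query vector, and the two-way softmax are all assumptions made for this example; they are not taken from the paper and should not be read as the CCF-LLM architecture.

```python
import math
from typing import List, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a short list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attentive_fuse(
    semantic: Sequence[float],
    collaborative: Sequence[float],
    query: Sequence[float],
) -> Tuple[List[float], List[float]]:
    """Hypothetical fusion: attention-weighted sum of two embeddings.

    Each embedding is scored against `query` with a scaled dot product,
    the two scores are normalized with softmax, and the embeddings are
    combined using those weights. Returns (fused_vector, weights).
    """
    if not (len(semantic) == len(collaborative) == len(query)):
        raise ValueError("all vectors must have the same dimension")
    scale = math.sqrt(len(query))
    weights = softmax([
        dot(query, semantic) / scale,
        dot(query, collaborative) / scale,
    ])
    fused = [
        weights[0] * s + weights[1] * c
        for s, c in zip(semantic, collaborative)
    ]
    return fused, weights


if __name__ == "__main__":
    # Toy 2-dimensional embeddings, chosen only for illustration.
    fused, weights = attentive_fuse([1.0, 0.0], [0.0, 1.0], [1.0, 1.0])
    print(fused, weights)
```

With a query that scores both embeddings equally, the two weights come out equal and the fused vector is the plain average; a query closer to one modality shifts the weight toward it.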

Comments:	10 pages, 4 figures, accepted by CIKM 2024
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2408.08564 [cs.IR]
	(or arXiv:2408.08564v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2408.08564
Related DOI:	https://doi.org/10.1145/3627673.3679596

Submission history

From: Zhongzhou Liu
[v1] Fri, 16 Aug 2024 06:54:10 UTC (2,452 KB)
