Interpretable video tag recommendation with multimedia deep learning framework

Zekun Yang (School of Information Resource Management, Renmin University of China, Beijing, China)
Zhijie Lin (Department of Management Science and Engineering, School of Economics and Management, Tsinghua University, Beijing, China)

Internet Research

ISSN: 1066-2243

Article publication date: 26 July 2021

Issue publication date: 15 March 2022

Abstract

Purpose

Tags help promote customer engagement on video-sharing platforms. Video tag recommender systems are artificial intelligence-enabled frameworks that strive to recommend precise tags for videos. Existing video tag recommender systems are uninterpretable, which leads to distrust of their recommendations, hesitation in adopting tags and difficulty in debugging the system. This study aims to construct a novel, interpretable video tag recommender system that assists video-sharing platform users in tagging their newly uploaded videos.

Design/methodology/approach

The proposed interpretable video tag recommender system is a multimedia deep learning framework that is composed of convolutional neural networks (CNNs) and receives texts and images as inputs. Its interpretability is realized through layer-wise relevance propagation.
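To make the idea of layer-wise relevance propagation concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy two-layer dense ReLU network with random, hypothetical weights in place of the paper's CNNs, and it uses the epsilon rule, which is one common LRP variant; the abstract does not state which rule the authors apply. The function name `lrp_epsilon_dense` and all variable names are illustrative.

```python
import numpy as np

def lrp_epsilon_dense(a, W, b, R_out, eps=1e-6):
    """Redistribute the relevance R_out of a dense layer's outputs
    onto its inputs a, using the LRP epsilon rule."""
    z = a @ W + b                               # pre-activations, shape (n_out,)
    z = z + eps * np.where(z >= 0, 1.0, -1.0)   # stabiliser avoids division by zero
    s = R_out / z                               # relevance per unit of pre-activation
    return a * (W @ s)                          # relevance per input, shape (n_in,)

# Hypothetical toy network: 6 input features -> 4 hidden units -> 3 tags.
rng = np.random.default_rng(0)
x = rng.random(6)                               # stand-in for word or patch features
W1, b1 = rng.normal(size=(6, 4)), np.zeros(4)
W2, b2 = rng.normal(size=(4, 3)), np.zeros(3)

# Forward pass.
h = np.maximum(0, x @ W1 + b1)                  # hidden ReLU activations
scores = h @ W2 + b2                            # one score per candidate tag
tag = int(np.argmax(scores))                    # index of the recommended tag

# Backward relevance pass: start with the score of the recommended tag only.
R_out = np.zeros_like(scores)
R_out[tag] = scores[tag]
R_h = lrp_epsilon_dense(h, W2, b2, R_out)       # relevance of hidden units
R_x = lrp_epsilon_dense(x, W1, b1, R_h)         # relevance of input features

# Inputs with the largest relevance are the ones an interface could highlight.
top_inputs = np.argsort(-R_x)[:2]
print("recommended tag index:", tag)
print("input relevances:", np.round(R_x, 3))
print("most relevant inputs:", top_inputs)
```

With zero biases and a small `eps`, the input relevances sum approximately to the score of the recommended tag, which is the conservation property that lets the relevances be read as shares of the decision. In a model that takes text and images, the same backward pass would assign relevance to individual words and image patches.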

Findings

The case study and user study demonstrate that the proposed interpretable multimedia CNN model can effectively explain its recommended tag to users by highlighting the keywords and key patches that contribute the most to that tag. Moreover, the proposed model outperforms state-of-the-art models in recommendation performance.

Practical implications

The interpretability of the proposed recommender system makes its decision process more transparent, builds users’ trust in the system and prompts users to adopt the recommended tags. Labeling videos with human-understandable and accurate tags would increase the exposure of videos to their target audiences, which enhances information technology (IT) adoption, customer engagement, value co-creation and precision marketing on the video-sharing platform.

Originality/value

To the best of our knowledge, the proposed model is both the first explainable video tag recommender system and the first explainable multimedia tag recommender system.

Acknowledgements

The authors thank the editors and the anonymous reviewers for their detailed and constructive comments. This work was supported in part by the National Natural Science Foundation of China [Grants 71801217, 72022007 and 71872080] and the Tsinghua University Initiative Scientific Research Program [Grant 2019THZWJC12].

Citation

Yang, Z. and Lin, Z. (2022), "Interpretable video tag recommendation with multimedia deep learning framework", Internet Research, Vol. 32 No. 2, pp. 518-535. https://doi.org/10.1108/INTR-08-2020-0471

Publisher

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited
