Interpretable video tag recommendation with multimedia deep learning framework
ISSN: 1066-2243
Article publication date: 26 July 2021
Issue publication date: 15 March 2022
Abstract
Purpose
Tags help promote customer engagement on video-sharing platforms. Video tag recommender systems are artificial intelligence-enabled frameworks that strive for recommending precise tags for videos. Extant video tag recommender systems are uninterpretable, which leads to distrust of the recommendation outcome, hesitation in tag adoption and difficulty in the system debugging process. This study aims at constructing an interpretable and novel video tag recommender system to assist video-sharing platform users in tagging their newly uploaded videos.
Design/methodology/approach
The proposed interpretable video tag recommender system is a multimedia deep learning framework composed of convolutional neural networks (CNNs), which receives texts and images as inputs. The interpretability of the proposed system is realized through layer-wise relevance propagation.
Findings
The case study and user study demonstrate that the proposed interpretable multimedia CNN model could effectively explain its recommended tag to users by highlighting keywords and key patches that contribute the most to the recommended tag. Moreover, the proposed model achieves an improved recommendation performance by outperforming state-of-the-art models.
Practical implications
The interpretability of the proposed recommender system makes its decision process more transparent, builds users’ trust in the recommender systems and prompts users to adopt the recommended tags. Through labeling videos with human-understandable and accurate tags, the exposure of videos to their target audiences would increase, which enhances information technology (IT) adoption, customer engagement, value co-creation and precision marketing on the video-sharing platform.
Originality/value
The proposed model is not only the first explainable video tag recommender system but also the first explainable multimedia tag recommender system to the best of our knowledge.
Keywords
Acknowledgements
The authors appreciate the editors and the anonymous reviewers for their detailed and constructive comments. This work was supported in part by the National Natural Science Foundation of China [Grants 71801217, 72022007 and 71872080] and Tsinghua University Initiative Scientific Research Program [Grant 2019THZWJC12].
Citation
Yang, Z. and Lin, Z. (2022), "Interpretable video tag recommendation with multimedia deep learning framework", Internet Research, Vol. 32 No. 2, pp. 518-535. https://doi.org/10.1108/INTR-08-2020-0471
Publisher
:Emerald Publishing Limited
Copyright © 2021, Emerald Publishing Limited