Exploiting the wisdom of the crowds for characterizing and connecting heterogeneous resources

R Kawase, P Siehndel, B Pereira Nunes… - Proceedings of the 25th …, 2014 - dl.acm.org
Proceedings of the 25th ACM conference on Hypertext and social media, 2014dl.acm.org
Heterogeneous content is an inherent problem for cross-system search, recommendation
and personalization. In this paper we investigate differences in topic coverage and the
impact of topics in different kinds of Web services. We use entity extraction and
categorization to create fingerprints that allow for meaningful comparison. As a basis
taxonomy, we use the 23 main categories of Wikipedia Category Graph, which has been
assembled over the years by the wisdom of the crowds. Following a proof of concept of our …
Heterogeneous content is an inherent problem for cross-system search, recommendation and personalization. In this paper we investigate differences in topic coverage and the impact of topics in different kinds of Web services. We use entity extraction and categorization to create fingerprints that allow for meaningful comparison. As a basis taxonomy, we use the 23 main categories of Wikipedia Category Graph, which has been assembled over the years by the wisdom of the crowds. Following a proof of concept of our approach, we analyze differences in topic coverage and topic impact. The results show many differences between Web services like Twitter, Flickr and Delicious, which reflect users' behavior and the usage of each system. The paper concludes with a user study that demonstrates the benefits of fingerprints over traditional textual methods for recommendations of heterogeneous resources.
ACM Digital Library
Showing the best result for this search. See all results