Topic detection and tracking for threaded discussion communities

M Zhu, W Hu, O Wu - … on Web Intelligence and Intelligent Agent …, 2008 - ieeexplore.ieee.org
M Zhu, W Hu, O Wu
2008 IEEE/WIC/ACM International Conference on Web Intelligence and …, 2008ieeexplore.ieee.org
The threaded discussion communities are one of the most common forms of online
communities, which are becoming more and more popular among web users. Everyday a
huge amount of new discussions are added to these communities, which are difficult to
summarize and search. In this paper, we propose a topic detection and tracking (TDT)
method for the discussion threads. Most existing TDT methods deal with the news stories,
but the language used in discussion data are much more casual, oral and informal …
The threaded discussion communities are one of the most common forms of online communities, which are becoming more and more popular among web users. Everyday a huge amount of new discussions are added to these communities, which are difficult to summarize and search. In this paper, we propose a topic detection and tracking (TDT) method for the discussion threads. Most existing TDT methods deal with the news stories, but the language used in discussion data are much more casual, oral and informal compared with news data. To solve this problem, we design several extensions to the basic TDT framework, focusing on the very nature of discussion data, including a thread/post activity validation step, a term pos-weighting strategy, and a two-level decision framework considering not only the content similarity but also the user activity information. Experiment results show that our pro-posed method greatly improves current TDT methods in real discussion community environment. The discussion data can be better organized for searching and visualization with the help of TDT.
ieeexplore.ieee.org
Showing the best result for this search. See all results