[PDF][PDF] Term aggregation: mining synonymous expressions using personal stylistic variations

A Murakami, T Nasukawa - COLING 2004: Proceedings of the …, 2004 - aclanthology.org
A Murakami, T Nasukawa
COLING 2004: Proceedings of the 20th International Conference on …, 2004aclanthology.org
We present a text mining method for finding synonymous expressions based on the
distributional hypothesis in a set of coherent corpora. This paper proposes a new
methodology to improve the accuracy of a term aggregation system using each author's text
as a coherent corpus. Our approach is based on the idea that one person tends to use one
expression for one meaning. According to our assumption, most of the words with similar
context features in each author's corpus tend not to be synonymous expressions. Our …
Abstract
We present a text mining method for finding synonymous expressions based on the distributional hypothesis in a set of coherent corpora. This paper proposes a new methodology to improve the accuracy of a term aggregation system using each author’s text as a coherent corpus. Our approach is based on the idea that one person tends to use one expression for one meaning. According to our assumption, most of the words with similar context features in each author’s corpus tend not to be synonymous expressions. Our proposed method improves the accuracy of our term aggregation system, showing that our approach is successful.
aclanthology.org
Showing the best result for this search. See all results