In order to improve the recognition accuracy of out-of-vocabulary words, we propose a cascaded model which first segments and disambiguates in-vocabulary ...
Abstract · 1 Introduction · 2 Our Method: 2.1 Basic procedure; 2.2 Feature selection · 3 User Editable Dictionary · 4 Refined OOV Word Recognition Model · 5 ...
The term applies both to mental processes used by humans when reading text, and to artificial processes implemented in computers, which are the subject of ...
Leveraging Rich Linguistic Features for Cross-domain Chinese Segmentation · Cascaded Chinese Weibo Segmentation Based on CRFs.
Chinese word segmentation is the task of splitting Chinese text (i.e. a sequence of Chinese characters) into words (Source: www.nlpprogress.com).
Apr 25, 2024 · Leveraging Rich Linguistic Features for Cross-domain Chinese Segmentation. ... Cascaded Chinese Weibo Segmentation Based on CRFs. CIPS-SIGHAN 2012 ...
Oct 24, 2024 · The proposed method achieves both high accuracy and faster operation by expanding perceptual domains and employing a layer-by-layer aggregation ...
In our approach, we combine the CRF and MMSEG algorithms and extend the features of the traditional CRF algorithm to train the model for word segmentation. We use Internet ...
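As a rough illustration of the dictionary-matching side of such a hybrid, the sketch below implements plain forward maximum matching, the greedy longest-match step that MMSEG-style segmenters build on. It is not the MMSEG algorithm itself (which adds disambiguation rules) and not the method from the snippet above; the function name `forward_max_match`, the `max_len` parameter, and the toy lexicon are all assumptions made for this example.

```python
def forward_max_match(text, lexicon, max_len=4):
    """Greedy longest-match segmentation against a word list (illustrative only)."""
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until the lexicon matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or candidate in lexicon:
                words.append(candidate)
                i += size
                break
    return words


# Toy lexicon, invented for this example.
lexicon = {"研究", "研究生", "生命", "起源"}
print(forward_max_match("研究生命起源", lexicon))
# → ['研究生', '命', '起源']
```

The greedy match picks 研究生 ("graduate student") and strands 命, whereas the intended reading is 研究 / 生命 / 起源 ("study the origin of life"). Errors of this kind are one reason dictionary matching is often paired with a statistical model.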
Oct 24, 2015 · For statistical algorithms such as CRF, the training set is converted into a Chinese character sequence, and the segmentation task can be ...
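The snippet is cut off, so the exact formulation it describes is unknown. One widely used way to turn a segmented training set into a character sequence is to label each character with its position in a word (B = begin, M = middle, E = end, S = single), which makes segmentation a sequence labeling problem that a CRF can learn. The sketch below assumes that BMES scheme; the helper names `words_to_bmes` and `bmes_to_words` are hypothetical.

```python
def words_to_bmes(words):
    """Convert a segmented sentence into (character, tag) pairs under the BMES scheme."""
    pairs = []
    for word in words:
        if len(word) == 1:
            pairs.append((word, "S"))
        else:
            pairs.append((word[0], "B"))
            pairs.extend((ch, "M") for ch in word[1:-1])
            pairs.append((word[-1], "E"))
    return pairs


def bmes_to_words(chars, tags):
    """Rebuild words from characters and predicted tags; a word closes on E or S."""
    words, current = [], ""
    for ch, tag in zip(chars, tags):
        current += ch
        if tag in ("E", "S"):
            words.append(current)
            current = ""
    if current:  # keep any trailing characters left open by an unfinished word
        words.append(current)
    return words


pairs = words_to_bmes(["研究", "生命", "的", "起源"])
chars = [ch for ch, _ in pairs]
tags = [tag for _, tag in pairs]
print(tags)                        # → ['B', 'E', 'B', 'E', 'S', 'B', 'E']
print(bmes_to_words(chars, tags))  # → ['研究', '生命', '的', '起源']
```

A trained tagger would supply `tags` at prediction time; here they come from the gold segmentation purely to show the round trip.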
This work introduces a multi-layer Chinese word segmentation system which can integrate the outputs from multiple heterogeneous segmentation systems and ...