User profiles for Charun Phrombut

Charun Phrombut

National Electronics and Computer Technology Center
Verified email at nectec.or.th
Cited by 14

The annotation guideline of lst20 corpus

…, K Kriengket, D Leenoi, C Phrombut… - arXiv preprint arXiv …, 2020 - arxiv.org
This report presents the annotation guideline for LST20, a large-scale corpus with multiple
layers of linguistic annotation for Thai language processing. Our guideline consists of five …

ThEconSum: an Economics-domained Dataset for Thai Text Summarization and Baseline Models

…, S Phaholphinyo, C Phrombut… - … Joint Symposium on …, 2022 - ieeexplore.ieee.org
Language resources as datasets are an essential component in developing an effective
automatic text summarization (ATS) system. Some public datasets are relatively uncommon …

[HTML][HTML] Construction of Text Summarization Corpus in Economics Domain and Baseline Models

…, P Porkaew, S Phaholphinyo, C Phrombut… - 2024 - jicce.org
Automated text summarization (ATS) systems rely on language resources as datasets.
However, creating these datasets is a complex and labor-intensive task requiring linguists to …

[CITATION][C] Ahmed Moustafa 163, 181, 187 Akihiro Kido 151 Akkharawoot Takhomm 93 Aldian Nurcahyo 19

…, B Neupane, B Du, CYS Jeong, C Phrombut… - ieeexplore.ieee.org
… ChangYSung Jeong 59 Charun Phrombut 169 Chunsheng Yang 163 …

[CITATION][C] TECHNICAL ORAL SESSIONS

ZZ Hlaing, YK Thu, T Supnithi, P Netisopakul… - 2022 - ieeexplore.ieee.org
… Sawittree Jumpathong, Akkharawoot Takhom, Prachya Boonkwan, Vipas Sutantayawalee,
Peerachet Porkaew, Sitthaa Phaholphinyo, Charun Phrombut, Thepchai Supnithi, Khemarath …

Pythainlp: Thai natural language processing in python

W Phatthiyaphaibun, K Chaovavanich… - arXiv preprint arXiv …, 2023 - arxiv.org
We present PyThaiNLP, a free and open-source natural language processing (NLP) library
for Thai language implemented in Python. It provides a wide range of software, models, and …

Phayathaibert: Enhancing a pretrained thai language model with unassimilated loanwords

P Sriwirote, J Thapiang, V Timtong… - arXiv preprint arXiv …, 2023 - arxiv.org
While WangchanBERTa has become the de facto standard in transformer-based Thai language
modeling, it still has shortcomings in regard to the understanding of foreign words, most …

Handling cross and out-of-domain samples in Thai word segmentation

P Limkonchotiwat, W Phatthiyaphaibun, R Sarwar… - 2021 - wlv.openrepository.com
While word segmentation is a solved problem in many languages, it is still a challenge in
continuous-script or low-resource languages. Like other NLP tasks, word segmentation is …

The Thai Discourse Treebank: Annotating and Classifying Thai Discourse Connectives

P Prasertsom, A Jaroonpol… - Transactions of the …, 2024 - direct.mit.edu
Discourse analysis is a highly applicable area of natural language processing. In English and
other languages, resources for discourse-based tasks are widely available. Thai, however, …

Thai nested named entity recognition corpus

W Buaphet, C Udomcharoenchaikit… - Findings of the …, 2022 - aclanthology.org
This paper presents the first Thai Nested Named Entity Recognition (N-NER) dataset. Thai N-NER
consists of 264,798 mentions, 104 classes, and a maximum depth of 8 layers …