Comparing methods for segmenting elementary discourse units in a French conversational corpus
24th Nordic Conference on Computational Linguistics (NoDaLiDa 2023), 2023•hal.science
While discourse segmentation and parsing has made considerable progress in recent years,
discursive analysis of conversational speech remains a difficult issue. In this paper, we
exploit a French data set that has been manually segmented into discourse units to compare
two approaches to discourse segmentation: fine-tuning existing systems on manual
segmentation vs. using hand-crafted labeling rules to develop a weakly supervised
segmenter. Our results show that both approaches yield similar performance in terms of f …
discursive analysis of conversational speech remains a difficult issue. In this paper, we
exploit a French data set that has been manually segmented into discourse units to compare
two approaches to discourse segmentation: fine-tuning existing systems on manual
segmentation vs. using hand-crafted labeling rules to develop a weakly supervised
segmenter. Our results show that both approaches yield similar performance in terms of f …
While discourse segmentation and parsing has made considerable progress in recent years, discursive analysis of conversational speech remains a difficult issue. In this paper, we exploit a French data set that has been manually segmented into discourse units to compare two approaches to discourse segmentation: fine-tuning existing systems on manual segmentation vs. using hand-crafted labeling rules to develop a weakly supervised segmenter. Our results show that both approaches yield similar performance in terms of f-score while data programming requires less manual annotation work. In a second experiment we play with the amount of training data used for fine-tuning systems and show that a small amount of hand labeled data is enough to obtain good results (albeit not as good as when all available annotated data are used).
hal.science
Showing the best result for this search. See all results