DiscSense: Automated Semantic Analysis of Discourse Markers

Sileo, Damien; Van de Cruys, Tim; Pradel, Camille; Muller, Philippe

[Submitted on 2 Jun 2020]

Abstract: Discourse markers ("by contrast", "happily", etc.) are words or phrases that signal semantic and/or pragmatic relationships between clauses or sentences. Recent work has fruitfully explored the prediction of discourse markers between sentence pairs in order to learn accurate sentence representations that are useful in various classification tasks. In this work, we take another perspective: using a model trained to predict discourse markers between sentence pairs, we predict plausible markers between sentence pairs with a known semantic relation (provided by existing classification datasets). These predictions allow us to study the link between discourse markers and the semantic relations annotated in classification datasets. Handcrafted mappings between markers and discourse relations have been proposed, but only for a limited set of markers and a limited set of categories. Yet there exist hundreds of discourse markers expressing a wide variety of relations, and competing discourse theories (which are largely built in a top-down fashion) have reached no consensus on the taxonomy of relations. By using an automatic prediction method over existing semantically annotated datasets, we provide a bottom-up characterization of discourse markers in English. The resulting dataset, named DiscSense, is publicly available.
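
The bottom-up procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy sentence pairs, and the rule-based `toy_predict_marker` (a stand-in for a trained marker prediction model) are all assumptions made for illustration, and the paper may use a different association statistic than the conditional frequency shown here.

```python
from collections import Counter, defaultdict


def marker_label_associations(examples, predict_marker):
    """Estimate P(label | predicted marker) over labeled sentence pairs.

    examples: iterable of (sentence1, sentence2, label) triples taken from
              an annotated classification dataset.
    predict_marker: callable returning a discourse marker for a sentence
                    pair (hypothetical interface, assumed for this sketch).
    """
    counts = defaultdict(Counter)
    for s1, s2, label in examples:
        marker = predict_marker(s1, s2)
        counts[marker][label] += 1

    # Normalize the counts for each marker into conditional frequencies.
    associations = {}
    for marker, label_counts in counts.items():
        total = sum(label_counts.values())
        associations[marker] = {
            label: n / total for label, n in label_counts.items()
        }
    return associations


def toy_predict_marker(s1, s2):
    # Hypothetical rule standing in for a trained marker prediction model.
    return "by contrast" if "not" in s2.split() else "similarly"


# Invented toy data; real input would come from an annotated dataset.
toy_examples = [
    ("A man is sleeping.", "The man is not awake.", "entailment"),
    ("A dog runs.", "The dog is not moving.", "contradiction"),
    ("A dog runs.", "An animal moves.", "entailment"),
    ("A cat sits.", "The cat is not standing.", "contradiction"),
]

assoc = marker_label_associations(toy_examples, toy_predict_marker)
for marker, dist in assoc.items():
    print(marker, dist)
```

On this toy data, "by contrast" is associated mostly with the contradiction label, which is the kind of marker-to-relation link the approach is meant to surface at scale.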

Comments:	Accepted at LREC2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2006.01603 [cs.CL]
	(or arXiv:2006.01603v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2006.01603

Submission history

From: Damien Sileo
[v1] Tue, 2 Jun 2020 13:39:53 UTC (648 KB)
