Toward Interpretable Semantic Textual Similarity via Optimal Transport-based Contrastive Sentence Learning

Lee, Seonghyeon; Lee, Dongha; Jang, Seongbo; Yu, Hwanjo

arXiv:2202.13196 (cs)

[Submitted on 26 Feb 2022 (v1), last revised 14 Apr 2022 (this version, v2)]

Abstract: Recently, fine-tuning a pretrained language model to capture the similarity between sentence embeddings has shown state-of-the-art performance on the semantic textual similarity (STS) task. However, the absence of an interpretation method for sentence similarity makes the model output difficult to explain. In this work, we explicitly describe the sentence distance as a weighted sum of contextualized token distances on the basis of a transportation problem, and then present an optimal transport-based distance measure, named RCMD, which identifies and leverages semantically aligned token pairs. Finally, we propose CLRCMD, a contrastive learning framework that optimizes the RCMD of sentence pairs, which enhances the quality of both the sentence similarity and its interpretation. Extensive experiments demonstrate that our learning framework outperforms other baselines on both STS and interpretable-STS benchmarks, indicating that it computes effective sentence similarity and also provides interpretations consistent with human judgement. The code and checkpoint are publicly available at this https URL.
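
To make the idea of a sentence distance built from token distances concrete, the following is a minimal sketch of a relaxed transport-style distance between two sets of token vectors. It is not the authors' implementation: the uniform token weights, the cosine distance, the nearest-token relaxation, the averaging of the two directions, and all function names are assumptions chosen for illustration, and the abstract does not specify any of them.

```python
import math


def cosine_distance(u, v):
    """Cosine distance between two non-zero vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def relaxed_transport(src, tgt):
    """One-sided relaxation of the transportation problem (an assumption of this sketch).

    Each source token sends all of its mass to its nearest target token,
    so the cost is a weighted sum of token-level distances.
    Returns the cost and the (source index, target index, distance) alignment.
    """
    weight = 1.0 / len(src)  # assumption: uniform token weights
    total = 0.0
    alignment = []
    for i, u in enumerate(src):
        dists = [cosine_distance(u, v) for v in tgt]
        j = min(range(len(tgt)), key=dists.__getitem__)  # nearest target token
        total += weight * dists[j]
        alignment.append((i, j, dists[j]))
    return total, alignment


def relaxed_sentence_distance(a, b):
    """Symmetric distance: the mean of the two one-sided costs (an assumption)."""
    d_ab, align_ab = relaxed_transport(a, b)
    d_ba, align_ba = relaxed_transport(b, a)
    return 0.5 * (d_ab + d_ba), align_ab, align_ba


# Toy token vectors standing in for contextualized token embeddings.
sent_a = [[1.0, 0.0], [0.0, 1.0]]
sent_b = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
distance, align_ab, align_ba = relaxed_sentence_distance(sent_a, sent_b)
```

The returned alignments list, for every token, which token in the other sentence it was matched to and at what cost, which is the kind of token-pair evidence an interpretable similarity score can expose.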

Comments:	ACL 2022 main + camera-ready version
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.13196 [cs.AI]
	(or arXiv:2202.13196v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2202.13196

Submission history

From: Seonghyeon Lee
[v1] Sat, 26 Feb 2022 17:28:02 UTC (6,621 KB)
[v2] Thu, 14 Apr 2022 01:03:08 UTC (6,621 KB)