Explaining Text Similarity in Transformer Models

Vasileiou, Alexandros; Eberle, Oliver

Computer Science > Computation and Language

arXiv:2405.06604 (cs)

[Submitted on 10 May 2024]

Title:Explaining Text Similarity in Transformer Models

Authors:Alexandros Vasileiou, Oliver Eberle

View PDF HTML (experimental)

Abstract:As Transformers have become state-of-the-art models for natural language processing (NLP) tasks, the need to understand and explain their predictions is increasingly apparent. Especially in unsupervised applications, such as information retrieval tasks, similarity models built on top of foundation model representations have been widely applied. However, their inner prediction mechanisms have mostly remained opaque. Recent advances in explainable AI have made it possible to mitigate these limitations by leveraging improved explanations for Transformers through layer-wise relevance propagation (LRP). Using BiLRP, an extension developed for computing second-order explanations in bilinear similarity models, we investigate which feature interactions drive similarity in NLP models. We validate the resulting explanations and demonstrate their utility in three corpus-level use cases, analyzing grammatical interactions, multilingual semantics, and biomedical text retrieval. Our findings contribute to a deeper understanding of different semantic similarity tasks and models, highlighting how novel explainable AI methods enable in-depth analyses and corpus-level insights.

Comments:	Accepted to NAACL 2024
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2405.06604 [cs.CL]
	(or arXiv:2405.06604v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.06604

Submission history

From: Oliver Eberle Dr [view email]
[v1] Fri, 10 May 2024 17:11:31 UTC (18,842 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computation and Language

Title:Explaining Text Similarity in Transformer Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computation and Language

Title:Explaining Text Similarity in Transformer Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators