A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration

Jiang, Shaojie; Zhang, Ruqing; Vakulenko, Svitlana; de Rijke, Maarten

arXiv:2205.02517 (cs)

[Submitted on 5 May 2022 (v1), last revised 19 May 2022 (this version, v2)]

Abstract: The cross-entropy objective has proved to be an all-purpose training objective for autoregressive language models (LMs). However, because it does not penalize problematic tokens, LMs trained with cross-entropy exhibit text degeneration. To address this, unlikelihood training has been proposed to reduce the probability of unlikely tokens predicted by LMs. But unlikelihood training does not consider the relationship between the label tokens and the unlikely token candidates, and thus yields only marginal improvements in degeneration. We propose a new contrastive token learning objective that inherits the advantages of cross-entropy and unlikelihood training and avoids their limitations. The key idea is to teach an LM to assign high probabilities to label tokens and low probabilities to negative candidates. Comprehensive experiments on language modeling and open-domain dialogue generation tasks show that the proposed contrastive token objective yields much less repetitive text, with higher generation quality than baseline approaches, achieving new state-of-the-art performance in reducing text degeneration.
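
The abstract states the key idea (high probability for label tokens, low probability for negative candidates) but does not give the formula. As a rough illustration only, the sketch below assumes a softmax-style contrastive form over logits, log(1 + sum over negatives of exp(logit_negative - logit_label)), and a hypothetical rule that takes recent prefix tokens as negatives. The function names, the `window` parameter, and the negative-selection rule are all assumptions made for this example and are not taken from the paper.

```python
import math


def cross_entropy(logits, label):
    """Standard cross-entropy: negative log-softmax probability of the label."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[label]


def preceding_negatives(prefix, label, window):
    """Hypothetical negative selection (an assumption, not from the abstract):
    the distinct tokens among the last `window` prefix tokens, minus the label."""
    return sorted({t for t in prefix[-window:] if t != label})


def contrastive_token_loss(logits, label, negatives):
    """Assumed contrastive form: log(1 + sum_n exp(logit_n - logit_label)).

    The loss falls when the label logit rises relative to the negatives,
    and it ignores tokens that are neither the label nor a negative.
    """
    negs = [n for n in negatives if n != label]
    return math.log1p(sum(math.exp(logits[n] - logits[label]) for n in negs))


if __name__ == "__main__":
    logits = [2.0, 0.5, -1.0, 0.0]   # toy vocabulary of four tokens
    prefix = [1, 2, 1, 0]            # previously generated token ids
    label = 0                        # the gold next token
    negs = preceding_negatives(prefix, label, window=3)
    print("negatives:", negs)
    print("cross-entropy:", cross_entropy(logits, label))
    print("contrastive:", contrastive_token_loss(logits, label, negs))
```

Unlike cross-entropy, which contrasts the label against the whole vocabulary, this form contrasts it only against the selected negatives, so the penalty is tied directly to the label token. Whether this matches the authors' exact objective should be checked against the paper itself.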

Comments:	22 pages, 11 figures, 8 tables
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2205.02517 [cs.CL]
	(or arXiv:2205.02517v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.02517

Submission history

From: Shaojie Jiang
[v1] Thu, 5 May 2022 08:50:50 UTC (1,137 KB)
[v2] Thu, 19 May 2022 12:34:24 UTC (1,442 KB)
