Building Efficient Universal Classifiers with Natural Language Inference

Laurer, Moritz; van Atteveldt, Wouter; Casas, Andreu; Welbers, Kasper

Computer Science > Computation and Language

arXiv:2312.17543 (cs)

[Submitted on 29 Dec 2023 (v1), last revised 22 Mar 2024 (this version, v2)]

Title:Building Efficient Universal Classifiers with Natural Language Inference

Authors:Moritz Laurer, Wouter van Atteveldt, Andreu Casas, Kasper Welbers

View PDF HTML (experimental)

Abstract:Generative Large Language Models (LLMs) have become the mainstream choice for fewshot and zeroshot learning thanks to the universality of text generation. Many users, however, do not need the broad capabilities of generative LLMs when they only want to automate a classification task. Smaller BERT-like models can also learn universal tasks, which allow them to do any text classification task without requiring fine-tuning (zeroshot classification) or to learn new tasks with only a few examples (fewshot), while being significantly more efficient than generative LLMs. This paper (1) explains how Natural Language Inference (NLI) can be used as a universal classification task that follows similar principles as instruction fine-tuning of generative LLMs, (2) provides a step-by-step guide with reusable Jupyter notebooks for building a universal classifier, and (3) shares the resulting universal classifier that is trained on 33 datasets with 389 diverse classes. Parts of the code we share has been used to train our older zeroshot classifiers that have been downloaded more than 55 million times via the Hugging Face Hub as of December 2023. Our new classifier improves zeroshot performance by 9.4%.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2312.17543 [cs.CL]
	(or arXiv:2312.17543v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.17543

Submission history

From: Moritz Laurer [view email]
[v1] Fri, 29 Dec 2023 10:18:36 UTC (7,565 KB)
[v2] Fri, 22 Mar 2024 17:12:49 UTC (7,578 KB)

Computer Science > Computation and Language

Title:Building Efficient Universal Classifiers with Natural Language Inference

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Building Efficient Universal Classifiers with Natural Language Inference

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators