Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning

Sun, Tianxiang; He, Zhengfu; Zhu, Qin; Qiu, Xipeng; Huang, Xuanjing

Computer Science > Computation and Language

arXiv:2210.07565 (cs)

[Submitted on 14 Oct 2022 (v1), last revised 6 May 2023 (this version, v3)]

Title:Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning

Authors:Tianxiang Sun, Zhengfu He, Qin Zhu, Xipeng Qiu, Xuanjing Huang

View PDF

Abstract:Prompt tuning is a parameter-efficient approach to adapting pre-trained language models to downstream tasks. Although prompt tuning has been shown to match the performance of full model tuning when training data is sufficient, it tends to struggle in few-shot learning settings. In this paper, we present Multi-task Pre-trained Modular Prompt (MP2) to boost prompt tuning for few-shot learning. MP2 is a set of combinable prompts pre-trained on 38 Chinese tasks. On downstream tasks, the pre-trained prompts are selectively activated and combined, leading to strong compositional generalization to unseen tasks. To bridge the gap between pre-training and fine-tuning, we formulate upstream and downstream tasks into a unified machine reading comprehension task. Extensive experiments under two learning paradigms, i.e., gradient descent and black-box tuning, show that MP2 significantly outperforms prompt tuning, full model tuning, and prior prompt pre-training methods in few-shot settings. In addition, we demonstrate that MP2 can achieve surprisingly fast and strong adaptation to downstream tasks by merely learning 8 parameters to combine the pre-trained modular prompts.

Comments:	Accepted to ACL 2023 (main conference). Code and data are publicly available at this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.07565 [cs.CL]
	(or arXiv:2210.07565v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.07565

Submission history

From: Tianxiang Sun [view email]
[v1] Fri, 14 Oct 2022 06:43:42 UTC (2,354 KB)
[v2] Mon, 24 Oct 2022 06:32:25 UTC (1,856 KB)
[v3] Sat, 6 May 2023 11:30:41 UTC (2,668 KB)

Computer Science > Computation and Language

Title:Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators