BibTeX
@inproceedings{sun-etal-2023-multitask,
title = "Multitask Pre-training of Modular Prompt for {C}hinese Few-Shot Learning",
author = "Sun, Tianxiang and
He, Zhengfu and
Zhu, Qin and
Qiu, Xipeng and
Huang, Xuanjing",
editor = "Rogers, Anna and
Boyd-Graber, Jordan and
Okazaki, Naoaki",
booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = jul,
year = "2023",
address = "Toronto, Canada",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.acl-long.625",
doi = "10.18653/v1/2023.acl-long.625",
pages = "11156--11172",
abstract = "Prompt tuning is a parameter-efficient approach to adapting pre-trained language models to downstream tasks. Although prompt tuning has been shown to match the performance of full model tuning when training data is sufficient, it tends to struggle in few-shot learning settings. In this paper, we present Multi-task Pre-trained Modular Prompt (MP2) to boost prompt tuning for few-shot learning. MP2 is a set of combinable prompts pre-trained on 38 Chinese tasks. On downstream tasks, the pre-trained prompts are selectively activated and combined, leading to strong compositional generalization to unseen tasks. To bridge the gap between pre-training and fine-tuning, we formulate upstream and downstream tasks into a unified machine reading comprehension task. Extensive experiments under two learning paradigms, i.e., gradient descent and black-box tuning, show that MP2 significantly outperforms prompt tuning, full model tuning, and prior prompt pre-training methods in few-shot settings. In addition, we demonstrate that MP2 can achieve surprisingly fast and strong adaptation to downstream tasks by merely learning 8 parameters to combine the pre-trained modular prompts.",
}
MODS XML
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="sun-etal-2023-multitask">
<titleInfo>
<title>Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning</title>
</titleInfo>
<name type="personal">
<namePart type="given">Tianxiang</namePart>
<namePart type="family">Sun</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Zhengfu</namePart>
<namePart type="family">He</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Qin</namePart>
<namePart type="family">Zhu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xipeng</namePart>
<namePart type="family">Qiu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xuanjing</namePart>
<namePart type="family">Huang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2023-07</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Anna</namePart>
<namePart type="family">Rogers</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jordan</namePart>
<namePart type="family">Boyd-Graber</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Naoaki</namePart>
<namePart type="family">Okazaki</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Toronto, Canada</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Prompt tuning is a parameter-efficient approach to adapting pre-trained language models to downstream tasks. Although prompt tuning has been shown to match the performance of full model tuning when training data is sufficient, it tends to struggle in few-shot learning settings. In this paper, we present Multi-task Pre-trained Modular Prompt (MP2) to boost prompt tuning for few-shot learning. MP2 is a set of combinable prompts pre-trained on 38 Chinese tasks. On downstream tasks, the pre-trained prompts are selectively activated and combined, leading to strong compositional generalization to unseen tasks. To bridge the gap between pre-training and fine-tuning, we formulate upstream and downstream tasks into a unified machine reading comprehension task. Extensive experiments under two learning paradigms, i.e., gradient descent and black-box tuning, show that MP2 significantly outperforms prompt tuning, full model tuning, and prior prompt pre-training methods in few-shot settings. In addition, we demonstrate that MP2 can achieve surprisingly fast and strong adaptation to downstream tasks by merely learning 8 parameters to combine the pre-trained modular prompts.</abstract>
<identifier type="citekey">sun-etal-2023-multitask</identifier>
<identifier type="doi">10.18653/v1/2023.acl-long.625</identifier>
<location>
<url>https://aclanthology.org/2023.acl-long.625</url>
</location>
<part>
<date>2023-07</date>
<extent unit="page">
<start>11156</start>
<end>11172</end>
</extent>
</part>
</mods>
</modsCollection>
Endnote
%0 Conference Proceedings
%T Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning
%A Sun, Tianxiang
%A He, Zhengfu
%A Zhu, Qin
%A Qiu, Xipeng
%A Huang, Xuanjing
%Y Rogers, Anna
%Y Boyd-Graber, Jordan
%Y Okazaki, Naoaki
%S Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
%D 2023
%8 July
%I Association for Computational Linguistics
%C Toronto, Canada
%F sun-etal-2023-multitask
%X Prompt tuning is a parameter-efficient approach to adapting pre-trained language models to downstream tasks. Although prompt tuning has been shown to match the performance of full model tuning when training data is sufficient, it tends to struggle in few-shot learning settings. In this paper, we present Multi-task Pre-trained Modular Prompt (MP2) to boost prompt tuning for few-shot learning. MP2 is a set of combinable prompts pre-trained on 38 Chinese tasks. On downstream tasks, the pre-trained prompts are selectively activated and combined, leading to strong compositional generalization to unseen tasks. To bridge the gap between pre-training and fine-tuning, we formulate upstream and downstream tasks into a unified machine reading comprehension task. Extensive experiments under two learning paradigms, i.e., gradient descent and black-box tuning, show that MP2 significantly outperforms prompt tuning, full model tuning, and prior prompt pre-training methods in few-shot settings. In addition, we demonstrate that MP2 can achieve surprisingly fast and strong adaptation to downstream tasks by merely learning 8 parameters to combine the pre-trained modular prompts.
%R 10.18653/v1/2023.acl-long.625
%U https://aclanthology.org/2023.acl-long.625
%U https://doi.org/10.18653/v1/2023.acl-long.625
%P 11156-11172
Markdown (Informal)
[Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning](https://aclanthology.org/2023.acl-long.625) (Sun et al., ACL 2023)
ACL
Tianxiang Sun, Zhengfu He, Qin Zhu, Xipeng Qiu, and Xuanjing Huang. 2023. Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 11156–11172, Toronto, Canada. Association for Computational Linguistics.
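
The abstract describes a bank of pre-trained modular prompts that a downstream task adapts by learning only a handful of combination parameters (as few as 8). The sketch below is a minimal, hypothetical illustration of that idea, not the authors' released MP2 code: the class name, tensor shapes, and the sigmoid-gated soft combination are all assumptions made for clarity.

```python
# Illustrative sketch only: combine K frozen, pre-trained prompt modules with
# K learnable weights, so a downstream task trains just K parameters.
import torch
import torch.nn as nn


class ModularPromptCombiner(nn.Module):
    """Hypothetical combiner over a frozen bank of pre-trained prompt modules."""

    def __init__(self, prompt_bank: torch.Tensor):
        # prompt_bank: (K, prompt_len, hidden_dim), assumed pre-trained and kept frozen.
        super().__init__()
        self.register_buffer("prompt_bank", prompt_bank)
        k = prompt_bank.size(0)
        # The only trainable parameters for the downstream task: one logit per module.
        self.module_logits = nn.Parameter(torch.zeros(k))

    def forward(self) -> torch.Tensor:
        # Soft selection of modules; a harder 0/1 routing would also fit the description.
        weights = torch.sigmoid(self.module_logits)            # (K,)
        combined = torch.einsum("k,kld->ld", weights, self.prompt_bank)
        return combined                                        # (prompt_len, hidden_dim)


if __name__ == "__main__":
    # Hypothetical sizes: 8 modules, prompt length 16, hidden size 768.
    bank = torch.randn(8, 16, 768)
    combiner = ModularPromptCombiner(bank)
    prompt = combiner()  # would be prepended to the input embeddings of a frozen LM
    trainable = sum(p.numel() for p in combiner.parameters() if p.requires_grad)
    print(prompt.shape, trainable)  # torch.Size([16, 768]) 8
```

Because only the small weight vector is trained, this kind of combination step is also compatible with gradient-free (black-box) tuning, which is one of the two learning paradigms the abstract mentions.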