Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models

Zhang, Shenglin; Zhu, Pengtian; Ma, Minghua; Wang, Jiagang; Sun, Yongqian; Li, Dongwen; Wang, Jingyu; Guo, Qianying; Hua, Xiaolei; Zhu, Lin; Pei, Dan

Computer Science > Artificial Intelligence

arXiv:2408.12247 (cs)

[Submitted on 22 Aug 2024 (v1), last revised 23 Aug 2024 (this version, v2)]

Title:Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models

Authors:Shenglin Zhang, Pengtian Zhu, Minghua Ma, Jiagang Wang, Yongqian Sun, Dongwen Li, Jingyu Wang, Qianying Guo, Xiaolei Hua, Lin Zhu, Dan Pei

View PDF HTML (experimental)

Abstract:Large language models (LLMs) excel at general question-answering (Q&A) but often fall short in specialized domains due to a lack of domain-specific knowledge. Commercial companies face the dual challenges of privacy protection and resource constraints when involving LLMs for fine-tuning. This paper propose a novel framework, Self-Evolution, designed to address these issues by leveraging lightweight open-source LLMs through multiple iterative fine-tuning rounds. To enhance the efficiency of iterative fine-tuning, Self-Evolution employ a strategy that filters and reinforces the knowledge with higher value during the iterative process. We employed Self-Evolution on Qwen1.5-7B-Chat using 4,000 documents containing rich domain knowledge from China Mobile, achieving a performance score 174% higher on domain-specific question-answering evaluations than Qwen1.5-7B-Chat and even 22% higher than Qwen1.5-72B-Chat. Self-Evolution has been deployed in China Mobile's daily operation and maintenance for 117 days, and it improves the efficiency of locating alarms, fixing problems, and finding related reports, with an average efficiency improvement of over 18.6%. In addition, we release Self-Evolution framework code in this https URL.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.12247 [cs.AI]
	(or arXiv:2408.12247v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2408.12247

Submission history

From: Pengtian Zhu [view email]
[v1] Thu, 22 Aug 2024 09:36:15 UTC (376 KB)
[v2] Fri, 23 Aug 2024 01:25:26 UTC (368 KB)

Computer Science > Artificial Intelligence

Title:Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Enhanced Fine-Tuning of Lightweight Domain-Specific Q&A Model Based on Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators