Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Xu, Fangzhi; Wu, Zhiyong; Sun, Qiushi; Ren, Siyu; Yuan, Fei; Yuan, Shuai; Lin, Qika; Qiao, Yu; Liu, Jun

Computer Science > Computation and Language

arXiv:2311.09278 (cs)

[Submitted on 15 Nov 2023 (v1), last revised 18 Feb 2024 (this version, v2)]

Title:Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Authors:Fangzhi Xu, Zhiyong Wu, Qiushi Sun, Siyu Ren, Fei Yuan, Shuai Yuan, Qika Lin, Yu Qiao, Jun Liu

View PDF

Abstract:Although Large Language Models (LLMs) demonstrate remarkable ability in processing and generating human-like text, they do have limitations when it comes to comprehending and expressing world knowledge that extends beyond the boundaries of natural language(e.g., chemical molecular formula). Injecting a collection of symbolic data directly into the training of LLMs can be problematic, as it disregards the synergies among different symbolic families and overlooks the need for a balanced mixture of natural and symbolic data. In this work, we tackle these challenges from both a data and framework perspective and introduce Symbol-LLM series models. First, we curated a data collection consisting of 34 tasks and incorporating approximately 20 distinct symbolic families, intending to capture the interrelations and foster synergies between symbols. Then, a two-stage tuning framework succeeds in injecting symbolic knowledge without loss of the generality ability. Extensive experiments on both symbol- and NL-centric tasks demonstrate the balanced and superior performances of Symbol-LLM series models. The project page is this https URL.

Comments:	23 pages, 13 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.09278 [cs.CL]
	(or arXiv:2311.09278v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.09278

Submission history

From: Fangzhi Xu [view email]
[v1] Wed, 15 Nov 2023 18:59:56 UTC (16,383 KB)
[v2] Sun, 18 Feb 2024 06:24:12 UTC (13,269 KB)

Computer Science > Computation and Language

Title:Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators