InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Ying, Huaiyuan; Zhang, Shuo; Li, Linyang; Zhou, Zhejian; Shao, Yunfan; Fei, Zhaoye; Ma, Yichuan; Hong, Jiawei; Liu, Kuikun; Wang, Ziyi; Wang, Yudong; Wu, Zijian; Li, Shuaibin; Zhou, Fengzhe; Liu, Hongwei; Zhang, Songyang; Zhang, Wenwei; Yan, Hang; Qiu, Xipeng; Wang, Jiayu; Chen, Kai; Lin, Dahua

Computer Science > Computation and Language

arXiv:2402.06332 (cs)

[Submitted on 9 Feb 2024 (v1), last revised 24 May 2024 (this version, v2)]

Title:InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

View PDF

Abstract:The math abilities of large language models can represent their abstract reasoning ability. In this paper, we introduce and open-source our math reasoning LLMs InternLM-Math which is continue pre-trained from InternLM2. We unify chain-of-thought reasoning, reward modeling, formal reasoning, data augmentation, and code interpreter in a unified seq2seq format and supervise our model to be a versatile math reasoner, verifier, prover, and augmenter. These abilities can be used to develop the next math LLMs or self-iteration. InternLM-Math obtains open-sourced state-of-the-art performance under the setting of in-context learning, supervised fine-tuning, and code-assisted reasoning in various informal and formal benchmarks including GSM8K, MATH, Hungary math exam, MathBench-ZH, and MiniF2F. Our pre-trained model achieves 30.3 on the MiniF2F test set without fine-tuning. We further explore how to use LEAN to solve math problems and study its performance under the setting of multi-task learning which shows the possibility of using LEAN as a unified platform for solving and proving in math. Our models, codes, and data are released at \url{this https URL}.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.06332 [cs.CL]
	(or arXiv:2402.06332v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.06332

Submission history

From: Huaiyuan Ying [view email]
[v1] Fri, 9 Feb 2024 11:22:08 UTC (1,590 KB)
[v2] Fri, 24 May 2024 07:09:21 UTC (1,592 KB)

Computer Science > Computation and Language

Title:InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators