Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

Xi, Zhiheng; Jin, Senjie; Zhou, Yuhao; Zheng, Rui; Gao, Songyang; Gui, Tao; Zhang, Qi; Huang, Xuanjing

Computer Science > Computation and Language

arXiv:2305.14497v1 (cs)

[Submitted on 23 May 2023 (this version), latest version 18 Apr 2024 (v2)]

Title:Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

Authors:Zhiheng Xi, Senjie Jin, Yuhao Zhou, Rui Zheng, Songyang Gao, Tao Gui, Qi Zhang, Xuanjing Huang

View PDF

Abstract:Prompting methods such as Chain-of-Thought (CoT) have shed new light on enhancing the reasoning capabilities of large language models, and researchers have extensively explored the generation process of rationales and answers. However, they have overlooked the potential challenges posed by the poor quality of reasoning problems, which may influence the reasoning performance significantly. In this work, we propose Self-Polish (SP), a novel method that facilitates the model's problem-solving process by prompting them to progressively refine the given problems to be more comprehensible and solvable. Specifically, the method teaches models to eliminate irrelevant information, rearrange the logic structure and organize local conditions into new ones parallelly. SP is orthogonal to all other prompting methods, making it convenient to integrate with state-of-the-art techniques for further improvement. We conduct thorough experiments on five benchmarks to illustrate the effectiveness of the proposed method. For example, with Text-davinci-003, our method boosts the performance of standard few-shot prompting by $8.0\%$ on GSM8K and $17.8\%$ on MultiArith; it also improves the performance of CoT by $6.0\%$ on GSM8K and $6.0\%$ on MathQA, respectively. Furthermore, our method also showcases impressive performance on robustness evaluation.

Comments:	Preprint
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.14497 [cs.CL]
	(or arXiv:2305.14497v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.14497

Submission history

From: Zhiheng Xi [view email]
[v1] Tue, 23 May 2023 19:58:30 UTC (211 KB)
[v2] Thu, 18 Apr 2024 07:27:00 UTC (3,803 KB)

Computer Science > Computation and Language

Title:Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators