Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large Language Models

Chen, Xiaojun; Wang, Tianle; Qiu, Tianhao; Qin, Jianbin; Yang, Min

arXiv:2405.06674 (cs)

[Submitted on 4 May 2024]

Abstract: Despite the success of large language models (LLMs) in Text-to-SQL tasks, open-source LLMs encounter challenges in contextual understanding and response coherence. To tackle these issues, we present Open-SQL, a systematic methodology tailored for Text-to-SQL with open-source LLMs. Our contributions include a comprehensive evaluation of open-source LLMs in Text-to-SQL tasks, the Open Prompt strategy for effective question representation, and novel strategies for supervised fine-tuning. We explore the benefits of Chain-of-Thought in step-by-step inference and propose the Open Example Curation method for enhanced few-shot learning. Additionally, we introduce token-efficient techniques, namely Variable-length Open DB Schema, Target Column Truncation, and Example Column Truncation, which address the challenges posed by large-scale databases. Our findings emphasize the need for further investigation into the impact of supervised fine-tuning on contextual learning capabilities. Remarkably, our method improved Llama2-7B from 2.54% to 41.04% and Code Llama-7B from 14.54% to 48.24% on the BIRD-Dev dataset. Notably, the performance of Code Llama-7B surpassed that of GPT-4 (46.35%) on the same dataset.
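As a quick check on the figures quoted in the abstract, the following sketch computes the absolute gains in percentage points. Only the five scores come from the abstract; the names `SCORES`, `GPT4_SCORE`, and `gain` are illustrative assumptions and do not appear in the paper.

```python
# Scores on BIRD-Dev as quoted in the abstract (percent): (before, after).
# All names below are illustrative and not taken from the paper.
SCORES = {
    "Llama2-7B": (2.54, 41.04),
    "Code Llama-7B": (14.54, 48.24),
}
GPT4_SCORE = 46.35


def gain(before: float, after: float) -> float:
    """Absolute improvement in percentage points, rounded to 2 decimals."""
    return round(after - before, 2)


# Gain of each open-source model after applying the method.
gains = {model: gain(before, after) for model, (before, after) in SCORES.items()}

# Margin of the improved Code Llama-7B over the quoted GPT-4 score.
margin_over_gpt4 = gain(GPT4_SCORE, SCORES["Code Llama-7B"][1])

for model, g in gains.items():
    print(f"{model}: +{g:.2f} points")
print(f"Code Llama-7B vs GPT-4: +{margin_over_gpt4:.2f} points")
```

Under these figures the gains are 38.50 points for Llama2-7B and 33.70 points for Code Llama-7B, and the improved Code Llama-7B leads the quoted GPT-4 score by 1.89 points.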

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.06674 [cs.CL]
	(or arXiv:2405.06674v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.06674

Submission history

From: Dr. Xiaojun Chen
[v1] Sat, 4 May 2024 15:40:17 UTC (318 KB)
