Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

Ryan, Gabriel; Jain, Siddhartha; Shang, Mingyue; Wang, Shiqi; Ma, Xiaofei; Ramanathan, Murali Krishna; Ray, Baishakhi

Computer Science > Software Engineering

arXiv:2402.00097 (cs)

[Submitted on 31 Jan 2024 (v1), last revised 2 Apr 2024 (this version, v2)]

Title:Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

Authors:Gabriel Ryan, Siddhartha Jain, Mingyue Shang, Shiqi Wang, Xiaofei Ma, Murali Krishna Ramanathan, Baishakhi Ray

View PDF HTML (experimental)

Abstract:Testing plays a pivotal role in ensuring software quality, yet conventional Search Based Software Testing (SBST) methods often struggle with complex software units, achieving suboptimal test coverage. Recent works using large language models (LLMs) for test generation have focused on improving generation quality through optimizing the test generation context and correcting errors in model outputs, but use fixed prompting strategies that prompt the model to generate tests without additional guidance. As a result LLM-generated testsuites still suffer from low coverage. In this paper, we present SymPrompt, a code-aware prompting strategy for LLMs in test generation. SymPrompt's approach is based on recent work that demonstrates LLMs can solve more complex logical problems when prompted to reason about the problem in a multi-step fashion. We apply this methodology to test generation by deconstructing the testsuite generation process into a multi-stage sequence, each of which is driven by a specific prompt aligned with the execution paths of the method under test, and exposing relevant type and dependency focal context to the model. Our approach enables pretrained LLMs to generate more complete test cases without any additional training. We implement SymPrompt using the TreeSitter parsing framework and evaluate on a benchmark challenging methods from open source Python projects. SymPrompt enhances correct test generations by a factor of 5 and bolsters relative coverage by 26% for CodeGen2. Notably, when applied to GPT-4, SymPrompt improves coverage by over 2x compared to baseline prompting strategies.

Subjects:	Software Engineering (cs.SE); Machine Learning (cs.LG)
Cite as:	arXiv:2402.00097 [cs.SE]
	(or arXiv:2402.00097v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2402.00097

Submission history

From: Siddhartha Jain [view email]
[v1] Wed, 31 Jan 2024 18:21:49 UTC (3,976 KB)
[v2] Tue, 2 Apr 2024 21:23:03 UTC (3,975 KB)

Computer Science > Software Engineering

Title:Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators