Generalized Planning in PDDL Domains with Pretrained Large Language Models

Silver, Tom; Dan, Soham; Srinivas, Kavitha; Tenenbaum, Joshua B.; Kaelbling, Leslie Pack; Katz, Michael

Computer Science > Artificial Intelligence

arXiv:2305.11014 (cs)

[Submitted on 18 May 2023 (v1), last revised 18 Dec 2023 (this version, v2)]

Title:Generalized Planning in PDDL Domains with Pretrained Large Language Models

Authors:Tom Silver, Soham Dan, Kavitha Srinivas, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Michael Katz

View PDF HTML (experimental)

Abstract:Recent work has considered whether large language models (LLMs) can function as planners: given a task, generate a plan. We investigate whether LLMs can serve as generalized planners: given a domain and training tasks, generate a program that efficiently produces plans for other tasks in the domain. In particular, we consider PDDL domains and use GPT-4 to synthesize Python programs. We also consider (1) Chain-of-Thought (CoT) summarization, where the LLM is prompted to summarize the domain and propose a strategy in words before synthesizing the program; and (2) automated debugging, where the program is validated with respect to the training tasks, and in case of errors, the LLM is re-prompted with four types of feedback. We evaluate this approach in seven PDDL domains and compare it to four ablations and four baselines. Overall, we find that GPT-4 is a surprisingly powerful generalized planner. We also conclude that automated debugging is very important, that CoT summarization has non-uniform impact, that GPT-4 is far superior to GPT-3.5, and that just two training tasks are often sufficient for strong generalization.

Comments:	AAAI 2024
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.11014 [cs.AI]
	(or arXiv:2305.11014v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2305.11014

Submission history

From: Tom Silver [view email]
[v1] Thu, 18 May 2023 14:48:20 UTC (1,058 KB)
[v2] Mon, 18 Dec 2023 19:44:09 UTC (1,053 KB)

Computer Science > Artificial Intelligence

Title:Generalized Planning in PDDL Domains with Pretrained Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Generalized Planning in PDDL Domains with Pretrained Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators