Mar 20, 2024 · In this paper, we propose the CodePLAN framework, which aims to transfer LLMs' reasoning capabilities to smaller models through distillation.
Consequently, there is a compelling need to transfer LLMs' code generation reasoning abilities to smaller models. In this paper, we propose the ...
[PDF] Distilling Complex Reasoning Capabilities from LLMs by ...
Thus, the goal of our research is to enable complex arithmetic reasoning in small models for deploying at scale. Knowledge distillation (Hinton, Vinyals, and ...
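For reference, a minimal sketch of the classic knowledge-distillation loss cited above (Hinton et al.), assuming a standard classification setup; the temperature T, the mixing weight alpha, and all names are illustrative rather than taken from the cited papers.

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend a soft-target term (imitate the teacher) with a hard-target term (fit the labels)."""
    # Soft targets: KL divergence between temperature-softened teacher and student
    # distributions, scaled by T^2 so gradient magnitudes stay comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: ordinary cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

For language models the same idea is applied token by token over the vocabulary distribution, which requires white-box access to the teacher's logits; the data-level approaches described in the surrounding snippets avoid that requirement.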
Discover how the CodePLAN framework leverages the reasoning capabilities of LLMs to boost the code generation performance of smaller models by over 130% on ...
The CodePLAN framework seeks to distill LLMs' reasoning prowess into smaller models using a multi-task learning approach focusing on both code and solution plan ...
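As a rough illustration of what such a multi-task objective might look like (a hedged sketch, not CodePLAN's actual implementation), the same student model can be fine-tuned on teacher-provided code and teacher-provided solution plans with a weighted sum of two language-modeling losses. The prompts, the weight lambda_plan, and the helper names below are assumptions, and a HuggingFace-style causal LM and tokenizer are presumed.

```python
def multitask_step(model, tokenizer, problem, teacher_code, teacher_plan,
                   optimizer, lambda_plan=0.5, device="cpu"):
    """One distillation step on a single problem: code-generation loss + plan-generation loss."""

    def lm_loss(prompt, target):
        # Causal-LM loss over "prompt + target"; labels equal input_ids, so the model
        # learns to reproduce the continuation (in practice the prompt tokens would
        # usually be masked out of the loss).
        text = prompt + target + tokenizer.eos_token
        enc = tokenizer(text, return_tensors="pt", truncation=True).to(device)
        return model(**enc, labels=enc["input_ids"]).loss

    # Task 1: generate code for the problem (teacher-provided reference solution).
    loss_code = lm_loss(f"Problem:\n{problem}\nCode:\n", teacher_code)
    # Task 2: generate a natural-language solution plan (teacher-provided rationale).
    loss_plan = lm_loss(f"Problem:\n{problem}\nPlan:\n", teacher_plan)

    loss = loss_code + lambda_plan * loss_plan
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```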
Apr 11, 2024 · In this work, we propose a novel approach to distilling reasoning abilities from LLMs by leveraging their capacity to explain solutions. We ...
Smaller models can be trained on data generated by LLMs to improve their performance; these distilled models can then serve as cost-effective alternatives to LLMs for the given task ( ...
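A hedged sketch of the data-generation step such a setup implies: prompt the teacher LLM for a plan and a solution, then store the pairs as fine-tuning data for the student. The query_teacher callable and the JSONL layout are placeholders for whatever API and format are actually used.

```python
import json

def build_distillation_set(problems, query_teacher, out_path="distill_data.jsonl"):
    """Collect (problem, teacher solution) pairs as JSONL fine-tuning data for a student model."""
    with open(out_path, "w") as f:
        for problem in problems:
            prompt = ("Solve the following programming problem. "
                      "First write a short step-by-step plan, then the code.\n\n" + problem)
            solution = query_teacher(prompt)  # call into the teacher LLM (assumed interface)
            f.write(json.dumps({"problem": problem, "solution": solution}) + "\n")
```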
Feb 20, 2024 · This survey delves into knowledge distillation (KD) techniques in Large Language Models (LLMs), highlighting KD's crucial role in transferring advanced ...