Long Horizon Temperature Scaling

Shih, Andy; Sadigh, Dorsa; Ermon, Stefano

Computer Science > Machine Learning

arXiv:2302.03686 (cs)

[Submitted on 7 Feb 2023 (v1), last revised 29 Sep 2023 (this version, v2)]

Title:Long Horizon Temperature Scaling

Authors:Andy Shih, Dorsa Sadigh, Stefano Ermon

View PDF

Abstract:Temperature scaling is a popular technique for tuning the sharpness of a model distribution. It is used extensively for sampling likely generations and calibrating model uncertainty, and even features as a controllable parameter to many large language models in deployment. However, autoregressive models rely on myopic temperature scaling that greedily optimizes the next token. To address this, we propose Long Horizon Temperature Scaling (LHTS), a novel approach for sampling from temperature-scaled joint distributions. LHTS is compatible with all likelihood-based models, and optimizes for the long horizon likelihood of samples. We derive a temperature-dependent LHTS objective, and show that finetuning a model on a range of temperatures produces a single model capable of generation with a controllable long horizon temperature parameter. We experiment with LHTS on image diffusion models and character/language autoregressive models, demonstrating advantages over myopic temperature scaling in likelihood and sample quality, and showing improvements in accuracy on a multiple choice analogy task by $10\%$.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.03686 [cs.LG]
	(or arXiv:2302.03686v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.03686

Submission history

From: Andy Shih [view email]
[v1] Tue, 7 Feb 2023 18:59:32 UTC (1,052 KB)
[v2] Fri, 29 Sep 2023 18:44:40 UTC (890 KB)

Computer Science > Machine Learning

Title:Long Horizon Temperature Scaling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Long Horizon Temperature Scaling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators