Latent Autoregressive Source Separation

Authors

  • Emilian Postolache Sapienza University of Rome, Italy
  • Giorgio Mariani Sapienza University of Rome, Italy
  • Michele Mancusi Sapienza University of Rome, Italy
  • Andrea Santilli Sapienza University of Rome, Italy
  • Luca Cosmo Ca’ Foscari University of Venice, Italy University of Lugano, Switzerland
  • Emanuele Rodolà Sapienza University of Rome, Italy

DOI:

https://doi.org/10.1609/aaai.v37i8.26131

Keywords:

ML: Deep Generative Models & Autoencoders, ML: Applications, ML: Bayesian Learning, ML: Deep Neural Network Algorithms, ML: Dimensionality Reduction/Feature Selection, ML: Unsupervised & Self-Supervised Learning

Abstract

Autoregressive models have achieved impressive results over a wide range of domains in terms of generation quality and downstream task performance. In the continuous domain, a key factor behind this success is the usage of quantized latent spaces (e.g., obtained via VQ-VAE autoencoders), which allow for dimensionality reduction and faster inference times. However, using existing pre-trained models to perform new non-trivial tasks is difficult since it requires additional fine-tuning or extensive training to elicit prompting. This paper introduces LASS as a way to perform vector-quantized Latent Autoregressive Source Separation (i.e., de-mixing an input signal into its constituent sources) without requiring additional gradient-based optimization or modifications of existing models. Our separation method relies on the Bayesian formulation in which the autoregressive models are the priors, and a discrete (non-parametric) likelihood function is constructed by performing frequency counts over latent sums of addend tokens. We test our method on images and audio with several sampling strategies (e.g., ancestral, beam search) showing competitive results with existing approaches in terms of separation quality while offering at the same time significant speedups in terms of inference time and scalability to higher dimensional data.

Downloads

Published

2023-06-26

How to Cite

Postolache, E., Mariani, G., Mancusi, M., Santilli, A., Cosmo, L., & Rodolà, E. (2023). Latent Autoregressive Source Separation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(8), 9444-9452. https://doi.org/10.1609/aaai.v37i8.26131

Issue

Section

AAAI Technical Track on Machine Learning III