Learning Transposition-Invariant Interval Features from Symbolic Music and Audio

Lattner, Stefan; Grachten, Maarten; Widmer, Gerhard

arXiv:1806.08236 (cs)

[Submitted on 21 Jun 2018 (v1), last revised 4 Feb 2019 (this version, v2)]


Abstract: Many music-theoretical constructs (such as scale types, modes, cadences, and chord types) are defined in terms of pitch intervals, that is, relative distances between pitches. Therefore, when computer models are employed in music tasks, it can be useful to operate on interval representations rather than on the raw musical surface. Moreover, interval representations are transposition-invariant, which is valuable for tasks such as audio alignment, cover song detection, and music structure analysis. We employ a gated autoencoder to learn fixed-length, invertible, and transposition-invariant interval representations from polyphonic music in the symbolic domain and in audio. We propose an unsupervised training method that yields a musically plausible organization of intervals in the representation space. Based on these representations, a transposition-invariant self-similarity matrix is constructed and used to determine repeated sections in symbolic music and in audio, yielding competitive results in the MIREX task "Discovery of Repeated Themes and Sections".
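The transposition-invariance argument in the abstract can be illustrated with a toy sketch. This is not the gated autoencoder described in the paper: it swaps in hand-coded successive pitch differences as the interval feature and an exact-match binary self-similarity matrix. The MIDI-style pitch numbers, the example melody, and the helper names `intervals`, `transpose`, and `self_similarity` are all assumptions made for illustration.

```python
def intervals(pitches):
    """Successive pitch differences in semitones (MIDI note numbers assumed)."""
    return [b - a for a, b in zip(pitches, pitches[1:])]


def transpose(pitches, shift):
    """Shift every pitch by the same number of semitones."""
    return [p + shift for p in pitches]


def self_similarity(features, width):
    """Binary self-similarity matrix over sliding windows of `width` features.

    Entry [i][j] is 1 when window i and window j are identical, else 0.
    """
    windows = [tuple(features[i:i + width])
               for i in range(len(features) - width + 1)]
    return [[int(a == b) for b in windows] for a in windows]


# Hypothetical toy melody: a four-note motif, then the same motif
# transposed up by five semitones.
motif = [60, 62, 64, 60]
melody = motif + transpose(motif, 5)

# Transposition leaves the interval sequence unchanged.
same_intervals = intervals(motif) == intervals(transpose(motif, 5))

# On raw pitches the repetition is invisible; on intervals it shows up
# as an off-diagonal match between window 0 and window 4.
ssm_pitch = self_similarity(melody, 3)
ssm_interval = self_similarity(intervals(melody), 3)
```

In this sketch the raw-pitch matrix has no off-diagonal matches, while the interval matrix marks the transposed repetition. The paper learns its interval representations from polyphonic music rather than computing them by hand, so this only conveys the underlying idea for a monophonic line.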

Comments:	Paper accepted at the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France, September 23-27; 8 pages, 5 figures
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1806.08236 [cs.SD]
	(or arXiv:1806.08236v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.1806.08236

Submission history

From: Stefan Lattner
[v1] Thu, 21 Jun 2018 13:35:44 UTC (931 KB)
[v2] Mon, 4 Feb 2019 17:09:38 UTC (932 KB)
