Combining CNN and Broad Learning for Music Classification

Huan TANG
Ning CHEN

Publication
IEICE TRANSACTIONS on Information and Systems, Vol.E103-D, No.3, pp.695-701
Publication Date: 2020/03/01
Publicized: 2019/12/05
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2019EDP7175
Type of Manuscript: PAPER
Category: Music Information Processing
Keywords:
deep learning, broad learning, random convolutional neural network (RCNN), music classification




Summary: 
Research on music classification has been inspired by the remarkable success of deep learning. To enhance efficiency while maintaining high performance, a hybrid architecture that combines deep learning and Broad Learning (BL) is proposed for music classification tasks. At the feature extraction stage, a Random CNN (RCNN) is adopted to analyze the Mel-spectrogram of the input music sound. Compared with a conventional CNN, the RCNN has a more flexible structure that adapts to the variance among different types of music. At the prediction stage, the BL technique is introduced to enhance prediction accuracy and, at the same time, reduce training time. Experimental results on three benchmark datasets (GTZAN, Ballroom, and Emotion) demonstrate that: i) the proposed scheme achieves higher classification accuracy than the deep learning-based scheme that combines a CNN and an LSTM on all three benchmark datasets; ii) both the RCNN and BL contribute to the performance improvement of the proposed scheme; and iii) the introduction of BL also helps to enhance the prediction efficiency of the proposed scheme.
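The following is a minimal, illustrative sketch (not the authors' code) of the pipeline the abstract describes: a randomly initialized, untrained CNN extracts features from Mel-spectrograms, and a small Broad Learning System with mapped and enhancement nodes learns a ridge-regression readout in closed form instead of iterative back-propagation. Names such as random_cnn_features and broad_learning_fit, and all layer sizes, are hypothetical assumptions for illustration.

```python
import numpy as np
import torch
import torch.nn as nn

torch.manual_seed(0)
rng = np.random.default_rng(0)

# --- Feature extraction: random (untrained) CNN over Mel-spectrograms --------
# Weights are left at their random initialization and frozen; no back-propagation.
rcnn = nn.Sequential(
    nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d((4, 4)),
    nn.Flatten(),
)
for p in rcnn.parameters():
    p.requires_grad_(False)

def random_cnn_features(mel_batch):
    """mel_batch: (N, 1, n_mels, n_frames) tensor of Mel-spectrograms."""
    with torch.no_grad():
        return rcnn(mel_batch).numpy()

# --- Prediction: a tiny Broad Learning System (mapped + enhancement nodes) ---
def broad_learning_fit(X, Y, n_map=200, n_enh=400, lam=1e-2):
    """X: (N, d) features, Y: (N, C) one-hot labels. Returns random projections
    and the ridge-regression output weights (computed in closed form)."""
    Wm = rng.standard_normal((X.shape[1], n_map))   # mapped feature nodes
    Z = np.tanh(X @ Wm)
    We = rng.standard_normal((n_map, n_enh))        # enhancement nodes
    A = np.hstack([Z, np.tanh(Z @ We)])
    W_out = np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T @ Y)
    return Wm, We, W_out

def broad_learning_predict(X, Wm, We, W_out):
    Z = np.tanh(X @ Wm)
    A = np.hstack([Z, np.tanh(Z @ We)])
    return A @ W_out                                # class scores; argmax -> label
```

Because the CNN is never trained and the BL output weights are obtained from a single closed-form solve, training cost is dominated by one pass of feature extraction plus a regularized least-squares fit, which is the efficiency argument made in the abstract.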

