ARTÍCULO
TITULO

SampleCNN: End-to-End Deep Convolutional Neural Networks Using Very Small Filters for Music Classification

Jongpil Lee    
Jiyoung Park    
Keunhyoung Luke Kim and Juhan Nam    

Resumen

Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw waveform-based approach has been explored to directly learn hierarchical characteristics of audio. However, the majority of previous studies have limited their model capacity by taking a frame-level structure similar to short-time Fourier transforms. We previously proposed a CNN architecture which learns representations using sample-level filters beyond typical frame-level input representations. The architecture showed comparable performance to the spectrogram-based CNN model in music auto-tagging. In this paper, we extend the previous work in three ways. First, considering the sample-level model requires much longer training time, we progressively downsample the input signals and examine how it affects the performance. Second, we extend the model using multi-level and multi-scale feature aggregation technique and subsequently conduct transfer learning for several music classification tasks. Finally, we visualize filters learned by the sample-level CNN in each layer to identify hierarchically learned features and show that they are sensitive to log-scaled frequency.

 Artículos similares

       
 
Bojan Ilijoski, Katarina Trojachanec Dineva, Biljana Tojtovska Ribarski, Petar Petrov, Teodora Mladenovska, Milena Trajanoska, Ivana Gjorshoska and Petre Lameski    
A bite from a bug may expose the affected person to serious, life-threatening conditions, which may require immediate medical attention. The identification of the bug bite may be challenging even for experienced medical personnel due to the different man... ver más
Revista: Applied Sciences

 
Xi Lyu, Yushan Sun, Lifeng Wang, Jiehui Tan and Liwen Zhang    
This study aims to solve the problems of sparse reward, single policy, and poor environmental adaptability in the local motion planning task of autonomous underwater vehicles (AUVs). We propose a two-layer deep deterministic policy gradient algorithm-bas... ver más

 
Tao Zhou, Liang Luo, Yuanxin He, Zhiwei Fan and Shengchen Ji    
The panel block is a quite important ?intermediate product? in the shipbuilding process. However, the assembly efficiency of the panel block assembly line is not high. Therefore, rational scheduling optimization is of great significance for improving shi... ver más
Revista: Applied Sciences

 
Pengyu Zhang, Jie Zhang and Jiangming Kan    
The continuous path of a manipulator is often discretized into a series of independent action poses during path tracking, and the inverse kinematic solution of the manipulator?s poses is computationally challenging and yields inconsistent results. This r... ver más
Revista: Applied Sciences

 
Emmanouil Koutoulakis, Louis Marage, Emmanouil Markodimitrakis, Leone Aubignac, Catherine Jenny, Igor Bessieres and Alain Lalande    
MR-Linac is a recent device combining a linear accelerator with an MRI scanner. The improved soft tissue contrast of MR images is used for optimum delineation of tumors or organs at risk (OARs) and precise treatment delivery. Automatic segmentation of OA... ver más
Revista: Algorithms