Applied Sciences, Vol. 10, Issue 6 (2020)
ARTICLE
TITLE

An Unsupervised Deep Learning System for Acoustic Scene Analysis

Mou Wang, Xiao-Lei Zhang and Susanto Rahardja

Abstract

Acoustic scene analysis has attracted considerable attention recently. Existing methods are mostly supervised and therefore require well-defined acoustic scene categories and accurate labels. In practice, large amounts of unlabeled audio data are available, but labeling data at scale is both costly and time-consuming. Unsupervised acoustic scene analysis, on the other hand, requires no manual labeling, but it is known to perform significantly worse and has therefore not been well explored. In this paper, a new unsupervised method based on deep auto-encoder networks and spectral clustering is proposed. It first extracts a bottleneck feature from the original acoustic feature of audio clips with an auto-encoder network, and then employs spectral clustering to further reduce the noise and unrelated information in the bottleneck feature. Finally, it conducts hierarchical clustering on the low-dimensional output of the spectral clustering. To fully utilize the spatial information of stereo audio, we further apply a binaural representation and conduct joint clustering on it. To the best of our knowledge, this is the first time that a binaural representation has been used in unsupervised learning. Experimental results show that the proposed method outperforms state-of-the-art competing methods.
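
The last two stages of the pipeline described in the abstract (a spectral embedding followed by hierarchical clustering) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The auto-encoder stage is omitted, and synthetic vectors stand in for the bottleneck features. The Gaussian affinity, the `sigma` value, the row normalization, the average-linkage rule, and the function names `spectral_embedding` and `agglomerative_clustering` are all assumptions made for illustration.

```python
import numpy as np


def spectral_embedding(features, n_components, sigma=1.0):
    """Embed rows of `features` with the top eigenvectors of a normalized affinity."""
    # Pairwise squared Euclidean distances between clips.
    sq_norms = np.sum(features ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * features @ features.T
    sq_dists = np.maximum(sq_dists, 0.0)
    # Gaussian affinity (an assumed choice; the abstract does not specify one).
    affinity = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(affinity, 0.0)
    # Symmetric normalization D^{-1/2} W D^{-1/2}.
    degree = affinity.sum(axis=1)
    inv_sqrt = 1.0 / np.sqrt(np.maximum(degree, 1e-12))
    normalized = affinity * inv_sqrt[:, None] * inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order; keep the largest ones.
    _, vectors = np.linalg.eigh(normalized)
    embedding = vectors[:, -n_components:]
    # Row-normalize so each clip lies on the unit sphere.
    norms = np.linalg.norm(embedding, axis=1, keepdims=True)
    return embedding / np.maximum(norms, 1e-12)


def agglomerative_clustering(points, n_clusters):
    """Average-linkage agglomerative clustering; returns one integer label per row."""
    clusters = [[i] for i in range(len(points))]
    dists = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=2)
    while len(clusters) > n_clusters:
        best = (np.inf, 0, 1)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Mean pairwise distance between the two candidate clusters.
                d = dists[np.ix_(clusters[a], clusters[b])].mean()
                if d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    labels = np.empty(len(points), dtype=int)
    for label, members in enumerate(clusters):
        labels[members] = label
    return labels


# Synthetic stand-in for bottleneck features: two well-separated groups of clips.
rng = np.random.default_rng(0)
blob_a = rng.normal(loc=0.0, scale=0.3, size=(20, 8))
blob_b = rng.normal(loc=3.0, scale=0.3, size=(20, 8))
bottleneck = np.vstack([blob_a, blob_b])

embedding = spectral_embedding(bottleneck, n_components=2, sigma=1.0)
labels = agglomerative_clustering(embedding, n_clusters=2)
print(labels)
```

In this toy setting the embedding maps each group to nearly a single point, so the hierarchical step only has to merge points that already coincide. Real bottleneck features would be noisier, and the affinity bandwidth and number of eigenvectors would need tuning.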
