Inicio  /  Applied Sciences  /  Vol: 12 Par: 2 (2022)  /  Artículo
ARTÍCULO
TITULO

Sound Source Separation Mechanisms of Different Deep Networks Explained from the Perspective of Auditory Perception

Han Li    
Kean Chen    
Lei Wang    
Jianben Liu    
Baoquan Wan and Bing Zhou    

Resumen

Thanks to the development of deep learning, various sound source separation networks have been proposed and made significant progress. However, the study on the underlying separation mechanisms is still in its infancy. In this study, deep networks are explained from the perspective of auditory perception mechanisms. For separating two arbitrary sound sources from monaural recordings, three different networks with different parameters are trained and achieve excellent performances. The networks? output can obtain an average scale-invariant signal-to-distortion ratio improvement (SI-SDRi) higher than 10 dB, comparable with the human performance to separate natural sources. More importantly, the most intuitive principle?proximity?is explored through simultaneous and sequential organization experiments. Results show that regardless of network structures and parameters, the proximity principle is learned spontaneously by all networks. If components are proximate in frequency or time, they are not easily separated by networks. Moreover, the frequency resolution at low frequencies is better than at high frequencies. These behavior characteristics of all three networks are highly consistent with those of the human auditory system, which implies that the learned proximity principle is not accidental, but the optimal strategy selected by networks and humans when facing the same task. The emergence of the auditory-like separation mechanisms provides the possibility to develop a universal system that can be adapted to all sources and scenes.

 Artículos similares

       
 
Francisco Fernández-Zacarías, Juan Luis Beira-Jiménez, Virginia Puyana-Romero and Ricardo Hernández-Molina    
The study aims to diagnose the sound pressure levels inside incubators in a controlled environment under free-field conditions. The tests were carried out in a semi-anechoic room under the standard UNE-EN ISO 3745:2012/A1:2018 in three different operatin... ver más
Revista: Acoustics

 
Grigory Dolgikh, Yuri Morgunov, Alexander Burenin, Vladimir Bezotvetnykh, Vladimir Luchin, Aleksandr Golov and Alexander Tagiltsev    
The methodological and technical possibilities of monitoring temperature fields in the Sea of Japan by acoustic thermometry methods are presented. The proposed tomographic method for monitoring the dynamics and structure of water is based on the transmis... ver más

 
Yan Liang, Yu Chen, Zhou Meng, Xin Zhou and Yichi Zhang    
This paper proposes an underwater broadband target depth estimation method based on the multipath arrival structure in medium and short-range deep-sea environments. The proposed approach involves separating the multipath rays arriving at the vertical lin... ver más

 
Sergey Pereselkov, Venedikt Kuz?kin, Matthias Ehrhardt, Sergey Tkachenko, Pavel Rybyanets and Nikolay Ladykin    
In this paper, we study the variations of holograms of a moving source in an inhomogeneous ocean waveguide. It is assumed that intense internal waves (internal solitons) are the reason for the inhomogeneities of the shallow water waveguide. The results o... ver más

 
Xing Zhao, Xiaoyang Jia, Lin Li and Hanyu Wang    
In this paper, we aim to address the challenge of airflow interference during fault detection in high-speed train bogies by introducing a flow field and investigating the characteristics of the sound field distribution of critical components under its in... ver más
Revista: Applied Sciences