Applied Sciences, Vol. 12, No. 2 (2022)
ARTICLE
TITLE

Sound Source Separation Mechanisms of Different Deep Networks Explained from the Perspective of Auditory Perception

Han Li    
Kean Chen    
Lei Wang    
Jianben Liu    
Baoquan Wan and Bing Zhou    

Abstract

Thanks to the development of deep learning, various sound source separation networks have been proposed and have made significant progress. However, the study of the underlying separation mechanisms is still in its infancy. In this study, deep networks are explained from the perspective of auditory perception mechanisms. To separate two arbitrary sound sources from monaural recordings, three different networks with different parameters are trained, and all achieve excellent performance. The networks' outputs reach an average scale-invariant signal-to-distortion ratio improvement (SI-SDRi) higher than 10 dB, comparable with human performance in separating natural sources. More importantly, the most intuitive principle, proximity, is explored through simultaneous and sequential organization experiments. Results show that, regardless of network structure and parameters, the proximity principle is learned spontaneously by all networks: if components are proximate in frequency or time, the networks do not separate them easily. Moreover, the frequency resolution at low frequencies is better than at high frequencies. These behavioral characteristics of all three networks are highly consistent with those of the human auditory system, which implies that the learned proximity principle is not accidental but is the optimal strategy selected by both networks and humans when facing the same task. The emergence of auditory-like separation mechanisms opens the possibility of developing a universal system that can be adapted to all sources and scenes.
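The abstract reports SI-SDRi without stating how it is computed. As a minimal sketch, assuming the commonly used definition of SI-SDR (project the estimate onto the reference source, then compare the energy of that projection with the energy of the residual), the metric and its improvement over the unprocessed mixture could be written as follows. The function names, the `eps` guard, and the toy signals are illustrative assumptions, not the authors' evaluation code.

```python
import math


def si_sdr(estimate, reference, eps=1e-12):
    """Scale-invariant signal-to-distortion ratio in dB (common definition).

    `estimate` and `reference` are equal-length sequences of samples.
    `eps` is an illustrative guard against division by zero and log(0).
    """
    dot = sum(e * r for e, r in zip(estimate, reference))
    ref_energy = sum(r * r for r in reference)
    # Optimal scaling of the reference, which makes the metric scale-invariant.
    scale = dot / (ref_energy + eps)
    target = [scale * r for r in reference]
    residual = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    residual_energy = sum(n * n for n in residual)
    return 10.0 * math.log10((target_energy + eps) / (residual_energy + eps))


def si_sdr_improvement(estimate, mixture, reference):
    """SI-SDRi: gain of the separated estimate over the unprocessed mixture."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)


# Toy example (hypothetical signals): two orthogonal "sources".
reference = [1.0, 0.0, 1.0, 0.0]
interferer = [0.0, 1.0, 0.0, 1.0]
mixture = [r + i for r, i in zip(reference, interferer)]
# A separator that attenuates the interferer amplitude by a factor of 10.
estimate = [r + 0.1 * i for r, i in zip(reference, interferer)]

print(round(si_sdr_improvement(estimate, mixture, reference), 3))  # 20.0
```

In this toy case the mixture has an SI-SDR of 0 dB and the estimate 20 dB, so the improvement is 20 dB. Rescaling the estimate by any nonzero constant leaves the value unchanged, which is the point of the scale-invariant form.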
