Information, Vol. 14, No. 10 (2023)
ARTICLE
TITLE

Sound Event Detection in Domestic Environment Using Frequency-Dynamic Convolution and Local Attention

Grigorios-Aris Cheimariotis and Nikolaos Mitianoudis    

Abstract

This work describes a methodology for sound event detection in domestic environments. Efficient solutions to this task can support the autonomous living of the elderly. The methodology addresses the "Challenge on Detection and Classification of Acoustic Scenes and Events (DCASE)" 2023, and more specifically Task 4a, "Sound event detection of domestic activities". This task involves the detection of 10 common events in domestic environments in 10 s sound clips. The events may have arbitrary duration within the 10 s clip. The main components of the methodology are data augmentation on the mel-spectrograms that represent the sound clips, feature extraction by passing the spectrograms through a frequency-dynamic convolution network with an extra attention module in sequence with each convolution, concatenation of these features with BEATs embeddings, and the use of a BiGRU for sequence modeling. In addition, a mean teacher model is employed to leverage unlabeled data. This research focuses on the effect of data augmentation techniques, feature extraction models, and self-supervised learning. The main contribution is the proposed feature extraction model, which uses weighted attention on frequency in each convolution, combined in sequence with a local attention module adopted from computer vision. The proposed system shows promising and robust performance.
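
The abstract states only that a mean teacher model is used to leverage unlabeled data and gives no implementation details. As a rough illustration of the general technique, the sketch below shows the usual mean-teacher idea: the teacher's weights track an exponential moving average (EMA) of the student's weights, and a consistency loss compares the two models' predictions on unlabeled clips. The function names, the dictionary-of-lists weight layout, the value of `alpha`, and the choice of mean squared error are all assumptions made for illustration and are not taken from the article.

```python
from typing import Dict, List, Sequence

# Hypothetical weight container: parameter name -> flat list of values.
Params = Dict[str, List[float]]


def ema_update(teacher: Params, student: Params, alpha: float = 0.999) -> None:
    """Update teacher weights in place as an EMA of the student weights.

    teacher <- alpha * teacher + (1 - alpha) * student
    The default alpha is an illustrative assumption, not a value from the article.
    """
    for name, student_values in student.items():
        teacher_values = teacher[name]
        for i, s in enumerate(student_values):
            teacher_values[i] = alpha * teacher_values[i] + (1.0 - alpha) * s


def consistency_loss(teacher_probs: Sequence[float],
                     student_probs: Sequence[float]) -> float:
    """Mean squared error between teacher and student predictions.

    MSE is one common choice of consistency loss; it is assumed here.
    """
    if len(teacher_probs) != len(student_probs):
        raise ValueError("prediction vectors must have the same length")
    n = len(student_probs)
    return sum((t - s) ** 2 for t, s in zip(teacher_probs, student_probs)) / n
```

In a training loop of this kind, the student would be updated by gradient descent on a supervised loss plus the consistency loss, and `ema_update` would be called after each student step; the teacher itself receives no gradient updates.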

Similar articles

Soo-Jong Kim and Yong-Joo Chung    
To alleviate the problem of performance degradation due to the varied sound durations of competing classes in sound event detection, we propose a method that utilizes multi-scale features for sound event detection. We employed a feature-pyramid component...
Journal: Applied Sciences

 
Francesca Terranova, Alessandra Raffa, Stefano Floridia, Clara Monaco and Livio Favaro    
Cetacean bycatch is increasing worldwide and poses a threat to the conservation of several delphinids. The bottlenose dolphin (Tursiops truncatus) is frequently involved in bycatch incidents, due to its coastal distribution and opportunistic behaviour. T...

 
Diego de Benito-Gorrón, Daniel Ramos and Doroteo T. Toledano    
The Sound Event Detection task aims to determine the temporal locations of acoustic events in audio clips. In recent years, the relevance of this field is rising due to the introduction of datasets such as Google AudioSet or DESED (Domestic Environment S...
Journal: Applied Sciences

 
Oleksandr Zaporozhets and Larisa Levchenko    
Aircraft performance and noise database together with operational weights (depending on flight distances) and operational procedures (including low noise procedures) significantly influence results of noise exposure contour maps assessment in conditions ...
Journal: Aerospace

 
Jinxiang Zeng, Du Zhang, Zhiyi Li and Xiaolin Li    
Aiming at the audio event recognition problem of speech recognition, a decision fusion method based on the Transformer and Causal Dilated Convolutional Network (TCDCN) framework is proposed. This method can adjust the model sound events for a long time a...
Journal: Applied Sciences