Information, Vol. 12, Issue 4 (2021)
ARTICLE
TITLE

A Distributed Approach to Speaker Count Problem in an Open-Set Scenario by Clustering Pitch Features

Sakshi Pandey and Amit Banerjee    

Abstract

Counting the number of speakers in an audio sample can lead to innovative applications, such as a real-time ranking system. Researchers have studied advanced machine learning approaches for solving the speaker count problem. However, these solutions are not efficient in real-time environments, as they require pre-processing of a finite set of data samples. Another approach is to solve the problem with unsupervised learning or audio processing techniques. Research in this category is limited and does not consider the large-scale open-set environment. In this paper, we propose a distributed clustering approach to address the speaker count problem. The separability of the speakers is computed using statistical pitch parameters. The proposed solution uses multiple microphones available in smartphones across a large geographical area to capture audio samples and extract statistical pitch features from them. These features are shared between the nodes to estimate the number of speakers in the neighborhood. One of the major challenges is to reduce the error count that arises from the proximity of the users and the multiple microphones. We evaluate the algorithm's performance using real smartphones in a multi-group arrangement by capturing parallel conversations between the users in both indoor and outdoor scenarios. The average error count distance is 1.667 in a multi-group scenario. In indoor environments, the average error count distance is 16%, which is better than in the outdoor environment.
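The pipeline outlined in the abstract (per-node pitch statistics, feature sharing, clustering, and an error count metric) can be illustrated with a minimal sketch. It is not the authors' algorithm: the greedy leader clustering, the 15 Hz threshold, the convention that a pitch value of 0 marks an unvoiced frame, and all function names are assumptions made for illustration. The error count distance is assumed here to be the mean absolute difference between estimated and actual speaker counts.

```python
import math
from statistics import mean, pstdev


def pitch_features(pitch_track_hz):
    """Summarise one audio sample's pitch track as (mean, std) in Hz."""
    # Assumption: a value of 0 marks an unvoiced frame and is ignored.
    voiced = [f for f in pitch_track_hz if f > 0]
    return (mean(voiced), pstdev(voiced))


def count_speakers(features, threshold_hz=15.0):
    """Greedy leader clustering over shared (mean, std) pitch features.

    A feature joins the first cluster whose centroid lies within
    threshold_hz (Euclidean distance); otherwise it starts a new cluster.
    The number of clusters is the speaker count estimate.
    """
    clusters = []  # each cluster is a list of (mean, std) tuples
    for feat in features:
        for members in clusters:
            centroid = tuple(mean(axis) for axis in zip(*members))
            if math.dist(feat, centroid) <= threshold_hz:
                members.append(feat)
                break
        else:
            clusters.append([feat])
    return len(clusters)


def error_count_distance(estimated, actual):
    """Assumed definition: mean absolute difference between counts."""
    return mean(abs(e - a) for e, a in zip(estimated, actual))


# Two hypothetical nodes (smartphones) extract features locally ...
node_a = [pitch_features([120, 125, 0, 118, 122]),
          pitch_features([210, 205, 215, 0, 208])]
node_b = [pitch_features([121, 119, 124, 0]),
          pitch_features([165, 170, 160, 168])]

# ... and share them, so that a speaker heard by both is counted once.
shared = node_a + node_b
print(count_speakers(shared))
```

In this toy input, the first sample of each node has nearly the same pitch statistics, so pooling the features before clustering keeps that speaker from being counted twice. This is the kind of duplicate error the abstract attributes to users being close to multiple microphones.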

Similar articles

       
 
Pablo Brusola, Sergio Garcia-Nieto, Jose Vicente Salcedo, Miguel Martinez and Robert H. Bishop    
This paper presents a mathematical modeling approach utilizing a fuzzy modeling framework for fixed-wing aircraft systems with the goal of creating a highly desirable mathematical representation for model-based control design applications. The starting p...
Journal: Aerospace

 
Saikat Das, Mohammad Ashrafuzzaman, Frederick T. Sheldon and Sajjan Shiva    
The distributed denial of service (DDoS) attack is one of the most pernicious threats in cyberspace. Catastrophic failures over the past two decades have resulted in catastrophic and costly disruption of services across all sectors and critical infrastru...
Journal: Algorithms

 
Michele Tonan, Alberto Pasetto and Alberto Doria    
In this paper, the possibility of harvesting energy from the vibrations of a plate is analyzed. The harvester takes the form of a cantilever dynamic vibration absorber equipped with a piezoelectric layer and tuned by means of a tip mass to the first mode...
Journal: Applied Sciences

 
Hamid Reza Ahmadi, Zaher Rahimi and Wojciech Sumelka    
In this study, the behavior of double-walled carbon nanotubes (DWCNTs) used as mass sensors is explored under various boundary conditions; particular attention is paid to the crucial topic of resonant nanomechanical mass sensors. In the presented approac...
Journal: Applied Sciences

 
Lilai Jin, Sarah J. Higgins, James A. Thompson, Michael P. Strager, Sean E. Collins and Jason A. Hubbart    
Saturated hydraulic conductivity (Ksat) is a hydrologic flux parameter commonly used to determine water movement through the saturated soil zone. Understanding the influences of land-use-specific Ksat on the model estimation error of water balance compon...
Journal: Water