Redirigiendo al acceso original de articulo en 24 segundos...
Inicio  /  Applied Sciences  /  Vol: 12 Par: 3 (2022)  /  Artículo
ARTÍCULO
TITULO

Evaluation of Tacotron Based Synthesizers for Spanish and Basque

Víctor García    
Inma Hernáez and Eva Navas    

Resumen

In this paper, we describe the implementation and evaluation of Text to Speech synthesizers based on neural networks for Spanish and Basque. Several voices were built, all of them using a limited number of data. The system applies Tacotron 2 to compute mel-spectrograms from the input sequence, followed by WaveGlow as neural vocoder to obtain the audio signals from the spectrograms. The limited number of data used for training the models leads to synthesis errors in some sentences. To automatically detect those errors, we developed a new method that is able to find the sentences that have lost the alignment during the inference process. To mitigate the problem, we implemented a guided attention providing the system with the explicit duration of the phonemes. The resulting system was evaluated to assess its robustness, quality and naturalness both with objective and subjective measures. The results reveal the capacity of the system to produce good quality and natural audios.

Palabras claves

 Artículos similares

       
 
Sorin Zoican, Roxana Zoican, Dan Galatchi and Marius Vochin    
This paper illustrates a general framework in which a neural network application can be easily integrated and proposes a traffic forecasting approach that uses neural networks based on graphs. Neural networks based on graphs have the advantage of capturi... ver más
Revista: Applied Sciences

 
Jui-Fa Chen, Yu-Ting Liao and Po-Chun Wang    
Climate change has exacerbated severe rainfall events, leading to rapid and unpredictable fluctuations in river water levels. This environment necessitates the development of real-time, automated systems for water level detection. Due to degradation, tra... ver más
Revista: Water

 
Ling Zhou, Peng Yan, Yanjun Zhang, Honglei Lei, Shuren Hao, Yueqiang Ma and Shaoyou Sun    
The optimization of the production scheme for enhanced geothermal systems (EGS) in geothermal fields is crucial for enhancing heat production efficiency and prolonging the lifespan of thermal reservoirs. In this study, the 4100?4300 m granite diorite str... ver más
Revista: Water

 
Urszula Libal and Pawel Biernacki    
An automatic honey bee classification system based on audio signals for tracking the frequency of workers and drones entering and leaving a hive.
Revista: Applied Sciences

 
Myung-Kyo Seo and Won-Young Yun    
The steel industry is typical process manufacturing, and the quality and cost of the products can be improved by efficient operation of equipment. This paper proposes an efficient diagnosis and monitoring method for the gearbox, which is a key piece of m... ver más
Revista: Applied Sciences