ARTICLE
TITLE

Review of existing text-to-speech algorithms

Nikita Kireev    
Eugene Ilyushin    

Abstract

Scientists have long worked on algorithms for translating text written in natural language into speech. The quality of these algorithms left much to be desired, however, until deep learning methods became practical to apply. With the arrival of the necessary computing resources and the accumulation of enough training data, these methods have become widely used in machine learning in general and in speech synthesis in particular. A significant improvement in the quality of text-to-speech algorithms has led to their widespread use in mobile devices, smart speakers, voice assistants, and similar products. It is worth noting, though, that the algorithms of this class developed so far do not always cope with the task correctly. For example, they do not always place stress correctly or render parts of the text with the appropriate intonation. The study of methods and tools for synthesizing speech has therefore become even more relevant. There are many different ways to synthesize speech from text, such as parametric synthesis, compilation synthesis, subject-oriented synthesis, and full rule-based speech synthesis. The purpose of this work is to review existing text-to-speech algorithms and to carry out a comparative analysis of them. The main algorithms considered are WaveNet, DeepVoice, Tacotron, DeepVoice 2, DeepVoice 3, and Tacotron 2. The comparison found that the best at present are DeepVoice 3 and Tacotron 2, since their quality assessments are closest to those of professionally recorded speech.
