Information, Vol. 11, No. 3 (2020)
ARTICLE
TITLE

Bimodal Emotion Recognition Model for Minnan Songs

Zhenglong Xiang    
Xialei Dong    
Yuanxiang Li    
Fei Yu    
Xing Xu and Hongrun Wu    

Abstract

Most existing research papers study the emotion recognition of Minnan songs from the perspectives of music analysis theory and music appreciation. However, these investigations do not explore the possibility of automatic emotion recognition for Minnan songs. In this paper, we propose a model consisting of four main modules that classifies the emotion of Minnan songs using bimodal data: song lyrics and audio. In the proposed model, an attention-based Long Short-Term Memory (LSTM) neural network extracts lyrical features, and a Convolutional Neural Network (CNN) extracts audio features from the spectrum. The two kinds of extracted features are then combined by multimodal compact bilinear pooling, and the combined features are fed to the classification module to determine the song's emotion. We designed three groups of experiments to investigate the classification performance of different combinations of the four main modules, to compare the proposed model with current approaches, and to examine the influence of a few key parameters on emotion recognition performance. The results show that the proposed model outperforms all other experimental groups. With an appropriate combination of parameters, the accuracy, precision, and recall of the proposed model all exceed 0.80.
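The abstract names multimodal compact bilinear pooling as the fusion step but gives no implementation details. The following is a minimal sketch of the general technique (a count sketch of each feature vector, followed by circular convolution of the two sketches), not the authors' code. The feature values, the output dimension of 8, and all function names are illustrative assumptions; the short vectors merely stand in for the outputs of the attention-based LSTM and the CNN. The circular convolution is written out directly for readability, whereas practical implementations usually compute it with an FFT.

```python
import random


def make_sketch_params(input_dim, output_dim, rng):
    """Draw a random bucket index and a random sign for each input coordinate."""
    buckets = [rng.randrange(output_dim) for _ in range(input_dim)]
    signs = [rng.choice((-1, 1)) for _ in range(input_dim)]
    return buckets, signs


def count_sketch(vec, buckets, signs, output_dim):
    """Project vec to output_dim dimensions: add each signed entry to its bucket."""
    out = [0.0] * output_dim
    for value, bucket, sign in zip(vec, buckets, signs):
        out[bucket] += sign * value
    return out


def circular_convolve(a, b):
    """Circular convolution of two equal-length vectors (direct O(d^2) form)."""
    d = len(a)
    out = [0.0] * d
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[(i + j) % d] += ai * bj
    return out


def compact_bilinear_pool(x, y, params_x, params_y, output_dim):
    """Fuse x and y: sketch each vector, then convolve the two sketches."""
    sketch_x = count_sketch(x, *params_x, output_dim)
    sketch_y = count_sketch(y, *params_y, output_dim)
    return circular_convolve(sketch_x, sketch_y)


# Hypothetical stand-ins for the two modalities (values are made up).
lyric_features = [0.5, -1.0, 2.0]       # would come from the attention-based LSTM
audio_features = [1.0, 0.0, -0.5, 3.0]  # would come from the CNN
OUTPUT_DIM = 8                          # assumed fused dimension, chosen for illustration

rng = random.Random(0)  # fixed seed so the random projections are reproducible
params_lyric = make_sketch_params(len(lyric_features), OUTPUT_DIM, rng)
params_audio = make_sketch_params(len(audio_features), OUTPUT_DIM, rng)

fused = compact_bilinear_pool(
    lyric_features, audio_features, params_lyric, params_audio, OUTPUT_DIM
)
print(len(fused))
```

The fused vector has the chosen output dimension regardless of the two input sizes. It equals a count sketch of the full outer product of the two feature vectors, so pairwise interactions between lyric and audio features are captured without materialising that product. In a model like the one described, this vector would then be passed to the classification module.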

Similar articles

       
 
Sakib Shahriar, Noora Al Roken and Imran Zualkernan    
The automatic classification of poems into various categories, such as by author or era, is an interesting problem. However, most current work categorizing Arabic poems into eras or emotions has utilized traditional feature engineering and machine learni...
Journal: Computers

 
Seunguook Lim and Jihie Kim    
Emotion recognition in conversation (ERC) is receiving more and more attention, as interactions between humans and machines increase in a variety of services such as chat-bot and virtual assistants. As emotional expressions within a conversation can heav...
Journal: Algorithms

 
Mashael Aldayel, Amira Kharrat and Abeer Al-Nafjan    
Individual choices and preferences are important factors that impact decision making. Artificial intelligence can predict decisions by objectively detecting individual choices and preferences using natural language processing, computer vision, and machin...
Journal: Applied Sciences

 
Huan-Yu Chen, Chuen-Horng Lin, Jyun-Wei Lai and Yung-Kuan Chan    
This paper proposes a multi-convolutional neural network (CNN)-based system for the detection, tracking, and recognition of the emotions of dogs in surveillance videos. This system detects dogs in each frame of a video, tracks the dogs in the video, and ...
Journal: Applied Sciences

 
Jialin Zhang, Mairidan Wushouer, Gulanbaier Tuerhong and Hanfang Wang    
Emotional speech synthesis is an important branch of human-computer interaction technology that aims to generate emotionally expressive and comprehensible speech based on the input text. With the rapid development of speech synthesis technology based on ...
Journal: Applied Sciences