Inicio  /  Applied Sciences  /  Vol: 13 Par: 12 (2023)  /  Artículo
ARTÍCULO
TITULO

MTR-SAM: Visual Multimodal Text Recognition and Sentiment Analysis in Public Opinion Analysis on the Internet

Xing Liu    
Fupeng Wei    
Wei Jiang    
Qiusheng Zheng    
Yaqiong Qiao    
Jizong Liu    
Liyue Niu    
Ziwei Chen and Hangcheng Dong    

Resumen

Existing methods for monitoring internet public opinion rely primarily on regular crawling of textual information on web pages but cannot quickly and accurately acquire and identify textual information in images and videos and discriminate sentiment. The problems make this a challenging research point for multimodal information detection in an internet public opinion scenario. In this paper, we look at how to dynamically monitor the internet opinion information (mostly images and videos) that different websites post. Based on the most recent advancements in text recognition, this paper proposes a new method of visual multimodal text recognition and sentiment analysis (MTR-SAM) for internet public opinion analysis scenarios. In the detection module, a LK-PAN network with large sensory fields is proposed to enhance the CML distillation strategy, and an RSE-FPN with a residual attention mechanism is used to improve feature map representation. Second, it proposes that the original CTC decoder be replaced with a GTC method to solve earlier problems with text detection at arbitrary rotation angles. Additionally, the performance of scene text detection for arbitrary rotation angles is improved using a sinusoidal loss function for rotation recognition. Finally, the improved sentiment analysis model is used to predict the sentiment polarity of the text recognition results. The experimental results show that the new method proposed in this paper improves recognition speed by 31.77%, recognition accuracy by 10.78% on the video dataset, and the F1 score of the multimodal sentiment analysis model by 4.42% on the self-built internet public opinion dataset (lab dataset). The method proposed provides significant technical support for internet public opinion analysis in multimodal domains.

 Artículos similares

       
 
Yan Wang, Nan Guan, Jie Li and Xiaoli Wang    
Fourier ptychographic microscopy (FPM) is a computational imaging technology that has endless vitality and application potential in digital pathology. Colored pathological image analysis is the foundation of clinical diagnosis, basic research, and most b... ver más
Revista: Applied Sciences

 
Jinjia Zhou and Jian Yang    
Compressive Sensing (CS) has emerged as a transformative technique in image compression, offering innovative solutions to challenges in efficient signal representation and acquisition. This paper provides a comprehensive exploration of the key components... ver más
Revista: Information

 
Jorge Juarez-Lucero, Maria Guevara-Villa, Anabel Sanchez-Sanchez, Raquel Diaz-Hernandez and Leopoldo Altamirano-Robles    
Sodium dodecyl sulfate?polyacrylamide gel electrophoresis (SDS-PAGE) is used to identify protein presence, absence, or overexpression and usually, their interpretation is visual. Some published methods can localize the position of proteins using image an... ver más
Revista: Algorithms

 
Kalyan Chatterjee, M. Raju, N. Selvamuthukumaran, M. Pramod, B. Krishna Kumar, Anjan Bandyopadhyay and Saurav Mallik    
According to global data on visual impairment from the World Health Organization in 2010, an estimated 285 million individuals, including 39 million who are blind, face visual impairments. These individuals use non-contact methods such as voice commands ... ver más
Revista: Information

 
Makrina Viola Kosti, Maurice Benayoun, Nefeli Georgakopoulou, Sotiris Diplaris, Theodora Pistola, Vasileios-Rafail Xefteris, Athina Tsanousa, Kalliopi Valsamidou, Panagiota Koulali, Yash Shekhawat, Piera Sciama, Ilias Kalisperakis, Stefanos Vrochidis and Ioannis Kompatsiaris    
Demographic change confronts us with an ever-increasing number of elderly people who face isolation and socialization issues. Background: The main challenge of this study is to inject emotional and aesthetic aspects into the design process of a virtual r... ver más
Revista: Applied Sciences