Inicio  /  Applied Sciences  /  Vol: 12 Par: 13 (2022)  /  Artículo
ARTÍCULO
TITULO

Similar Word Replacement Method for Improving News Commenter Analysis

Deun Lee and Sunoh Choi    

Resumen

In Korea, it is common to read and comment on news stories on portal sites. To influence public opinion, some people write comments repeatedly, some of which are similar to those posted by others. This has become a serious social issue. In our previous research, we collected approximately 2.68 million news comments posted in April 2017. We classified the political stance of each author using a deep learning model (seq2seq), and evaluated how many similar comments each user wrote, as well as how similar each comment was to those posted by other people, using the Jaccard similarity coefficient. However, as our previous model used Jaccard?s similarity only, the meaning of the comments was not considered. To solve this problem, we propose similar word replacement (SWR) using word2vec and a method to analyze the similarity between user comments and classify the political stance of each user. In this study, we showed that when our model used SWR rather than Jaccard?s similarity, its ability to detect similarity between comments increased 3.2 times, and the accuracy of political stance classification improved by 6%.

Palabras claves

 Artículos similares

       
 
Aliya Jangabylova, Alexander Krassovitskiy, Rustam Mussabayev and Irina Ualiyeva    
The documents similarity metric is a substantial tool applied in areas such as determining topic in relation to documents, plagiarism detection, or problems necessary to capture the semantic, syntactic, or structural similarity of texts. Evaluated result... ver más
Revista: Computation

 
Ju-Sang Lee, Joon-Choul Shin and Choel-Young Ock    
Natural language models brought rapid developments to Natural Language Processing (NLP) performance following the emergence of large-scale deep learning models. Language models have previously used token units to represent natural language while reducing... ver más
Revista: Applied Sciences

 
Boris Melnikov     Pág. 1 - 8
In the paper, we consider all possible subsets of the set of potential roots forming in some situations semi-lattices, by intersection and / or by union. Such structures arise in two similar problems in the theory of formal languages. Specifically, for s... ver más

 
Boris Melnikov     Pág. 1 - 9
In the paper, we consider all possible subsets of the set of potential roots forming in some situations semi­lattices, by intersection and / or by union. Such structures arise in two similar problems in the theory of formal languages. Specifically, for s... ver más

 
Arafat Hossain, Md. Karimuzzaman, Md. Moyazzem Hossain and Azizur Rahman    
Text analytics are well-known in the modern era for extracting information and patterns from text. However, no study has attempted to illustrate the pattern and priorities of newspaper headlines in Bangladesh using a combination of text analytics techniq... ver más
Revista: Information