ARTICLE
TITLE

Russian text corpora for deception detection studies

T. A. Litvinova    
O. V. Zagorovskaya    
O. A. Litvinova    

Abstract

Text-based deception detection is gaining significance, as related studies have both theoretical and practical value and a range of applications in policing, security, and customs, as well as in countering predatory communications (e.g. Internet scams). Designing text corpora is essential for these studies. Text-based deception detection has mostly been studied for English and a few other European languages. Research on Slavic languages remains insufficient, mainly because no corresponding corpora are available. In this article we present an overview of existing text corpora used in studies of text-based deception detection, together with a detailed description of the available Russian corpora specially designed for this task.
