ARTÍCULO
TITULO

The use of probabilistic latent semantic analysis to identify scientific subject spaces and to evaluate the completeness of covering the results of dissertation studies

Petro Lizunov    
Andrii Biloshchytskyi    
Alexander Kuchansky    
Yurii Andrashko    
Svitlana Biloshchytska    

Resumen

The study considers the possibilities of using latent semantic analysis for the tasks of identifying scientific subject spaces and evaluating the completeness of covering the results of dissertation research by science degree seekers.A probabilistic thematic model was built to make it possible to cluster the publications of scholars in scientific areas, taking into account the citation network, which was an important step for solving the problem of identifying scientific subject spaces. As a result of constructing the model, the problem of increasing instability of clustering the citation graph in connection with a decrease in the number of clusters was solved. This problem would arise when combining clusters built on the basis of citation graph clustering, taking into account the similarity of abstracts of scientific publications.In the article, the presentation of text documents is described based on a probabilistic thematic model using n-grams. A probabilistic thematic model was built for the task of determining the completeness of covering the materials of an author?s dissertation research in scientific publications. The approximate values of the threshold coefficients were calculated to evaluate whether the articles of an author included the research provisions that were reflected in the text of the author?s abstract of the dissertation. The probabilistic thematic model for an author?s publications was practised on the basis of the BigARTM tool. Using the constructed model and with the help of a special regularizer, a matrix was found to evaluate the relevance of topics specified by the segments of an author?s dissertation abstracts to documents that are produced by the author?s publications.Important aspects of the possibilities of using latent semantic analysis were studied to identify tasks of scientific subject spaces and to reveal the completeness of covering the results of dissertation research science degree seekers.

 Artículos similares

       
 
Alessandro Rasulo, Sofia Nardoianni, Azzurra Evangelisti and Mauro D?Apuzzo    
Transportation networks are one of the most vulnerable civil infrastructures during an earthquake and an estimation of traffic impacts in the post-earthquake scenario is a crucial aspect in the context of risk assessment and evaluation of remedial measur... ver más
Revista: Infrastructures

 
German Michel Guzman-Acevedo, Juan A. Quintana-Rodriguez, Jose Ramon Gaxiola-Camacho, Guadalupe Esteban Vazquez-Becerra, Vanessa Torres-Moreno and Jesus Guadalupe Monjardin-Quevedo    
In recent years, Interferometric Synthetic Aperture Radar (InSAR) technology has been able to determine the semi-static behavior of bridges. However, most of the research about the use of InSAR in the monitoring of bridges has been applied only in determ... ver más
Revista: Infrastructures

 
Aldo Fiori, Irene Pomarico, Antonio Zarlenga, Vittorio Catani and Guido Leone    
This work extends the overlay and index methods for intrinsic groundwater vulnerability, that typically involve the soil surface and the vadose zone, to groundwater (saturated) transport. The method is ?hybrid? as it combines the standard overlay and ind... ver más
Revista: Water

 
Noah J. Bagazinski and Faez Ahmed    
Ship design is a years-long process that requires balancing complex design trade-offs to create a ship that is efficient and effective. Finding new ways to improve the ship design process could lead to significant cost savings in the time and effort requ... ver más

 
Marco Seracini and Stephen R. Brown    
In this article, we introduce a new mathematical functional whose minimization determines the quality of the solution for the exemplar-based inpainting-by-patch problem. The new functional expression includes finite difference terms in a similar fashion ... ver más
Revista: Applied Sciences