ARTÍCULO
TITULO

Improvement of the method for scientific publications clustering based on n-gram analysis and fuzzy method for selecting research partners

Petro Lizunov    
Andrii Biloshchytskyi    
Alexander Kuchansky    
Yurii Andrashko    
Svitlana Biloshchytska    

Resumen

For the problem of formation of project teams, in particular, scientific research project groups, there was proposed the comprehensive method, which consists of the two-stage method for clustering the graph of citation of scientists» publications and the method of fuzzy inference for coordination of experts» opinions on the selection of potential partners and their inclusion in the project group.The essence of the two-stage method for clustering publications of scientists is clustering the citation graph based on the proximity of abstracts of publications. The distance between publications is calculated based on the determined metrics and approaches of the n-gram analysis. The described method allows identifying the areas research of scientists, which is a necessary component of the rational choice of a partner for the formation of a project team and is the input information for experts who form this group. The next step is the application of the method of fuzzy inference, which is constructed to coordinate opinions of experts on the creation of project teams. This method consists of three stages. At the first stage, fuzzification is performed through the introduction of function of scientist»s belonging to the area of scientific research. The second phase of fuzzy inference is the statement of experts» requirements to candidates for a place in a project group. At the final stage, defuzzification with the use of the method of the weight center takes place. To verify the fuzzy method for identification of research project groups, the organizations-executors for a fundamental scientific research were determined.Described methods can be used for the problem of formation of scientific research groups and identification the similarities between the fragments of text information based on the n-gram analysis, which is used in the problem of identification of incomplete duplicates between fragments of text information.

 Artículos similares

       
 
Juliano Prado Stradioto, Ariel Orlei Michaloski (Author)     Pág. e51335
The economic growth of a country is directly linked to the growth of several sectors, in which the construction sector is prominent. The objective was to investigate by means of ergonomic analysis the external coating activity performed on building façad... ver más

 
Denis Zolotariov     Pág. 53 - 58
The article is devoted to the research and development of the mechanism of interaction between Wolfram Mathematica programs and Apache Kafka queue to provide the ability to build event-driven applications based on it. The subject of the research is the p... ver más

 
Muhamad Syafiq Abdul Ghani,Norhaslinda Zainal Abidin,Rosshairy Abd Rahman,Antoni Wibowo,Azatuliffah Alwi     Pág. pp. 18 - 31
The improvement of technology brings a significant impact on transportation industries. The taxi industry has undergone tremendous changes with the existent of e-hailing service in the industry. Due to the introduction of mobile applications, e-hailing s... ver más

 
Muhamad Ikhsan Sahal Guntur,Wahyu Setyaningrum     Pág. pp. 159 - 173
The researchers implemented the quasi-experimental method of research. This research was conducted at SMAN 1 Ngemplak, Indonesia. The research was conducted from March to April 2020. In this study, the samples consisted of 70 students divided into two cl... ver más

 
Lina Aulia     Pág. 68 - 72
PT PLB (pseudonym) is a company that produces household appliances and kitchen utensils. The country kettle is one type of product made by PT PLB. The productivity of the country kettle production line decreased by 56%. This was due to defective products... ver más