Inicio  /  Applied Sciences  /  Vol: 10 Par: 13 (2020)  /  Artículo
ARTÍCULO
TITULO

Optimization of Associative Knowledge Graph using TF-IDF based Ranking Score

Hyun-Jin Kim    
Ji-Won Baek and Kyungyong Chung    

Resumen

This study proposes the optimization method of the associative knowledge graph using TF-IDF based ranking scores. The proposed method calculates TF-IDF weights in all documents and generates term ranking. Based on the terms with high scores from TF-IDF based ranking, optimized transactions are generated. News data are first collected through crawling and then are converted into a corpus through preprocessing. Unnecessary data are removed through preprocessing including lowercase conversion, removal of punctuation marks and stop words. In the document term matrix, words are extracted and then transactions are generated. In the data cleaning process, the Apriori algorithm is applied to generate association rules and make a knowledge graph. To optimize the generated knowledge graph, the proposed method utilizes TF-IDF based ranking scores to remove terms with low scores and recreate transactions. Based on the result, the association rule algorithm is applied to create an optimized knowledge model. The performance is evaluated in rule generation speed and usefulness of association rules. The association rule generation speed of the proposed method is about 22 seconds faster. And the lift value of the proposed method for usefulness is about 0.43 to 2.51 higher than that of each one of conventional association rule algorithms.

 Artículos similares

       
 
Lisa Richiardi, Cristina Pignata, Elisabetta Fea, Silvia Bonetta and Elisabetta Carraro    
The microbiological quality assessment of drinking water (DW) and drinking water sources (DWSs) is based on the detection of indicator microorganisms (IMs). However, the relationship between IMs and pathogens has been questioned, as pathogens have been d... ver más
Revista: Water

 
Feifei Tao, Yanling Pi, Meng Zhang, Chi Yuan and Menghua Deng    
With the rapid development of water conservancy engineering and infrastructure construction, there are many safety hazards in the construction process of water conservancy engineering, so it is of great significance to study the potential hazards in the ... ver más
Revista: Water

 
Wei Li, Jun Zhang, Fang Wang and Hanyun Zhou    
The underactuated unmanned surface vessel (USV) has been identified as a promising solution for future maritime transport. However, the challenges of precise trajectory tracking and obstacle avoidance remain unresolved for USVs. To this end, this paper m... ver más

 
Xiangxu Lei, Shengfu Xia, Hongkang Liu, Xiaozhen Wang, Zhenwei Li, Baomin Han, Jizhang Sang, You Zhao and Hao Luo    
The Changchun Observatory of the National Astronomical Observatories, Chinese Academy of Sciences, and the Shanghai Astronomical Observatory are used to generate very short arc (VSA) angle observations of objects in low Earth orbit (LEO) and geostationar... ver más
Revista: Applied Sciences

 
Jianyu Wang, Shuo Ma, Pengpeng Jiao, Lanxin Ji, Xu Sun and Huapu Lu    
This study explores risk factors influencing the at-fault party in traffic accidents and analyzes their impact on traffic accident severity. Based on the traffic accident data of Shenyang City, Liaoning Province, China, from 2018 to 2020, 19 attribute va... ver más
Revista: Applied Sciences