Inicio  /  Information  /  Vol: 9 Par: 9 (2018)  /  Artículo
ARTÍCULO
TITULO

A Web Page Clustering Method Based on Formal Concept Analysis

Zuping Zhang    
Jing Zhao and Xiping Yan    

Resumen

Web page clustering is an important technology for sorting network resources. By extraction and clustering based on the similarity of the Web page, a large amount of information on a Web page can be organized effectively. In this paper, after describing the extraction of Web feature words, calculation methods for the weighting of feature words are studied deeply. Taking Web pages as objects and Web feature words as attributes, a formal context is constructed for using formal concept analysis. An algorithm for constructing a concept lattice based on cross data links was proposed and was successfully applied. This method can be used to cluster the Web pages using the concept lattice hierarchy. Experimental results indicate that the proposed algorithm is better than previous competitors with regard to time consumption and the clustering effect.

 Artículos similares

       
 
Salim M Zaki     Pág. pp. 4 - 16
The number of devices connected to the Internet using mobile devices is increasing every day. Charge for mobile data over 3G and 4G networks is high in some countries which pushes users to browse the Internet through text-only service. Facebook proposed ... ver más

 
Denys Klochkov and Jan Mulawka    
The evolution of web development and web applications has resulted in creation of numerous tools and frameworks that facilitate the development process. Even though those frameworks make web development faster and more efficient, there are certain downsi... ver más
Revista: Information

 
Elena Guseva, Boris Karetkin, Diana Batyrgazieva, Natalia Menshutina and Victor Panfilov    
The number of studies aimed at proving the prebiotic properties of certain substances or compositions has been actively increasing, which has led to a large accumulation of scientific information that is fragmented and not systematized. Moreover, a numbe... ver más
Revista: Applied Sciences

 
Muhammad Lookman Hossain Khan,Agus Setiawan,Iwan Kustiawan     Pág. 85 - 93
Students now use not only computers or laptops, but also small devices such as mobile phones. The students at higher education institutions spent a lot of time on the internet to find the course materials. Sometimes the teacher sends the materials, but t... ver más
Revista: Invotec

 
Ioan Badarinza,Adrian Sterca,Florian Mircea Boian     Pág. 26 - 33
In this paper we use the user's recent web browsing history in order to provide better query suggestions in an information retrieval system. We have built a Chrome browser plugin that collects each web page visited by a user and submits it to our query s... ver más