Redirigiendo al acceso original de articulo en 24 segundos...
Inicio  /  Algorithms  /  Vol: 17 Par: 3 (2024)  /  Artículo
ARTÍCULO
TITULO

Exploratory Data Analysis and Searching Cliques in Graphs

András Hubai    
Sándor Szabó and Bogdán Zaválnij    

Resumen

The principal component analysis is a well-known and widely used technique to determine the essential dimension of a data set. Broadly speaking, it aims to find a low-dimensional linear manifold that retains a large part of the information contained in the original data set. It may be the case that one cannot approximate the entirety of the original data set using a single low-dimensional linear manifold even though large subsets of it are amenable to such approximations. For these cases we raise the related but different challenge (problem) of locating subsets of a high dimensional data set that are approximately 1-dimensional. Naturally, we are interested in the largest of such subsets. We propose a method for finding these 1-dimensional manifolds by finding cliques in a purpose-built auxiliary graph.

 Artículos similares

       
 
Sheng He, Geng Niu, Xuefeng Sang, Xiaozhong Sun, Junxian Yin and Heting Chen    
Accurate and reliable discharge estimation plays an important role in water resource management as well as downstream applications such as ecosystem conservation and flood control. Recently, data-driven machine learning (ML) techniques showed seemingly i... ver más
Revista: Water

 
Fatma Yaprakdal and Merve Varol Arisoy    
In the smart grid paradigm, precise electrical load forecasting (ELF) offers significant advantages for enhancing grid reliability and informing energy planning decisions. Specifically, mid-term ELF is a key priority for power system planning and operati... ver más
Revista: Applied Sciences

 
Surasit Songma, Theera Sathuphan and Thanakorn Pamutha    
This article examines intrusion detection systems in depth using the CSE-CIC-IDS-2018 dataset. The investigation is divided into three stages: to begin, data cleaning, exploratory data analysis, and data normalization procedures (min-max and Z-score) are... ver más
Revista: Computers

 
Sarah Benjelloun, Mohamed El Mehdi El Aissi, Younes Lakhrissi and Safae El Haj Ben Ali    
Thanks to continuously evolving data management solutions, data-driven strategies are considered the main success factor in many domains. These strategies consider data as the backbone, allowing advanced data analytics. However, in the agricultural field... ver más

 
Dominik Molitor, Wullianallur Raghupathi, Aditya Saharia and Viju Raghupathi    
While data breaches are a frequent and universal phenomenon, the characteristics and dimensions of data breaches are unexplored. In this novel exploratory research, we apply machine learning (ML) and text analytics to a comprehensive collection of data b... ver más
Revista: Information