Inicio  /  Algorithms  /  Vol: 14 Par: 8 (2021)  /  Artículo
ARTÍCULO
TITULO

Computational Complexity and ILP Models for Pattern Problems in the Logical Analysis of Data

Giuseppe Lancia and Paolo Serafini    

Resumen

Logical Analysis of Data is a procedure aimed at identifying relevant features in data sets with both positive and negative samples. The goal is to build Boolean formulas, represented by strings over {0,1,-} called patterns, which can be used to classify new samples as positive or negative. Since a data set can be explained in alternative ways, many computational problems arise related to the choice of a particular set of patterns. In this paper we study the computational complexity of several of these pattern problems (showing that they are, in general, computationally hard) and we propose some integer programming models that appear to be effective. We describe an ILP model for finding the minimum-size set of patterns explaining a given set of samples and another one for the problem of determining whether two sets of patterns are equivalent, i.e., they explain exactly the same samples. We base our first model on a polynomial procedure that computes all patterns compatible with a given set of samples. Computational experiments substantiate the effectiveness of our models on fairly large instances. Finally, we conjecture that the existence of an effective ILP model for finding a minimum-size set of patterns equivalent to a given set of patterns is unlikely, due to the problem being NP-hard and co-NP-hard at the same time.

 Artículos similares

       
 
George Tzoumakis, Konstantinos Fotopoulos and George Lampeas    
Future liquid hydrogen-powered aircraft requires the design and optimization of a large number of systems and subsystems, with cryogenic tanks being one of the largest and most critical. Considering previous space applications, these tanks are usually st... ver más
Revista: Aerospace

 
Yu Chen, Jianwan Ding, Yu Chen and Dong Yan    
The introduction of a dynamic model in robot trajectory tracking control design can significantly improve its trajectory tracking accuracy, but there are many uncertainties in the robot dynamic model which can be dealt with through robust control and ada... ver más
Revista: Applied Sciences

 
Radoslaw Piotr Katarzyniak, Grzegorz Popek and Marcin Zurawski    
This article presents a model of an architecture of an artificial cognitive agent that performs the function of generating autoepistemic membership statements used to communicate beliefs about the belonging of an observed external object to a category wi... ver más
Revista: Applied Sciences

 
Hannes Zöschg    
Trash racks installed at hydropower plants cause head losses that reduce energy output. Previous research has thoroughly investigated head losses through both experimental and field studies. However, only a limited number of numerical studies have been p... ver más
Revista: Water

 
Falah Amer Abdulazeez, Ismail Taha Ahmed and Baraa Tareq Hammad    
A significant quantity of malware is created on purpose every day. Users of smartphones and computer networks now mostly worry about malware. These days, malware detection is a major concern in the cybersecurity area. Several factors can impact malware d... ver más
Revista: Applied Sciences