Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Applied Sciences  /  Vol: 12 Par: 6 (2022)  /  Artículo
ARTÍCULO
TITULO

Using Feature Selection with Machine Learning for Generation of Insurance Insights

Ayman Taha    
Bernard Cosgrave and Susan Mckeever    

Resumen

Insurance is a data-rich sector, hosting large volumes of customer data that is analysed to evaluate risk. Machine learning techniques are increasingly used in the effective management of insurance risk. Insurance datasets by their nature, however, are often of poor quality with noisy subsets of data (or features). Choosing the right features of data is a significant pre-processing step in the creation of machine learning models. The inclusion of irrelevant and redundant features has been demonstrated to affect the performance of learning models. In this article, we propose a framework for improving predictive machine learning techniques in the insurance sector via the selection of relevant features. The experimental results, based on five publicly available real insurance datasets, show the importance of applying feature selection for the removal of noisy features before performing machine learning techniques, to allow the algorithm to focus on influential features. An additional business benefit is the revelation of the most and least important features in the datasets. These insights can prove useful for decision making and strategy development in areas/business problems that are not limited to the direct target of the downstream algorithms. In our experiments, machine learning techniques based on a set of selected features suggested by feature selection algorithms outperformed the full feature set for a set of real insurance datasets. Specifically, 20% and 50% of features in our five datasets had improved downstream clustering and classification performance when compared to whole datasets. This indicates the potential for feature selection in the insurance sector to both improve model performance and to highlight influential features for business insights.

 Artículos similares

       
 
Mohammad Shokouhifar, Mohamad Hasanvand, Elaheh Moharamkhani and Frank Werner    
Heart disease is a global health concern of paramount importance, causing a significant number of fatalities and disabilities. Precise and timely diagnosis of heart disease is pivotal in preventing adverse outcomes and improving patient well-being, there... ver más
Revista: Algorithms

 
Min-Joon Kim and Thi-Thu-Huong Le    
This study delves into the intricate relationship between fluctuations in the real exchange rate and the trade balance, situated within the framework of a ?two-country? trade theory model. Despite a wealth of prior research on the impact of exchange rate... ver más
Revista: Information

 
Seokjoon Kwon, Jae-Hyeon Park, Hee-Deok Jang, Hyunwoo Nam and Dong Eui Chang    
Deep learning algorithms are widely used for pattern recognition in electronic noses, which are sensor arrays for gas mixtures. One of the challenges of using electronic noses is sensor drift, which can degrade the accuracy of the system over time, even ... ver más
Revista: Applied Sciences

 
Ziqi Liu, Shogo Okamoto, Tomohito Kuroda and Yasuhiro Akiyama    
Gait stability indices are crucial for identifying individuals at risk of falling while walking. The margin of stability is one such index, known for its good construct validity. Generally, the measurement of this stability index requires a motion captur... ver más
Revista: Applied Sciences

 
Omar Abdulkhaleq Aldabash and Mehmet Fatih Akay    
An IDS (Intrusion Detection System) is essential for network security experts, as it allows one to identify and respond to abnormal traffic present in a network. An IDS can be utilized for evaluating the various types of malicious attacks. Hence, detecti... ver más
Revista: Applied Sciences