Applied Sciences, Vol. 12, No. 2 (2022)

Different Scales of Medical Data Classification Based on Machine Learning Techniques: A Comparative Study

Heba Aly Elzeheiry, Sherief Barakat and Amira Rezk

Abstract

In recent years, the volume of medical data has grown enormously owing to the continuous generation of digital data. The different forms of medical data, such as reports and textual, numerical, monitoring, and laboratory data, make up so-called medical big data. This paper aims to find the algorithm that predicts new medical data with the highest accuracy, since good prediction accuracy is essential in medical fields. To achieve the study's goal, the most accurate algorithm and the algorithm with the shortest processing time are identified through an experimental comparison of seven algorithms: Naïve Bayes, linear model, regression, decision tree, random forest, gradient boosted tree, and J48. The experiments make it possible to predict new medical big data with the algorithm that offers the best accuracy and processing time. We find that the most accurate classification algorithm is random forest (RF), with accuracy values of 97.58%, 83.59%, and 90% for the heart disease, M-health, and diabetes datasets, respectively. Naïve Bayes has the lowest processing time, with values of 0.078, 7.683, and 22.374 s for the heart disease, M-health, and diabetes datasets, respectively. In addition, the best result of the experiment is obtained by combining the CFS feature selection algorithm with the random forest classification algorithm. Applying RF combined with CFS to the heart disease dataset gives an accuracy of 90%, a precision of 83.3%, a sensitivity of 100%, and a processing time of 3 s. On the M-health dataset, the same combination gives an accuracy of 83.59%, a precision of 74.3%, a sensitivity of 93.1%, and a processing time of 13.481 s. On the diabetes dataset, it gives an accuracy of 97.58%, a precision of 86.39%, a sensitivity of 97.14%, and a processing time of 56.508 s.
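The abstract reports accuracy, precision, and sensitivity without defining them. As a minimal sketch, assuming the standard binary-classification definitions, the three metrics can be computed from confusion-matrix counts as shown below. The `ConfusionCounts` class and the counts themselves are hypothetical: they were chosen only so that the metrics land near 90%, 83.3%, and 100%, and they are not the study's actual confusion matrix.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts (hypothetical helper, not from the paper)."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives
    tn: int  # true negatives


def _ratio(numerator: int, denominator: int) -> float:
    # Return 0.0 instead of raising when a denominator is empty.
    return numerator / denominator if denominator else 0.0


def accuracy(c: ConfusionCounts) -> float:
    # Share of all samples that were classified correctly.
    return _ratio(c.tp + c.tn, c.tp + c.fp + c.fn + c.tn)


def precision(c: ConfusionCounts) -> float:
    # Share of predicted positives that are truly positive.
    return _ratio(c.tp, c.tp + c.fp)


def sensitivity(c: ConfusionCounts) -> float:
    # Share of actual positives that were detected (also called recall).
    return _ratio(c.tp, c.tp + c.fn)


# Illustrative counts only (assumption): 10 samples, 1 false positive.
counts = ConfusionCounts(tp=5, fp=1, fn=0, tn=4)

if __name__ == "__main__":
    print(f"accuracy:    {accuracy(counts):.1%}")
    print(f"precision:   {precision(counts):.1%}")
    print(f"sensitivity: {sensitivity(counts):.1%}")
```

With these definitions, a sensitivity of 100% together with a lower precision means the classifier missed no positive cases but raised some false alarms.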
