Applied System Innovation / Vol. 4, No. 1 (2021)
ARTICLE
TITLE

A Comparative Analysis of Active Learning for Biomedical Text Mining

Usman Naseem    
Matloob Khushi    
Shah Khalid Khan    
Kamran Shaukat and Mohammad Ali Moni    

Abstract

An enormous amount of clinical free-text information, such as pathology reports, progress reports, clinical notes and discharge summaries, has been collected at hospitals and medical care clinics. These data provide an opportunity to develop many useful machine learning (ML) applications if they can be converted into a learnable structure with appropriate labels for supervised learning. The annotation of these data has to be performed by qualified clinical experts, and the high cost of annotation limits their use. Active learning (AL), an underutilised ML technique that can label new data, is a promising candidate to address the high cost of labelling. AL has been successfully applied to labelling for speech recognition and text classification; however, there is a lack of literature investigating its use for clinical purposes. We performed a comparative investigation of various AL techniques using ML- and deep learning (DL)-based strategies on three unique biomedical datasets. We investigated the random sampling (RS), least confidence (LC), informative diversity and density (IDD), margin, and maximum representativeness-diversity (MRD) AL query strategies. Our experiments show that AL has the potential to significantly reduce the cost of manual labelling. Furthermore, pre-labelling performed using AL expedites the labelling process by reducing the time required for labelling.
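As a rough illustration of the three simpler query strategies named in the abstract (RS, LC and margin), the following is a minimal sketch, not the authors' implementation. It assumes a classifier has already produced class probabilities for each unlabelled document; the function names, the `pool` values and the `k` parameter are hypothetical. IDD and MRD are omitted because the abstract does not define them.

```python
import random


def least_confidence(probs):
    """Uncertainty score: 1 minus the probability of the most likely class."""
    return 1.0 - max(probs)


def margin(probs):
    """Gap between the two most likely classes; a small gap means high uncertainty."""
    top, second = sorted(probs, reverse=True)[:2]
    return top - second


def select_queries(pool_probs, k, strategy="lc", seed=0):
    """Return the indices of k unlabelled items to send to the annotator."""
    indices = list(range(len(pool_probs)))
    if strategy == "rs":
        # Random sampling: the baseline that ignores the model entirely.
        return sorted(random.Random(seed).sample(indices, k))
    if strategy == "lc":
        # Least confidence: highest uncertainty score first.
        ranked = sorted(indices, key=lambda i: least_confidence(pool_probs[i]), reverse=True)
        return ranked[:k]
    if strategy == "margin":
        # Margin: smallest top-two gap first.
        ranked = sorted(indices, key=lambda i: margin(pool_probs[i]))
        return ranked[:k]
    raise ValueError(f"unknown strategy: {strategy}")


# Hypothetical predicted class probabilities for four unlabelled documents.
pool = [
    [0.90, 0.05, 0.05],
    [0.40, 0.35, 0.25],
    [0.50, 0.49, 0.01],
    [0.70, 0.20, 0.10],
]

print(select_queries(pool, 2, "lc"))      # [1, 2]
print(select_queries(pool, 2, "margin"))  # [2, 1]
```

The two uncertainty strategies pick the same two documents here but rank them differently: LC looks only at the top class, while margin looks at how close the runner-up is.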

Similar articles

George Westergaard, Utku Erden, Omar Abdallah Mateo, Sullaiman Musah Lampo, Tahir Cetin Akinci and Oguzhan Topsakal    
Automated Machine Learning (AutoML) tools are revolutionizing the field of machine learning by significantly reducing the need for deep computer science expertise. Designed to make ML more accessible, they enable users to build high-performing models wit...
Journal: Information

 
Hamed Taherdoost and Mitra Madanchian    
Blockchain technology has become a powerful disruptive force that upends established ideas in several industries. A fascinating point of convergence is that of blockchain technology and Business Process Management (BPM), where the distributed and immutab...
Journal: Information

 
Marcin Klosok, Daria Gendosz de Carrillo, Piotr Laszczyca, Tomasz Plociniczak, Halina Jedrzejowska-Szypulka and Tomasz Sawczyn    
Journal: Applied Sciences

 
Siarhei Autsou, Karolina Kudelina, Toomas Vaimann, Anton Rassõlkin and Ants Kallaste    
Servomotors have found widespread application in many areas, such as manufacturing, robotics, automation, and others. Thus, the control of servomotors is divided into various principles and methods, leading to a high diversity of control systems. This ar...
Journal: Applied Sciences

 
Carolina Bona-Sánchez, Heidi Salokangas and Kaisa Sorsa    
This study explores the complexities of cost behavior in the textile industry, conducting a comparative analysis between firms in the Nordic countries and Spain. Our main goal is to examine how distinct economic and corporate governance models impact the...
Journal: Applied Sciences