JOURNAL
Applied Sciences

ARTICLE

TITLE

Improving Software Defect Prediction in Noisy Imbalanced Datasets

Haoxiang Shi, Jun Ai, Jingyu Liu and Jiaxi Xu

Abstract

Software defect prediction is a popular method for optimizing software testing and improving software quality and reliability. However, software defect datasets usually have quality problems, such as class imbalance and data noise. Oversampling, which generates minority-class samples, is one of the best-known methods for improving dataset quality; however, it often introduces overfitting noise into the datasets. To improve the quality of these datasets further, this paper proposes a method called US-PONR, which uses undersampling to remove duplicate samples arising from version iterations and then uses oversampling through propensity score matching to reduce class imbalance and noise samples in the datasets. The effectiveness of this method was validated in a software defect prediction experiment involving 24 versions of software data from 11 PROMISE projects, in noisy environments with noise levels ranging from 0% to 30%. The experiments showed that, compared with 12 other advanced dataset processing methods, US-PONR pre-processing significantly improved the quality of noisy imbalanced datasets, especially the noisiest ones. The experiments also demonstrated that US-PONR can effectively identify and remove label noise samples.
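
The abstract does not give implementation details, so the following is only a minimal sketch of two ideas it mentions: dropping samples duplicated across version iterations, and building a noisy evaluation setting by flipping a fixed fraction of labels. It is not the authors' US-PONR implementation, and it does not cover the propensity score matching step. The function names, the toy feature vectors, and the choice to treat a sample as a `(features, label)` pair are all assumptions made for illustration.

```python
import random


def drop_cross_version_duplicates(versions):
    """Keep the first occurrence of each (features, label) pair.

    `versions` is a list of sample lists in release order (hypothetical layout).
    Samples with identical features but different labels are kept as distinct.
    """
    seen = set()
    kept = []
    for samples in versions:
        for features, label in samples:
            key = (tuple(features), label)
            if key not in seen:
                seen.add(key)
                kept.append(key)
    return kept


def inject_label_noise(samples, noise_level, seed=0):
    """Flip the binary label (0 or 1) of a fixed fraction of the samples."""
    rng = random.Random(seed)
    n_flip = round(noise_level * len(samples))
    flip_idx = set(rng.sample(range(len(samples)), n_flip))
    return [
        (features, 1 - label if i in flip_idx else label)
        for i, (features, label) in enumerate(samples)
    ]


# Toy data: two versions of one project; two samples carry over unchanged.
v1 = [([10, 2], 0), ([35, 7], 1), ([12, 1], 0)]
v2 = [([10, 2], 0), ([35, 7], 1), ([40, 9], 1), ([8, 1], 0)]

clean = drop_cross_version_duplicates([v1, v2])
noisy = inject_label_noise(clean, noise_level=0.2)
```

With the toy data above, the seven samples from the two versions collapse to five unique ones, and a 20% noise level flips exactly one of those five labels. Fixing the seed keeps a noise setting reproducible across runs.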

Keywords

software defect prediction - class imbalance - undersampling - propensity score matching - oversampling - noise reduction

ISSUE

Volume: 13, Issue: 18 (2023)

SUBJECTS

CIVIL ENGINEERING AND CONSTRUCTION
TECHNOLOGY

DOI

https://doi.org/10.3390/app131810466
