Information, Vol. 15, No. 1 (2024)
ARTICLE
TITLE

Data Augmentation with Cross-Modal Variational Autoencoders (DACMVA) for Cancer Survival Prediction

Sara Rajaram and Cassie S. Mitchell    

Abstract

The ability to translate Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) into different modalities and data types is essential to improve Deep Learning (DL) for predictive medicine. This work presents DACMVA, a novel framework to conduct data augmentation in a cross-modal dataset by translating between modalities and oversampling imputations of missing data. DACMVA was inspired by previous work on the alignment of latent spaces in Autoencoders. DACMVA is a DL data augmentation pipeline that improves the performance in a downstream prediction task. The unique DACMVA framework leverages a cross-modal loss to improve the imputation quality and employs training strategies to enable regularized latent spaces. Oversampling of augmented data is integrated into the prediction training. It is empirically demonstrated that the new DACMVA framework is effective in the often-neglected scenario of DL training on tabular data with continuous labels. Specifically, DACMVA is applied towards cancer survival prediction on tabular gene expression data where there is a portion of missing data in a given modality. DACMVA significantly (p << 0.001, one-sided Wilcoxon signed-rank test) outperformed the non-augmented baseline and competing augmentation methods with varying percentages of missing data (4%, 90%, 95% missing). As such, DACMVA provides significant performance improvements, even in very-low-data regimes, over existing state-of-the-art methods, including TDImpute and oversampling alone.
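The abstract names a cross-modal loss and regularized latent spaces but does not give the objective. As a rough illustration only, the sketch below combines three generic terms often used in cross-modal VAEs: a within-modality reconstruction error, a cross-modal translation error, and a KL regularizer on the latent code. The function names, the use of mean squared error, and the weights `beta` and `gamma` are assumptions made for this example and are not taken from the DACMVA paper.

```python
import numpy as np


def kl_to_standard_normal(mu, logvar):
    """KL(N(mu, diag(exp(logvar))) || N(0, I)), averaged over the batch."""
    per_sample = -0.5 * np.sum(1.0 + logvar - mu ** 2 - np.exp(logvar), axis=1)
    return float(np.mean(per_sample))


def mse(pred, target):
    """Mean squared error over all entries."""
    return float(np.mean((pred - target) ** 2))


def cross_modal_vae_loss(x_a, x_b, recon_a, trans_b, mu_a, logvar_a,
                         beta=1.0, gamma=1.0):
    """Hypothetical objective: reconstruction + gamma * cross-modal + beta * KL.

    x_a, x_b  : paired samples from modality A and modality B
    recon_a   : decoder-A output from the latent code of x_a
    trans_b   : decoder-B output from the same latent code (A -> B translation)
    mu_a, logvar_a : encoder-A posterior parameters
    """
    recon = mse(recon_a, x_a)          # within-modality reconstruction
    cross = mse(trans_b, x_b)          # cross-modal translation error
    kl = kl_to_standard_normal(mu_a, logvar_a)  # latent regularizer
    return recon + gamma * cross + beta * kl


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x_a = rng.normal(size=(8, 20))     # synthetic stand-in for modality A
    x_b = rng.normal(size=(8, 30))     # synthetic stand-in for modality B
    mu = rng.normal(size=(8, 4))
    logvar = rng.normal(size=(8, 4))
    # Noisy copies stand in for decoder outputs; no model is trained here.
    recon_a = x_a + 0.1 * rng.normal(size=x_a.shape)
    trans_b = x_b + 0.1 * rng.normal(size=x_b.shape)
    print(cross_modal_vae_loss(x_a, x_b, recon_a, trans_b, mu, logvar))
```

In a setup like this, the translation term is what would let a trained model impute a missing modality B from an observed modality A, and the imputed samples could then be oversampled during training of the downstream predictor.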

Similar articles

       
 
Wenhao Sun, Yidong Zou, Yunhe Wang, Boyi Xiao, Haichuan Zhang and Zhihuai Xiao    
In the practical production environment, the complexity and variability of hydroelectric units often result in a need for more fault data, leading to inadequate accuracy in fault identification for data-driven intelligent diagnostic models. To address th...
Journal: Water

 
François Legrand, Richard Macwan, Alain Lalande, Lisa Métairie and Thomas Decourselle    
Automated Cardiac Magnetic Resonance segmentation serves as a crucial tool for the evaluation of cardiac function, facilitating faster clinical assessments that prove advantageous for both practitioners and patients alike. Recent studies have predominant...
Journal: Algorithms

 
Fabi Prezja, Leevi Annala, Sampsa Kiiskinen and Timo Ojala    
Diagnosing knee joint osteoarthritis (KOA), a major cause of disability worldwide, is challenging due to subtle radiographic indicators and the varied progression of the disease. Using deep learning for KOA diagnosis requires broad, comprehensive dataset...
Journal: Algorithms

 
Daniel Rusche, Nils Englert, Marlen Runz, Svetlana Hetjens, Cord Langner, Timo Gaiser and Cleo-Aron Weis    
Background: In this study focusing on colorectal carcinoma (CRC), we address the imperative task of predicting post-surgery treatment needs by identifying crucial tumor features within whole slide images of solid tumors, analogous to locating a needle in...
Journal: Applied Sciences

 
Songpu Li, Xinran Yu and Peng Chen    
Model robustness is an important index in medical cybersecurity, and hard-negative samples in electronic medical records can provide more gradient information, which can effectively improve the robustness of a model. However, hard negatives pose difficul...
Journal: Applied Sciences