Inicio  /  Applied Sciences  /  Vol: 12 Par: 20 (2022)  /  Artículo
ARTÍCULO
TITULO

Imputation Methods for scRNA Sequencing Data

Mengyuan Wang    
Jiatao Gan    
Changfeng Han    
Yanbing Guo    
Kaihao Chen    
Ya-zhou Shi and Ben-gong Zhang    

Resumen

More and more researchers use single-cell RNA sequencing (scRNA-seq) technology to characterize the transcriptional map at the single-cell level. They use it to study the heterogeneity of complex tissues, transcriptome dynamics, and the diversity of unknown organisms. However, there are generally lots of technical and biological noises in the scRNA-seq data since the randomness of gene expression patterns. These data are often characterized by high-dimension, sparsity, large number of ?dropout? values, and affected by batch effects. A large number of ?dropout? values in scRNA-seq data seriously conceal the important relationship between genes and hinder the downstream analysis. Therefore, the imputation of dropout values of scRNA-seq data is particularly important. We classify, analyze and compare the current advanced scRNA-seq data imputation methods from different angles. Through the comparison and analysis of the principle, advantages and disadvantages of the algorithm, it can provide suggestions for the selection of imputation methods for specific problems and diverse data, and have basic research significance for the downstream function analysis of data.

 Artículos similares

       
 
Xinxi Lu, Lijuan Yuan, Ruifeng Li, Zhihuan Xing, Ning Yao and Yichun Yu    
In recent years, the development of computer technology has promoted the informatization and intelligentization of hospital management systems and thus produced a large amount of medical data. These medical data are valuable resources for research. We ca... ver más
Revista: Algorithms

 
Hsin-Yu Chen, Zoran Vojinovic, Weicheng Lo and Jhe-Wei Lee    
The development of civilization and the preservation of environmental ecosystems are strongly dependent on water resources. Typically, an insufficient supply of surface water resources for domestic, industrial, and agricultural needs is supplemented with... ver más
Revista: Water

 
Saul G. Ramirez, Gustavious Paul Williams, Norman L. Jones, Daniel P. Ames and Jani Radebaugh    
Obtaining and managing groundwater data is difficult as it is common for time series datasets representing groundwater levels at wells to have large gaps of missing data. To address this issue, many methods have been developed to infill or impute the mis... ver más
Revista: Water

 
Yufan Qian, Limei Tian, Baichen Zhai, Shufan Zhang and Rui Wu    
Missing observations in time series will distort the data characteristics, change the dataset expectations, high-order distances, and other statistics, and increase the difficulty of data analysis. Therefore, data imputation needs to be performed first. ... ver más
Revista: Algorithms

 
Reza Shahbazian and Irina Trubitsyna    
Insights and analysis are only as good as the available data. Data cleaning is one of the most important steps to create quality data decision making. Machine learning (ML) helps deal with data quickly, and to create error-free or limited-error datasets.... ver más
Revista: Information