Education, Science, Technology, Innovation and Life
Open Access
Sign In

A Review of scRNA-seq Imputation Methods

Download as PDF

DOI: 10.23977/acss.2025.090112 | Downloads: 16 | Views: 759

Author(s)

Zhiqiang Zhang 1

Affiliation(s)

1 School of Information, Yunnan Normal, University, Kunming, China

Corresponding Author

Zhiqiang Zhang

ABSTRACT

Single-cell RNA sequencing (scRNA-seq) has emerged as a powerful tool for profiling gene expression at the individual cell level, enabling the discovery of cellular heterogeneity that traditional bulk RNA sequencing cannot capture. However, technical limitations such as low transcript capture efficiency, amplification biases, and limited sequencing depth have led to pervasive dropout events, where true gene expression is obscured by excessive zero counts. This review systematically examines and compares the principal imputation methods developed to address these challenges in scRNA-seq data analysis. We categorize these approaches into two broad groups: model-based methods and deep learning methods. Model-based techniques utilize probabilistic models or matrix factorization to exploit similarities among cells and genes—either independently or in combination—to predict and restore missing values. In contrast, deep learning methods leverage the capabilities of autoencoders, graph neural networks, and other innovative network architectures, including generative adversarial networks, to capture complex nonlinear relationships within high-dimensional, noisy data. While model-based approaches offer greater interpretability through explicit statistical assumptions, they are often limited by their sensitivity to noise and data sparsity. Deep learning strategies, although computationally intensive and less interpretable, excel in recovering intricate data structures in large-scale datasets. By providing a comprehensive overview of these imputation strategies, this review aims to guide researchers in selecting the most appropriate methods for their specific datasets and downstream analyses, and to suggest future directions for improving imputation accuracy and integrating multi-omics data.

KEYWORDS

Single-cell RNA Sequencing (scRNA-seq); Imputation; Dropout; Model-based Methods; Deep Learning; Autoencoder; Graph Neural Network; Cellular Heterogeneity; Gene Expression; Data Recovery

CITE THIS PAPER

Zhiqiang Zhang, A Review of scRNA-seq Imputation Methods. Advances in Computer, Signals and Systems (2025) Vol. 9: 79-83. DOI: http://dx.doi.org/10.23977/acss.2025.090112.

REFERENCES

[1] Stevenson K, Uversky V N. Single-cell RNA-Seq: a next generation sequencing tool for a high-resolution view of the individual cell[J]. Journal of Biomolecular Structure and Dynamics, 2020, 38(12): 3730-3735.
[2] Islam S, Zeisel A, Joost S, et al. Quantitative single-cell RNA-seq with unique molecular identifiers[J]. Nature methods, 2014, 11(2): 163-166. 
[3] Cheng Y, Ma X, Yuan L, et al. Evaluating imputation methods for single-cell RNA-seq data[J]. BMC bioinformatics, 2023, 24(1): 302. 
[4] Dai C, Jiang Y, Yin C, et al. scIMC: a platform for benchmarking comparison and visualization analysis of scRNA-seq data imputation methods[J]. Nucleic Acids Research, 2022, 50(9): 4877-4899. 
[5] Van Dijk D, Sharma R, Nainys J, et al. Recovering gene interactions from single-cell data using data diffusion[J]. Cell, 2018, 174(3): 716-729. e27. 
[6] Huang M, Wang J, Torre E, et al. SAVER: gene expression recovery for single-cell RNA sequencing[J]. Nature methods, 2018, 15(7): 539-542. 
[7] Ye W, Ji G, Ye P, et al. scNPF: an integrative framework assisted by network propagation and network fusion for preprocessing of single-cell RNA-seq data[J]. Bmc Genomics, 2019, 20: 1-16. 
[8] Elyanow R, Dumitrascu B, Engelhardt B E, et al. netNMF-sc: leveraging gene–gene interactions for imputation and dimensionality reduction in single-cell expression analysis[J]. Genome research, 2020, 30(2): 195-204. 
[9] Li W V, Li J J. An accurate and robust imputation method scImpute for single-cell RNA-seq data[J]. Nature communications, 2018, 9(1): 997. 
[10] Gong W, Kwak I Y, Pota P, et al. DrImpute: imputing dropout events in single cell RNA sequencing data[J]. BMC bioinformatics, 2018, 19: 1-10.
[11] Chen M, Zhou X. VIPER: variability-preserving imputation for accurate gene expression recovery in single-cell RNA sequencing studies[J]. Genome biology, 2018, 19(1): 196.
[12] Jin K, Ou-Yang L, Zhao X M, et al. scTSSR: gene expression recovery for single-cell RNA sequencing using two-side sparse self-representation[J]. Bioinformatics, 2020, 36(10): 3131-3138.
[13] Peng T, Zhu Q, Yin P, et al. SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data[J]. Genome biology, 2019, 20: 1-12. 
[14] Qi J, Zhou Y, Zhao Z, et al. SDImpute: a statistical block imputation method based on cell-level and gene-level information for dropouts in single-cell RNA-seq data[J]. PLoS Computational Biology, 2021, 17(6): e1009118.
[15] Li W V, Li J J. An accurate and robust imputation method scImpute for single-cell RNA-seq data[J]. Nature communications, 2018, 9(1): 997. 
[16] Eraslan G, Simon L M, Mircea M, et al. Single-cell RNA-seq denoising using a deep count autoencoder[J]. Nature communications, 2019, 10(1): 390. 
[17] Lopez R, Regier J, Cole M B, et al. Deep generative modeling for single-cell transcriptomics[J]. Nature methods, 2018, 15(12): 1053-1058. 
[18] Arisdakessian C, Poirion O, Yunits B, et al. DeepImpute: an accurate, fast, and scalable deep neural network method to impute single-cell RNA-seq data[J]. Genome biology, 2019, 20: 1-14.
[19] Rao J, Zhou X, Lu Y, et al. Imputing single-cell rna-seq data by combining graph convolution and autoencoder neural networks. iScience[J]. 2021.
[20] Wang J, Ma A, Chang Y, et al. scGNN is a novel graph neural network framework for single-cell RNA-Seq analyses[J]. Nature communications, 2021, 12(1): 1882.
[21] Xu Y, Zhang Z, You L, et al. scIGANs: single-cell RNA-seq imputation using generative adversarial networks[J]. Nucleic acids research, 2020, 48(15): e85-e85.
[22] Zhou X, Chai H, Zhao H, et al. Imputing missing RNA-sequencing data from DNA methylation by using a transfer learning–based neural network[J]. GigaScience, 2020, 9(7): giaa076.
[23] Zhang Z, Liu L. scIDPMs: Single-cell RNA-seq imputation using diffusion probabilistic models[J]. IEEE Journal of Biomedical and Health Informatics, 2024.

Downloads: 38553
Visits: 697955

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.