Cargando…

A benchmark of batch-effect correction methods for single-cell RNA sequencing data

BACKGROUND: Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integrati...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tran, Hoa Thi Nhu, Ang, Kok Siong, Chevrier, Marion, Zhang, Xiaomeng, Lee, Nicole Yee Shin, Goh, Michelle, Chen, Jinmiao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2020
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6964114/ https://www.ncbi.nlm.nih.gov/pubmed/31948481 http://dx.doi.org/10.1186/s13059-019-1850-9

Descripción
Sumario:	BACKGROUND: Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal. RESULTS: We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression. CONCLUSION: Based on our results, Harmony, LIGER, and Seurat 3 are the recommended methods for batch integration. Due to its significantly shorter runtime, Harmony is recommended as the first method to try, with the other methods as viable alternatives.

A benchmark of batch-effect correction methods for single-cell RNA sequencing data

Ejemplares similares