Cargando…

BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching

Single-cell RNA-sequencing (scRNA-seq) is a set of technologies used to profile gene expression at the level of individual cells. Although the throughput of scRNA-seq experiments is steadily growing in terms of the number of cells, large datasets are not yet commonly generated owing to prohibitively...

Descripción completa

Detalles Bibliográficos
Autores principales: Mandric, Igor, Hill, Brian L., Freund, Malika K., Thompson, Michael, Halperin, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276436/
https://www.ncbi.nlm.nih.gov/pubmed/32504875
http://dx.doi.org/10.1016/j.isci.2020.101185
_version_ 1783542953697869824
author Mandric, Igor
Hill, Brian L.
Freund, Malika K.
Thompson, Michael
Halperin, Eran
author_facet Mandric, Igor
Hill, Brian L.
Freund, Malika K.
Thompson, Michael
Halperin, Eran
author_sort Mandric, Igor
collection PubMed
description Single-cell RNA-sequencing (scRNA-seq) is a set of technologies used to profile gene expression at the level of individual cells. Although the throughput of scRNA-seq experiments is steadily growing in terms of the number of cells, large datasets are not yet commonly generated owing to prohibitively high costs. Integrating multiple datasets into one can improve power in scRNA-seq experiments, and efficient integration is very important for downstream analyses such as identifying cell-type-specific eQTLs. State-of-the-art scRNA-seq integration methods are based on the mutual nearest neighbor paradigm and fail to both correct for batch effects and maintain the local structure of the datasets. In this paper, we propose a novel scRNA-seq dataset integration method called BATMAN (BATch integration via minimum-weight MAtchiNg). Across multiple simulations and real datasets, we show that our method significantly outperforms state-of-the-art tools with respect to existing metrics for batch effects by up to 80% while retaining cell-to-cell relationships.
format Online
Article
Text
id pubmed-7276436
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-72764362020-06-10 BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching Mandric, Igor Hill, Brian L. Freund, Malika K. Thompson, Michael Halperin, Eran iScience Article Single-cell RNA-sequencing (scRNA-seq) is a set of technologies used to profile gene expression at the level of individual cells. Although the throughput of scRNA-seq experiments is steadily growing in terms of the number of cells, large datasets are not yet commonly generated owing to prohibitively high costs. Integrating multiple datasets into one can improve power in scRNA-seq experiments, and efficient integration is very important for downstream analyses such as identifying cell-type-specific eQTLs. State-of-the-art scRNA-seq integration methods are based on the mutual nearest neighbor paradigm and fail to both correct for batch effects and maintain the local structure of the datasets. In this paper, we propose a novel scRNA-seq dataset integration method called BATMAN (BATch integration via minimum-weight MAtchiNg). Across multiple simulations and real datasets, we show that our method significantly outperforms state-of-the-art tools with respect to existing metrics for batch effects by up to 80% while retaining cell-to-cell relationships. Elsevier 2020-05-20 /pmc/articles/PMC7276436/ /pubmed/32504875 http://dx.doi.org/10.1016/j.isci.2020.101185 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Mandric, Igor
Hill, Brian L.
Freund, Malika K.
Thompson, Michael
Halperin, Eran
BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching
title BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching
title_full BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching
title_fullStr BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching
title_full_unstemmed BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching
title_short BATMAN: Fast and Accurate Integration of Single-Cell RNA-Seq Datasets via Minimum-Weight Matching
title_sort batman: fast and accurate integration of single-cell rna-seq datasets via minimum-weight matching
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276436/
https://www.ncbi.nlm.nih.gov/pubmed/32504875
http://dx.doi.org/10.1016/j.isci.2020.101185
work_keys_str_mv AT mandricigor batmanfastandaccurateintegrationofsinglecellrnaseqdatasetsviaminimumweightmatching
AT hillbrianl batmanfastandaccurateintegrationofsinglecellrnaseqdatasetsviaminimumweightmatching
AT freundmalikak batmanfastandaccurateintegrationofsinglecellrnaseqdatasetsviaminimumweightmatching
AT thompsonmichael batmanfastandaccurateintegrationofsinglecellrnaseqdatasetsviaminimumweightmatching
AT halperineran batmanfastandaccurateintegrationofsinglecellrnaseqdatasetsviaminimumweightmatching