Cargando…

Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data

High-throughput single-cell RNA sequencing (scRNA-seq) is a popular method, but it is accompanied by doublet rate problems that disturb the downstream analysis. Several computational approaches have been developed to detect doublets. However, most of these methods may yield satisfactory performance...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Ke-Xu, Zhou, Han-Lin, Lin, Cong, Yin, Jian-Hua, Kristiansen, Karsten, Yang, Huan-Ming, Li, Gui-Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9151659/
https://www.ncbi.nlm.nih.gov/pubmed/35637301
http://dx.doi.org/10.1038/s42003-022-03476-9
_version_ 1784717519316058112
author Xiong, Ke-Xu
Zhou, Han-Lin
Lin, Cong
Yin, Jian-Hua
Kristiansen, Karsten
Yang, Huan-Ming
Li, Gui-Bo
author_facet Xiong, Ke-Xu
Zhou, Han-Lin
Lin, Cong
Yin, Jian-Hua
Kristiansen, Karsten
Yang, Huan-Ming
Li, Gui-Bo
author_sort Xiong, Ke-Xu
collection PubMed
description High-throughput single-cell RNA sequencing (scRNA-seq) is a popular method, but it is accompanied by doublet rate problems that disturb the downstream analysis. Several computational approaches have been developed to detect doublets. However, most of these methods may yield satisfactory performance in some datasets but lack stability in others; thus, it is difficult to regard a single method as the gold standard which can be applied to all types of scenarios. It is a difficult and time-consuming task for researchers to choose the most appropriate software. We here propose Chord which implements a machine learning algorithm that integrates multiple doublet detection methods to address these issues. Chord had higher accuracy and stability than the individual approaches on different datasets containing real and synthetic data. Moreover, Chord was designed with a modular architecture port, which has high flexibility and adaptability to the incorporation of any new tools. Chord is a general solution to the doublet detection problem.
format Online
Article
Text
id pubmed-9151659
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91516592022-06-01 Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data Xiong, Ke-Xu Zhou, Han-Lin Lin, Cong Yin, Jian-Hua Kristiansen, Karsten Yang, Huan-Ming Li, Gui-Bo Commun Biol Article High-throughput single-cell RNA sequencing (scRNA-seq) is a popular method, but it is accompanied by doublet rate problems that disturb the downstream analysis. Several computational approaches have been developed to detect doublets. However, most of these methods may yield satisfactory performance in some datasets but lack stability in others; thus, it is difficult to regard a single method as the gold standard which can be applied to all types of scenarios. It is a difficult and time-consuming task for researchers to choose the most appropriate software. We here propose Chord which implements a machine learning algorithm that integrates multiple doublet detection methods to address these issues. Chord had higher accuracy and stability than the individual approaches on different datasets containing real and synthetic data. Moreover, Chord was designed with a modular architecture port, which has high flexibility and adaptability to the incorporation of any new tools. Chord is a general solution to the doublet detection problem. Nature Publishing Group UK 2022-05-30 /pmc/articles/PMC9151659/ /pubmed/35637301 http://dx.doi.org/10.1038/s42003-022-03476-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Xiong, Ke-Xu
Zhou, Han-Lin
Lin, Cong
Yin, Jian-Hua
Kristiansen, Karsten
Yang, Huan-Ming
Li, Gui-Bo
Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
title Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
title_full Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
title_fullStr Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
title_full_unstemmed Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
title_short Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
title_sort chord: an ensemble machine learning algorithm to identify doublets in single-cell rna sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9151659/
https://www.ncbi.nlm.nih.gov/pubmed/35637301
http://dx.doi.org/10.1038/s42003-022-03476-9
work_keys_str_mv AT xiongkexu chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata
AT zhouhanlin chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata
AT lincong chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata
AT yinjianhua chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata
AT kristiansenkarsten chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata
AT yanghuanming chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata
AT liguibo chordanensemblemachinelearningalgorithmtoidentifydoubletsinsinglecellrnasequencingdata