Cargando…

Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)

Single cell RNA-sequencing (scRNA-seq) precisely characterizes gene expression levels and dissects variation in expression associated with the state (technical or biological) and the type of the cell, which is averaged out in bulk measurements. Multiple and correlated sources contribute to gene expr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Donghyung, Cheng, Anthony, Lawlor, Nathan, Bolisetty, Mohan, Ucar, Duygu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6242813/
https://www.ncbi.nlm.nih.gov/pubmed/30451954
http://dx.doi.org/10.1038/s41598-018-35365-9
_version_ 1783371841359839232
author Lee, Donghyung
Cheng, Anthony
Lawlor, Nathan
Bolisetty, Mohan
Ucar, Duygu
author_facet Lee, Donghyung
Cheng, Anthony
Lawlor, Nathan
Bolisetty, Mohan
Ucar, Duygu
author_sort Lee, Donghyung
collection PubMed
description Single cell RNA-sequencing (scRNA-seq) precisely characterizes gene expression levels and dissects variation in expression associated with the state (technical or biological) and the type of the cell, which is averaged out in bulk measurements. Multiple and correlated sources contribute to gene expression variation in single cells, which makes their estimation difficult with the existing methods developed for batch correction (e.g., surrogate variable analysis (SVA)) that estimate orthogonal transformations of these sources. We developed iteratively adjusted surrogate variable analysis (IA-SVA) that can estimate hidden factors even when they are correlated with other sources of variation by identifying a set of genes associated with each hidden factor in an iterative manner. Analysis of scRNA-seq data from human cells showed that IA-SVA could accurately capture hidden variation arising from technical (e.g., stacked doublet cells) or biological sources (e.g., cell type or cell-cycle stage). Furthermore, IA-SVA delivers a set of genes associated with the detected hidden source to be used in downstream data analyses. As a proof of concept, IA-SVA recapitulated known marker genes for islet cell subsets (e.g., alpha, beta), which improved the grouping of subsets into distinct clusters. Taken together, IA-SVA is an effective and novel method to dissect multiple and correlated sources of variation in scRNA-seq data.
format Online
Article
Text
id pubmed-6242813
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-62428132018-11-27 Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA) Lee, Donghyung Cheng, Anthony Lawlor, Nathan Bolisetty, Mohan Ucar, Duygu Sci Rep Article Single cell RNA-sequencing (scRNA-seq) precisely characterizes gene expression levels and dissects variation in expression associated with the state (technical or biological) and the type of the cell, which is averaged out in bulk measurements. Multiple and correlated sources contribute to gene expression variation in single cells, which makes their estimation difficult with the existing methods developed for batch correction (e.g., surrogate variable analysis (SVA)) that estimate orthogonal transformations of these sources. We developed iteratively adjusted surrogate variable analysis (IA-SVA) that can estimate hidden factors even when they are correlated with other sources of variation by identifying a set of genes associated with each hidden factor in an iterative manner. Analysis of scRNA-seq data from human cells showed that IA-SVA could accurately capture hidden variation arising from technical (e.g., stacked doublet cells) or biological sources (e.g., cell type or cell-cycle stage). Furthermore, IA-SVA delivers a set of genes associated with the detected hidden source to be used in downstream data analyses. As a proof of concept, IA-SVA recapitulated known marker genes for islet cell subsets (e.g., alpha, beta), which improved the grouping of subsets into distinct clusters. Taken together, IA-SVA is an effective and novel method to dissect multiple and correlated sources of variation in scRNA-seq data. Nature Publishing Group UK 2018-11-19 /pmc/articles/PMC6242813/ /pubmed/30451954 http://dx.doi.org/10.1038/s41598-018-35365-9 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Lee, Donghyung
Cheng, Anthony
Lawlor, Nathan
Bolisetty, Mohan
Ucar, Duygu
Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)
title Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)
title_full Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)
title_fullStr Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)
title_full_unstemmed Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)
title_short Detection of correlated hidden factors from single cell transcriptomes using Iteratively Adjusted-SVA (IA-SVA)
title_sort detection of correlated hidden factors from single cell transcriptomes using iteratively adjusted-sva (ia-sva)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6242813/
https://www.ncbi.nlm.nih.gov/pubmed/30451954
http://dx.doi.org/10.1038/s41598-018-35365-9
work_keys_str_mv AT leedonghyung detectionofcorrelatedhiddenfactorsfromsinglecelltranscriptomesusingiterativelyadjustedsvaiasva
AT chenganthony detectionofcorrelatedhiddenfactorsfromsinglecelltranscriptomesusingiterativelyadjustedsvaiasva
AT lawlornathan detectionofcorrelatedhiddenfactorsfromsinglecelltranscriptomesusingiterativelyadjustedsvaiasva
AT bolisettymohan detectionofcorrelatedhiddenfactorsfromsinglecelltranscriptomesusingiterativelyadjustedsvaiasva
AT ucarduygu detectionofcorrelatedhiddenfactorsfromsinglecelltranscriptomesusingiterativelyadjustedsvaiasva