Cargando…

Identification of cell-type-specific marker genes from co-expression patterns in tissue samples

MOTIVATION: Marker genes, defined as genes that are expressed primarily in a single-cell type, can be identified from the single-cell transcriptome; however, such data are not always available for the many uses of marker genes, such as deconvolution of bulk tissue. Marker genes for a cell type, howe...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiu, Yixuan, Wang, Jiebiao, Lei, Jing, Roeder, Kathryn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504631/
https://www.ncbi.nlm.nih.gov/pubmed/33904573
http://dx.doi.org/10.1093/bioinformatics/btab257
_version_ 1784581358860894208
author Qiu, Yixuan
Wang, Jiebiao
Lei, Jing
Roeder, Kathryn
author_facet Qiu, Yixuan
Wang, Jiebiao
Lei, Jing
Roeder, Kathryn
author_sort Qiu, Yixuan
collection PubMed
description MOTIVATION: Marker genes, defined as genes that are expressed primarily in a single-cell type, can be identified from the single-cell transcriptome; however, such data are not always available for the many uses of marker genes, such as deconvolution of bulk tissue. Marker genes for a cell type, however, are highly correlated in bulk data, because their expression levels depend primarily on the proportion of that cell type in the samples. Therefore, when many tissue samples are analyzed, it is possible to identify these marker genes from the correlation pattern. RESULTS: To capitalize on this pattern, we develop a new algorithm to detect marker genes by combining published information about likely marker genes with bulk transcriptome data in the form of a semi-supervised algorithm. The algorithm then exploits the correlation structure of the bulk data to refine the published marker genes by adding or removing genes from the list. AVAILABILITY AND IMPLEMENTATION: We implement this method as an R package markerpen, hosted on CRAN (https://CRAN.R-project.org/package=markerpen). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8504631
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85046312021-10-13 Identification of cell-type-specific marker genes from co-expression patterns in tissue samples Qiu, Yixuan Wang, Jiebiao Lei, Jing Roeder, Kathryn Bioinformatics Original Papers MOTIVATION: Marker genes, defined as genes that are expressed primarily in a single-cell type, can be identified from the single-cell transcriptome; however, such data are not always available for the many uses of marker genes, such as deconvolution of bulk tissue. Marker genes for a cell type, however, are highly correlated in bulk data, because their expression levels depend primarily on the proportion of that cell type in the samples. Therefore, when many tissue samples are analyzed, it is possible to identify these marker genes from the correlation pattern. RESULTS: To capitalize on this pattern, we develop a new algorithm to detect marker genes by combining published information about likely marker genes with bulk transcriptome data in the form of a semi-supervised algorithm. The algorithm then exploits the correlation structure of the bulk data to refine the published marker genes by adding or removing genes from the list. AVAILABILITY AND IMPLEMENTATION: We implement this method as an R package markerpen, hosted on CRAN (https://CRAN.R-project.org/package=markerpen). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-04-27 /pmc/articles/PMC8504631/ /pubmed/33904573 http://dx.doi.org/10.1093/bioinformatics/btab257 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Qiu, Yixuan
Wang, Jiebiao
Lei, Jing
Roeder, Kathryn
Identification of cell-type-specific marker genes from co-expression patterns in tissue samples
title Identification of cell-type-specific marker genes from co-expression patterns in tissue samples
title_full Identification of cell-type-specific marker genes from co-expression patterns in tissue samples
title_fullStr Identification of cell-type-specific marker genes from co-expression patterns in tissue samples
title_full_unstemmed Identification of cell-type-specific marker genes from co-expression patterns in tissue samples
title_short Identification of cell-type-specific marker genes from co-expression patterns in tissue samples
title_sort identification of cell-type-specific marker genes from co-expression patterns in tissue samples
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504631/
https://www.ncbi.nlm.nih.gov/pubmed/33904573
http://dx.doi.org/10.1093/bioinformatics/btab257
work_keys_str_mv AT qiuyixuan identificationofcelltypespecificmarkergenesfromcoexpressionpatternsintissuesamples
AT wangjiebiao identificationofcelltypespecificmarkergenesfromcoexpressionpatternsintissuesamples
AT leijing identificationofcelltypespecificmarkergenesfromcoexpressionpatternsintissuesamples
AT roederkathryn identificationofcelltypespecificmarkergenesfromcoexpressionpatternsintissuesamples