Cargando…

Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies

Motivation: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Young-suk, Krishnan, Arjun, Zhu, Qian, Troyanskaya, Olga G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834796/
https://www.ncbi.nlm.nih.gov/pubmed/24037214
http://dx.doi.org/10.1093/bioinformatics/btt529
_version_ 1782292044904923136
author Lee, Young-suk
Krishnan, Arjun
Zhu, Qian
Troyanskaya, Olga G.
author_facet Lee, Young-suk
Krishnan, Arjun
Zhu, Qian
Troyanskaya, Olga G.
author_sort Lee, Young-suk
collection PubMed
description Motivation: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is needed. Such a method must be applicable across thousands of independent studies, hundreds of gene expression technologies and hundreds of diverse human tissues and cell-types. Results: We present Unveiling RNA Sample Annotation (URSA) that leverages the complex tissue/cell-type relationships and simultaneously estimates the probabilities associated with hundreds of tissues/cell-types for any given gene expression profile. URSA provides accurate and intuitive probability values for expression profiles across independent studies and outperforms other methods, irrespective of data preprocessing techniques. Moreover, without re-training, URSA can be used to classify samples from diverse microarray platforms and even from next-generation sequencing technology. Finally, we provide a molecular interpretation for the tissue and cell-type models as the biological basis for URSA’s classifications. Availability and implementation: An interactive web interface for using URSA for gene expression analysis is available at: ursa.princeton.edu. The source code is available at https://bitbucket.org/youngl/ursa_backend. Contact: ogt@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3834796
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-38347962013-11-21 Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies Lee, Young-suk Krishnan, Arjun Zhu, Qian Troyanskaya, Olga G. Bioinformatics Original Papers Motivation: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is needed. Such a method must be applicable across thousands of independent studies, hundreds of gene expression technologies and hundreds of diverse human tissues and cell-types. Results: We present Unveiling RNA Sample Annotation (URSA) that leverages the complex tissue/cell-type relationships and simultaneously estimates the probabilities associated with hundreds of tissues/cell-types for any given gene expression profile. URSA provides accurate and intuitive probability values for expression profiles across independent studies and outperforms other methods, irrespective of data preprocessing techniques. Moreover, without re-training, URSA can be used to classify samples from diverse microarray platforms and even from next-generation sequencing technology. Finally, we provide a molecular interpretation for the tissue and cell-type models as the biological basis for URSA’s classifications. Availability and implementation: An interactive web interface for using URSA for gene expression analysis is available at: ursa.princeton.edu. The source code is available at https://bitbucket.org/youngl/ursa_backend. Contact: ogt@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-12-01 2013-09-12 /pmc/articles/PMC3834796/ /pubmed/24037214 http://dx.doi.org/10.1093/bioinformatics/btt529 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Lee, Young-suk
Krishnan, Arjun
Zhu, Qian
Troyanskaya, Olga G.
Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
title Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
title_full Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
title_fullStr Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
title_full_unstemmed Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
title_short Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
title_sort ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834796/
https://www.ncbi.nlm.nih.gov/pubmed/24037214
http://dx.doi.org/10.1093/bioinformatics/btt529
work_keys_str_mv AT leeyoungsuk ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies
AT krishnanarjun ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies
AT zhuqian ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies
AT troyanskayaolgag ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies