Cargando…
Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies
Motivation: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834796/ https://www.ncbi.nlm.nih.gov/pubmed/24037214 http://dx.doi.org/10.1093/bioinformatics/btt529 |
_version_ | 1782292044904923136 |
---|---|
author | Lee, Young-suk Krishnan, Arjun Zhu, Qian Troyanskaya, Olga G. |
author_facet | Lee, Young-suk Krishnan, Arjun Zhu, Qian Troyanskaya, Olga G. |
author_sort | Lee, Young-suk |
collection | PubMed |
description | Motivation: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is needed. Such a method must be applicable across thousands of independent studies, hundreds of gene expression technologies and hundreds of diverse human tissues and cell-types. Results: We present Unveiling RNA Sample Annotation (URSA) that leverages the complex tissue/cell-type relationships and simultaneously estimates the probabilities associated with hundreds of tissues/cell-types for any given gene expression profile. URSA provides accurate and intuitive probability values for expression profiles across independent studies and outperforms other methods, irrespective of data preprocessing techniques. Moreover, without re-training, URSA can be used to classify samples from diverse microarray platforms and even from next-generation sequencing technology. Finally, we provide a molecular interpretation for the tissue and cell-type models as the biological basis for URSA’s classifications. Availability and implementation: An interactive web interface for using URSA for gene expression analysis is available at: ursa.princeton.edu. The source code is available at https://bitbucket.org/youngl/ursa_backend. Contact: ogt@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3834796 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-38347962013-11-21 Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies Lee, Young-suk Krishnan, Arjun Zhu, Qian Troyanskaya, Olga G. Bioinformatics Original Papers Motivation: Leveraging gene expression data through large-scale integrative analyses for multicellular organisms is challenging because most samples are not fully annotated to their tissue/cell-type of origin. A computational method to classify samples using their entire gene expression profiles is needed. Such a method must be applicable across thousands of independent studies, hundreds of gene expression technologies and hundreds of diverse human tissues and cell-types. Results: We present Unveiling RNA Sample Annotation (URSA) that leverages the complex tissue/cell-type relationships and simultaneously estimates the probabilities associated with hundreds of tissues/cell-types for any given gene expression profile. URSA provides accurate and intuitive probability values for expression profiles across independent studies and outperforms other methods, irrespective of data preprocessing techniques. Moreover, without re-training, URSA can be used to classify samples from diverse microarray platforms and even from next-generation sequencing technology. Finally, we provide a molecular interpretation for the tissue and cell-type models as the biological basis for URSA’s classifications. Availability and implementation: An interactive web interface for using URSA for gene expression analysis is available at: ursa.princeton.edu. The source code is available at https://bitbucket.org/youngl/ursa_backend. Contact: ogt@cs.princeton.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-12-01 2013-09-12 /pmc/articles/PMC3834796/ /pubmed/24037214 http://dx.doi.org/10.1093/bioinformatics/btt529 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Lee, Young-suk Krishnan, Arjun Zhu, Qian Troyanskaya, Olga G. Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
title | Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
title_full | Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
title_fullStr | Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
title_full_unstemmed | Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
title_short | Ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
title_sort | ontology-aware classification of tissue and cell-type signals in gene expression profiles across platforms and technologies |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834796/ https://www.ncbi.nlm.nih.gov/pubmed/24037214 http://dx.doi.org/10.1093/bioinformatics/btt529 |
work_keys_str_mv | AT leeyoungsuk ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies AT krishnanarjun ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies AT zhuqian ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies AT troyanskayaolgag ontologyawareclassificationoftissueandcelltypesignalsingeneexpressionprofilesacrossplatformsandtechnologies |