
OrthoClust: an orthology-based network framework for clustering data across multiple species

Increasingly, high-dimensional genomics data are becoming available for many organisms. Here, we develop OrthoClust for simultaneously clustering data across multiple species. OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the or...


Bibliographic Details
Main authors: Yan, Koon-Kiu; Wang, Daifeng; Rozowsky, Joel; Zheng, Henry; Cheng, Chao; Gerstein, Mark
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4289247/
https://www.ncbi.nlm.nih.gov/pubmed/25249401
http://dx.doi.org/10.1186/gb-2014-15-8-r100
author Yan, Koon-Kiu
Wang, Daifeng
Rozowsky, Joel
Zheng, Henry
Cheng, Chao
Gerstein, Mark
collection PubMed
description Increasingly, high-dimensional genomics data are becoming available for many organisms. Here, we develop OrthoClust for simultaneously clustering data across multiple species. OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific. We demonstrate the application of OrthoClust using the RNA-Seq expression profiles of Caenorhabditis elegans and Drosophila melanogaster from the modENCODE consortium. A potential application of cross-species modules is to infer putative analogous functions of uncharacterized elements like non-coding RNAs based on guilt-by-association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/gb-2014-15-8-r100) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4289247
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4289247 2015-01-11 OrthoClust: an orthology-based network framework for clustering data across multiple species Yan, Koon-Kiu Wang, Daifeng Rozowsky, Joel Zheng, Henry Cheng, Chao Gerstein, Mark Genome Biol Method Increasingly, high-dimensional genomics data are becoming available for many organisms. Here, we develop OrthoClust for simultaneously clustering data across multiple species. OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific. We demonstrate the application of OrthoClust using the RNA-Seq expression profiles of Caenorhabditis elegans and Drosophila melanogaster from the modENCODE consortium. A potential application of cross-species modules is to infer putative analogous functions of uncharacterized elements like non-coding RNAs based on guilt-by-association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/gb-2014-15-8-r100) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-28 2014 /pmc/articles/PMC4289247/ /pubmed/25249401 http://dx.doi.org/10.1186/gb-2014-15-8-r100 Text en © Yan et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title OrthoClust: an orthology-based network framework for clustering data across multiple species
topic Method