OrthoClust: an orthology-based network framework for clustering data across multiple species

Bibliographic Details
Main Authors: Yan, Koon-Kiu; Wang, Daifeng; Rozowsky, Joel; Zheng, Henry; Cheng, Chao; Gerstein, Mark
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4289247/
https://www.ncbi.nlm.nih.gov/pubmed/25249401
http://dx.doi.org/10.1186/gb-2014-15-8-r100
Description
Summary: Increasingly, high-dimensional genomics data are becoming available for many organisms. Here, we develop OrthoClust for simultaneously clustering data across multiple species. OrthoClust is a computational framework that integrates the co-association networks of individual species by utilizing the orthology relationships of genes between species. It outputs optimized modules that are fundamentally cross-species, which can either be conserved or species-specific. We demonstrate the application of OrthoClust using the RNA-Seq expression profiles of Caenorhabditis elegans and Drosophila melanogaster from the modENCODE consortium. A potential application of cross-species modules is to infer putative analogous functions of uncharacterized elements such as non-coding RNAs based on guilt-by-association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/gb-2014-15-8-r100) contains supplementary material, which is available to authorized users.