Cargando…

i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets

Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementation...

Descripción completa

Detalles Bibliográficos
Autores principales: Proost, Sebastian, Fostier, Jan, De Witte, Dieter, Dhoedt, Bart, Demeester, Piet, Van de Peer, Yves, Vandepoele, Klaas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3258164/
https://www.ncbi.nlm.nih.gov/pubmed/22102584
http://dx.doi.org/10.1093/nar/gkr955
_version_ 1782221239933206528
author Proost, Sebastian
Fostier, Jan
De Witte, Dieter
Dhoedt, Bart
Demeester, Piet
Van de Peer, Yves
Vandepoele, Klaas
author_facet Proost, Sebastian
Fostier, Jan
De Witte, Dieter
Dhoedt, Bart
Demeester, Piet
Van de Peer, Yves
Vandepoele, Klaas
author_sort Proost, Sebastian
collection PubMed
description Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1 h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein–protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution.
format Online
Article
Text
id pubmed-3258164
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32581642012-01-17 i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets Proost, Sebastian Fostier, Jan De Witte, Dieter Dhoedt, Bart Demeester, Piet Van de Peer, Yves Vandepoele, Klaas Nucleic Acids Res Methods Online Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1 h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein–protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution. Oxford University Press 2012-01 2011-11-17 /pmc/articles/PMC3258164/ /pubmed/22102584 http://dx.doi.org/10.1093/nar/gkr955 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Proost, Sebastian
Fostier, Jan
De Witte, Dieter
Dhoedt, Bart
Demeester, Piet
Van de Peer, Yves
Vandepoele, Klaas
i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
title i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
title_full i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
title_fullStr i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
title_full_unstemmed i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
title_short i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
title_sort i-adhore 3.0—fast and sensitive detection of genomic homology in extremely large data sets
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3258164/
https://www.ncbi.nlm.nih.gov/pubmed/22102584
http://dx.doi.org/10.1093/nar/gkr955
work_keys_str_mv AT proostsebastian iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets
AT fostierjan iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets
AT dewittedieter iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets
AT dhoedtbart iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets
AT demeesterpiet iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets
AT vandepeeryves iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets
AT vandepoeleklaas iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets