Cargando…
i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets
Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementation...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3258164/ https://www.ncbi.nlm.nih.gov/pubmed/22102584 http://dx.doi.org/10.1093/nar/gkr955 |
_version_ | 1782221239933206528 |
---|---|
author | Proost, Sebastian Fostier, Jan De Witte, Dieter Dhoedt, Bart Demeester, Piet Van de Peer, Yves Vandepoele, Klaas |
author_facet | Proost, Sebastian Fostier, Jan De Witte, Dieter Dhoedt, Bart Demeester, Piet Van de Peer, Yves Vandepoele, Klaas |
author_sort | Proost, Sebastian |
collection | PubMed |
description | Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1 h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein–protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution. |
format | Online Article Text |
id | pubmed-3258164 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-32581642012-01-17 i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets Proost, Sebastian Fostier, Jan De Witte, Dieter Dhoedt, Bart Demeester, Piet Van de Peer, Yves Vandepoele, Klaas Nucleic Acids Res Methods Online Comparative genomics is a powerful means to gain insight into the evolutionary processes that shape the genomes of related species. As the number of sequenced genomes increases, the development of software to perform accurate cross-species analyses becomes indispensable. However, many implementations that have the ability to compare multiple genomes exhibit unfavorable computational and memory requirements, limiting the number of genomes that can be analyzed in one run. Here, we present a software package to unveil genomic homology based on the identification of conservation of gene content and gene order (collinearity), i-ADHoRe 3.0, and its application to eukaryotic genomes. The use of efficient algorithms and support for parallel computing enable the analysis of large-scale data sets. Unlike other tools, i-ADHoRe can process the Ensembl data set, containing 49 species, in 1 h. Furthermore, the profile search is more sensitive to detect degenerate genomic homology than chaining pairwise collinearity information based on transitive homology. From ultra-conserved collinear regions between mammals and birds, by integrating coexpression information and protein–protein interactions, we identified more than 400 regions in the human genome showing significant functional coherence. The different algorithmical improvements ensure that i-ADHoRe 3.0 will remain a powerful tool to study genome evolution. Oxford University Press 2012-01 2011-11-17 /pmc/articles/PMC3258164/ /pubmed/22102584 http://dx.doi.org/10.1093/nar/gkr955 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Proost, Sebastian Fostier, Jan De Witte, Dieter Dhoedt, Bart Demeester, Piet Van de Peer, Yves Vandepoele, Klaas i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
title | i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
title_full | i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
title_fullStr | i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
title_full_unstemmed | i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
title_short | i-ADHoRe 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
title_sort | i-adhore 3.0—fast and sensitive detection of genomic homology in extremely large data sets |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3258164/ https://www.ncbi.nlm.nih.gov/pubmed/22102584 http://dx.doi.org/10.1093/nar/gkr955 |
work_keys_str_mv | AT proostsebastian iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets AT fostierjan iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets AT dewittedieter iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets AT dhoedtbart iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets AT demeesterpiet iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets AT vandepeeryves iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets AT vandepoeleklaas iadhore30fastandsensitivedetectionofgenomichomologyinextremelylargedatasets |