Cargando…

An alternative approach to multiple genome comparison

Genome comparison is now a crucial step for genome annotation and identification of regulatory motifs. Genome comparison aims for instance at finding genomic regions either specific to or in one-to-one correspondance between individuals/strains/species. It serves e.g. to pre-annotate a new genome by...

Descripción completa

Detalles Bibliográficos
Autores principales: Mancheron, Alban, Uricaru, Raluca, Rivals, Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3159434/
https://www.ncbi.nlm.nih.gov/pubmed/21646341
http://dx.doi.org/10.1093/nar/gkr177
_version_ 1782210466823536640
author Mancheron, Alban
Uricaru, Raluca
Rivals, Eric
author_facet Mancheron, Alban
Uricaru, Raluca
Rivals, Eric
author_sort Mancheron, Alban
collection PubMed
description Genome comparison is now a crucial step for genome annotation and identification of regulatory motifs. Genome comparison aims for instance at finding genomic regions either specific to or in one-to-one correspondance between individuals/strains/species. It serves e.g. to pre-annotate a new genome by automatically transfering annotations from a known one. However, efficiency, flexibility and objectives of current methods do not suit the whole spectrum of applications, genome sizes and organizations. Innovative approaches are still needed. Hence, we propose an alternative way of comparing multiple genomes based on segmentation by similarity. In this framework, rather than being formulated as a complex optimization problem, genome comparison is seen as a segmentation question for which a single optimal solution can be found in almost linear time. We apply our method to analyse three strains of a virulent pathogenic bacteria, Ehrlichia ruminantium, and identify 92 new genes. We also find out that a substantial number of genes thought to be strain specific have potential orthologs in the other strains. Our solution is implemented in an efficient program, qod, equipped with a user-friendly interface, and enables the automatic transfer of annotations betwen compared genomes or contigs (Video in Supplementary Data). Because it somehow disregards the relative order of genomic blocks, qod can handle unfinished genomes, which due to the difficulty of sequencing completion may become an interesting characteristic for the future. Availabilty: http://www.atgc-montpellier.fr/qod.
format Online
Article
Text
id pubmed-3159434
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31594342011-08-22 An alternative approach to multiple genome comparison Mancheron, Alban Uricaru, Raluca Rivals, Eric Nucleic Acids Res Methods Online Genome comparison is now a crucial step for genome annotation and identification of regulatory motifs. Genome comparison aims for instance at finding genomic regions either specific to or in one-to-one correspondance between individuals/strains/species. It serves e.g. to pre-annotate a new genome by automatically transfering annotations from a known one. However, efficiency, flexibility and objectives of current methods do not suit the whole spectrum of applications, genome sizes and organizations. Innovative approaches are still needed. Hence, we propose an alternative way of comparing multiple genomes based on segmentation by similarity. In this framework, rather than being formulated as a complex optimization problem, genome comparison is seen as a segmentation question for which a single optimal solution can be found in almost linear time. We apply our method to analyse three strains of a virulent pathogenic bacteria, Ehrlichia ruminantium, and identify 92 new genes. We also find out that a substantial number of genes thought to be strain specific have potential orthologs in the other strains. Our solution is implemented in an efficient program, qod, equipped with a user-friendly interface, and enables the automatic transfer of annotations betwen compared genomes or contigs (Video in Supplementary Data). Because it somehow disregards the relative order of genomic blocks, qod can handle unfinished genomes, which due to the difficulty of sequencing completion may become an interesting characteristic for the future. Availabilty: http://www.atgc-montpellier.fr/qod. Oxford University Press 2011-08 2011-06-06 /pmc/articles/PMC3159434/ /pubmed/21646341 http://dx.doi.org/10.1093/nar/gkr177 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Mancheron, Alban
Uricaru, Raluca
Rivals, Eric
An alternative approach to multiple genome comparison
title An alternative approach to multiple genome comparison
title_full An alternative approach to multiple genome comparison
title_fullStr An alternative approach to multiple genome comparison
title_full_unstemmed An alternative approach to multiple genome comparison
title_short An alternative approach to multiple genome comparison
title_sort alternative approach to multiple genome comparison
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3159434/
https://www.ncbi.nlm.nih.gov/pubmed/21646341
http://dx.doi.org/10.1093/nar/gkr177
work_keys_str_mv AT mancheronalban analternativeapproachtomultiplegenomecomparison
AT uricaruraluca analternativeapproachtomultiplegenomecomparison
AT rivalseric analternativeapproachtomultiplegenomecomparison
AT mancheronalban alternativeapproachtomultiplegenomecomparison
AT uricaruraluca alternativeapproachtomultiplegenomecomparison
AT rivalseric alternativeapproachtomultiplegenomecomparison