Cargando…

Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level

Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons. Results...

Descripción completa

Detalles Bibliográficos
Autores principales: Gehrmann, Thies, Reinders, Marcel J.T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4612220/
https://www.ncbi.nlm.nih.gov/pubmed/26116928
http://dx.doi.org/10.1093/bioinformatics/btv389
_version_ 1782396140303417344
author Gehrmann, Thies
Reinders, Marcel J.T.
author_facet Gehrmann, Thies
Reinders, Marcel J.T.
author_sort Gehrmann, Thies
collection PubMed
description Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons. Results: We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters groups of exons disregarding order to handle diversity in genomic ordering between genomes. Furthermore, Proteny assigns significance levels to the syntenic clusters such that they can be selected on statistical grounds. Finally, Proteny provides novel ways to visualize results at different scales, facilitating the exploration and interpretation of syntenic regions. We test the performance of Proteny on a standard ground truth dataset, and we illustrate the use of Proteny on two closely related genomes (two different strains of Aspergillus niger) and on two distant genomes (two species of Basidiomycota). In comparison to other tools, we find that Proteny finds clusters with more true homologies in fewer clusters that contain more genes, i.e. Proteny is able to identify a more consistent synteny. Further, we show how genome rearrangements, assembly errors, gene duplications and the conservation of specific genes can be easily studied with Proteny. Availability and implementation: Proteny is freely available at the Delft Bioinformatics Lab website http://bioinformatics.tudelft.nl/dbl/software. Contact: t.gehrmann@tudelft.nl Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4612220
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46122202015-10-22 Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level Gehrmann, Thies Reinders, Marcel J.T. Bioinformatics Original Papers Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons. Results: We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters groups of exons disregarding order to handle diversity in genomic ordering between genomes. Furthermore, Proteny assigns significance levels to the syntenic clusters such that they can be selected on statistical grounds. Finally, Proteny provides novel ways to visualize results at different scales, facilitating the exploration and interpretation of syntenic regions. We test the performance of Proteny on a standard ground truth dataset, and we illustrate the use of Proteny on two closely related genomes (two different strains of Aspergillus niger) and on two distant genomes (two species of Basidiomycota). In comparison to other tools, we find that Proteny finds clusters with more true homologies in fewer clusters that contain more genes, i.e. Proteny is able to identify a more consistent synteny. Further, we show how genome rearrangements, assembly errors, gene duplications and the conservation of specific genes can be easily studied with Proteny. Availability and implementation: Proteny is freely available at the Delft Bioinformatics Lab website http://bioinformatics.tudelft.nl/dbl/software. Contact: t.gehrmann@tudelft.nl Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-11-01 2015-06-23 /pmc/articles/PMC4612220/ /pubmed/26116928 http://dx.doi.org/10.1093/bioinformatics/btv389 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Gehrmann, Thies
Reinders, Marcel J.T.
Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
title Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
title_full Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
title_fullStr Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
title_full_unstemmed Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
title_short Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
title_sort proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4612220/
https://www.ncbi.nlm.nih.gov/pubmed/26116928
http://dx.doi.org/10.1093/bioinformatics/btv389
work_keys_str_mv AT gehrmannthies protenydiscoveringandvisualizingstatisticallysignificantsyntenicclustersattheproteomelevel
AT reindersmarceljt protenydiscoveringandvisualizingstatisticallysignificantsyntenicclustersattheproteomelevel