Cargando…
Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level
Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons. Results...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4612220/ https://www.ncbi.nlm.nih.gov/pubmed/26116928 http://dx.doi.org/10.1093/bioinformatics/btv389 |
_version_ | 1782396140303417344 |
---|---|
author | Gehrmann, Thies Reinders, Marcel J.T. |
author_facet | Gehrmann, Thies Reinders, Marcel J.T. |
author_sort | Gehrmann, Thies |
collection | PubMed |
description | Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons. Results: We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters groups of exons disregarding order to handle diversity in genomic ordering between genomes. Furthermore, Proteny assigns significance levels to the syntenic clusters such that they can be selected on statistical grounds. Finally, Proteny provides novel ways to visualize results at different scales, facilitating the exploration and interpretation of syntenic regions. We test the performance of Proteny on a standard ground truth dataset, and we illustrate the use of Proteny on two closely related genomes (two different strains of Aspergillus niger) and on two distant genomes (two species of Basidiomycota). In comparison to other tools, we find that Proteny finds clusters with more true homologies in fewer clusters that contain more genes, i.e. Proteny is able to identify a more consistent synteny. Further, we show how genome rearrangements, assembly errors, gene duplications and the conservation of specific genes can be easily studied with Proteny. Availability and implementation: Proteny is freely available at the Delft Bioinformatics Lab website http://bioinformatics.tudelft.nl/dbl/software. Contact: t.gehrmann@tudelft.nl Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4612220 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-46122202015-10-22 Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level Gehrmann, Thies Reinders, Marcel J.T. Bioinformatics Original Papers Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons. Results: We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters groups of exons disregarding order to handle diversity in genomic ordering between genomes. Furthermore, Proteny assigns significance levels to the syntenic clusters such that they can be selected on statistical grounds. Finally, Proteny provides novel ways to visualize results at different scales, facilitating the exploration and interpretation of syntenic regions. We test the performance of Proteny on a standard ground truth dataset, and we illustrate the use of Proteny on two closely related genomes (two different strains of Aspergillus niger) and on two distant genomes (two species of Basidiomycota). In comparison to other tools, we find that Proteny finds clusters with more true homologies in fewer clusters that contain more genes, i.e. Proteny is able to identify a more consistent synteny. Further, we show how genome rearrangements, assembly errors, gene duplications and the conservation of specific genes can be easily studied with Proteny. Availability and implementation: Proteny is freely available at the Delft Bioinformatics Lab website http://bioinformatics.tudelft.nl/dbl/software. Contact: t.gehrmann@tudelft.nl Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-11-01 2015-06-23 /pmc/articles/PMC4612220/ /pubmed/26116928 http://dx.doi.org/10.1093/bioinformatics/btv389 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Gehrmann, Thies Reinders, Marcel J.T. Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
title | Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
title_full | Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
title_fullStr | Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
title_full_unstemmed | Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
title_short | Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
title_sort | proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4612220/ https://www.ncbi.nlm.nih.gov/pubmed/26116928 http://dx.doi.org/10.1093/bioinformatics/btv389 |
work_keys_str_mv | AT gehrmannthies protenydiscoveringandvisualizingstatisticallysignificantsyntenicclustersattheproteomelevel AT reindersmarceljt protenydiscoveringandvisualizingstatisticallysignificantsyntenicclustersattheproteomelevel |