Cargando…

Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics

A functional comparative genome analysis is essential to understand the mechanisms underlying bacterial evolution and adaptation. Detection of functional orthologs using standard global sequence similarity methods faces several problems; the need for defining arbitrary acceptance thresholds for simi...

Descripción completa

Detalles Bibliográficos
Autores principales: Koehorst, Jasper J., Saccenti, Edoardo, Schaap, Peter J., Martins dos Santos, Vitor A. P., Suarez-Diez, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5031134/
https://www.ncbi.nlm.nih.gov/pubmed/27703668
http://dx.doi.org/10.12688/f1000research.9416.3
_version_ 1782454757963595776
author Koehorst, Jasper J.
Saccenti, Edoardo
Schaap, Peter J.
Martins dos Santos, Vitor A. P.
Suarez-Diez, Maria
author_facet Koehorst, Jasper J.
Saccenti, Edoardo
Schaap, Peter J.
Martins dos Santos, Vitor A. P.
Suarez-Diez, Maria
author_sort Koehorst, Jasper J.
collection PubMed
description A functional comparative genome analysis is essential to understand the mechanisms underlying bacterial evolution and adaptation. Detection of functional orthologs using standard global sequence similarity methods faces several problems; the need for defining arbitrary acceptance thresholds for similarity and alignment length, lateral gene acquisition and the high computational cost for finding bi-directional best matches at a large scale. We investigated the use of protein domain architectures for large scale functional comparative analysis as an alternative method. The performance of both approaches was assessed through functional comparison of 446 bacterial genomes sampled at different taxonomic levels. We show that protein domain architectures provide a fast and efficient alternative to methods based on sequence similarity to identify groups of functionally equivalent proteins within and across taxonomic boundaries, and it is suitable for large scale comparative analysis. Running both methods in parallel pinpoints potential functional adaptations that may add to bacterial fitness.
format Online
Article
Text
id pubmed-5031134
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-50311342016-10-03 Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics Koehorst, Jasper J. Saccenti, Edoardo Schaap, Peter J. Martins dos Santos, Vitor A. P. Suarez-Diez, Maria F1000Res Research Article A functional comparative genome analysis is essential to understand the mechanisms underlying bacterial evolution and adaptation. Detection of functional orthologs using standard global sequence similarity methods faces several problems; the need for defining arbitrary acceptance thresholds for similarity and alignment length, lateral gene acquisition and the high computational cost for finding bi-directional best matches at a large scale. We investigated the use of protein domain architectures for large scale functional comparative analysis as an alternative method. The performance of both approaches was assessed through functional comparison of 446 bacterial genomes sampled at different taxonomic levels. We show that protein domain architectures provide a fast and efficient alternative to methods based on sequence similarity to identify groups of functionally equivalent proteins within and across taxonomic boundaries, and it is suitable for large scale comparative analysis. Running both methods in parallel pinpoints potential functional adaptations that may add to bacterial fitness. F1000Research 2017-06-27 /pmc/articles/PMC5031134/ /pubmed/27703668 http://dx.doi.org/10.12688/f1000research.9416.3 Text en Copyright: © 2017 Koehorst JJ et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Koehorst, Jasper J.
Saccenti, Edoardo
Schaap, Peter J.
Martins dos Santos, Vitor A. P.
Suarez-Diez, Maria
Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
title Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
title_full Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
title_fullStr Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
title_full_unstemmed Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
title_short Protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
title_sort protein domain architectures provide a fast, efficient and scalable alternative to sequence-based methods for comparative functional genomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5031134/
https://www.ncbi.nlm.nih.gov/pubmed/27703668
http://dx.doi.org/10.12688/f1000research.9416.3
work_keys_str_mv AT koehorstjasperj proteindomainarchitecturesprovideafastefficientandscalablealternativetosequencebasedmethodsforcomparativefunctionalgenomics
AT saccentiedoardo proteindomainarchitecturesprovideafastefficientandscalablealternativetosequencebasedmethodsforcomparativefunctionalgenomics
AT schaappeterj proteindomainarchitecturesprovideafastefficientandscalablealternativetosequencebasedmethodsforcomparativefunctionalgenomics
AT martinsdossantosvitorap proteindomainarchitecturesprovideafastefficientandscalablealternativetosequencebasedmethodsforcomparativefunctionalgenomics
AT suarezdiezmaria proteindomainarchitecturesprovideafastefficientandscalablealternativetosequencebasedmethodsforcomparativefunctionalgenomics