Cargando…

Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram

In this paper we present a new ab initio approach for constructing an unrooted dendrogram using protein clusters, an approach that has the potential for estimating relationships among several thousands of species based on their putative proteomes. We employ an open-source software program called pCl...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yunyun, Call, Douglas R., Broschat, Shira L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3590580/
https://www.ncbi.nlm.nih.gov/pubmed/23509450
http://dx.doi.org/10.1155/2013/191586
_version_ 1782261888160104448
author Zhou, Yunyun
Call, Douglas R.
Broschat, Shira L.
author_facet Zhou, Yunyun
Call, Douglas R.
Broschat, Shira L.
author_sort Zhou, Yunyun
collection PubMed
description In this paper we present a new ab initio approach for constructing an unrooted dendrogram using protein clusters, an approach that has the potential for estimating relationships among several thousands of species based on their putative proteomes. We employ an open-source software program called pClust that was developed for use in metagenomic studies. Sequence alignment is performed by pClust using the Smith-Waterman algorithm, which is known to give optimal alignment and, hence, greater accuracy than BLAST-based methods. Protein clusters generated by pClust are used to create protein profiles for each species in the dendrogram, these profiles forming a correlation filter library for use with a new taxon. To augment the dendrogram with a new taxon, a protein profile for the taxon is created using BLASTp, and this new taxon is placed into a position within the dendrogram corresponding to the highest correlation with profiles in the correlation filter library. This work was initiated because of our interest in plasmids, and each step is illustrated using proteomes from Gram-negative bacterial plasmids. Proteomes for 527 plasmids were used to generate the dendrogram, and to demonstrate the utility of the insertion algorithm twelve recently sequenced pAKD plasmids were used to augment the dendrogram.
format Online
Article
Text
id pubmed-3590580
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-35905802013-03-18 Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram Zhou, Yunyun Call, Douglas R. Broschat, Shira L. Adv Bioinformatics Research Article In this paper we present a new ab initio approach for constructing an unrooted dendrogram using protein clusters, an approach that has the potential for estimating relationships among several thousands of species based on their putative proteomes. We employ an open-source software program called pClust that was developed for use in metagenomic studies. Sequence alignment is performed by pClust using the Smith-Waterman algorithm, which is known to give optimal alignment and, hence, greater accuracy than BLAST-based methods. Protein clusters generated by pClust are used to create protein profiles for each species in the dendrogram, these profiles forming a correlation filter library for use with a new taxon. To augment the dendrogram with a new taxon, a protein profile for the taxon is created using BLASTp, and this new taxon is placed into a position within the dendrogram corresponding to the highest correlation with profiles in the correlation filter library. This work was initiated because of our interest in plasmids, and each step is illustrated using proteomes from Gram-negative bacterial plasmids. Proteomes for 527 plasmids were used to generate the dendrogram, and to demonstrate the utility of the insertion algorithm twelve recently sequenced pAKD plasmids were used to augment the dendrogram. Hindawi Publishing Corporation 2013 2013-02-20 /pmc/articles/PMC3590580/ /pubmed/23509450 http://dx.doi.org/10.1155/2013/191586 Text en Copyright © 2013 Yunyun Zhou et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhou, Yunyun
Call, Douglas R.
Broschat, Shira L.
Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram
title Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram
title_full Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram
title_fullStr Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram
title_full_unstemmed Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram
title_short Using Protein Clusters from Whole Proteomes to Construct and Augment a Dendrogram
title_sort using protein clusters from whole proteomes to construct and augment a dendrogram
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3590580/
https://www.ncbi.nlm.nih.gov/pubmed/23509450
http://dx.doi.org/10.1155/2013/191586
work_keys_str_mv AT zhouyunyun usingproteinclustersfromwholeproteomestoconstructandaugmentadendrogram
AT calldouglasr usingproteinclustersfromwholeproteomestoconstructandaugmentadendrogram
AT broschatshiral usingproteinclustersfromwholeproteomestoconstructandaugmentadendrogram