Cargando…

Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences

BACKGROUND: Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manif...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ruegger, Paul M, Della Vedova, Gianluca, Jiang, Tao, Borneman, James
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224148/ https://www.ncbi.nlm.nih.gov/pubmed/21985453 http://dx.doi.org/10.1186/1471-2105-12-394

_version_	1782217350496387072
author	Ruegger, Paul M Della Vedova, Gianluca Jiang, Tao Borneman, James
author_facet	Ruegger, Paul M Della Vedova, Gianluca Jiang, Tao Borneman, James
author_sort	Ruegger, Paul M
collection	PubMed
description	BACKGROUND: Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manifests a unique theoretical hybridization pattern (a binary fingerprint) to a probe set. It does so without taking into account similarity between training gene sequences or their putative taxonomic classifications, however. We present an improved algorithm for probe set selection that utilizes the available taxonomic information of training gene sequences and attempts to choose probes such that the resultant binary fingerprints cluster into real taxonomic groups. RESULTS: Gene sequences manifesting identical fingerprints with probes chosen by the new algorithm are more likely to be from the same taxonomic group than probes chosen by the previous algorithm. In cases where they are from different taxonomic groups, underlying DNA sequences of identical fingerprints are more similar to each other in probe sets made with the new versus the previous algorithm. Complete removal of large taxonomic groups from training data does not greatly decrease the ability of probe sets to distinguish those groups. CONCLUSIONS: Probe sets made from the new algorithm create fingerprints that more reliably cluster into biologically meaningful groups. The method can readily distinguish microbial phylotypes that were excluded from the training sequences, suggesting novel microbes can also be detected.
format	Online Article Text
id	pubmed-3224148
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32241482011-11-30 Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences Ruegger, Paul M Della Vedova, Gianluca Jiang, Tao Borneman, James BMC Bioinformatics Methodology Article BACKGROUND: Population levels of microbial phylotypes can be examined using a hybridization-based method that utilizes a small set of computationally-designed DNA probes targeted to a gene common to all. Our previous algorithm attempts to select a set of probes such that each training sequence manifests a unique theoretical hybridization pattern (a binary fingerprint) to a probe set. It does so without taking into account similarity between training gene sequences or their putative taxonomic classifications, however. We present an improved algorithm for probe set selection that utilizes the available taxonomic information of training gene sequences and attempts to choose probes such that the resultant binary fingerprints cluster into real taxonomic groups. RESULTS: Gene sequences manifesting identical fingerprints with probes chosen by the new algorithm are more likely to be from the same taxonomic group than probes chosen by the previous algorithm. In cases where they are from different taxonomic groups, underlying DNA sequences of identical fingerprints are more similar to each other in probe sets made with the new versus the previous algorithm. Complete removal of large taxonomic groups from training data does not greatly decrease the ability of probe sets to distinguish those groups. CONCLUSIONS: Probe sets made from the new algorithm create fingerprints that more reliably cluster into biologically meaningful groups. The method can readily distinguish microbial phylotypes that were excluded from the training sequences, suggesting novel microbes can also be detected. BioMed Central 2011-10-10 /pmc/articles/PMC3224148/ /pubmed/21985453 http://dx.doi.org/10.1186/1471-2105-12-394 Text en Copyright ©2011 Ruegger et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Ruegger, Paul M Della Vedova, Gianluca Jiang, Tao Borneman, James Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
title	Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
title_full	Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
title_fullStr	Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
title_full_unstemmed	Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
title_short	Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
title_sort	improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224148/ https://www.ncbi.nlm.nih.gov/pubmed/21985453 http://dx.doi.org/10.1186/1471-2105-12-394
work_keys_str_mv	AT rueggerpaulm improvingprobesetselectionformicrobialcommunityanalysisbyleveragingtaxonomicinformationoftrainingsequences AT dellavedovagianluca improvingprobesetselectionformicrobialcommunityanalysisbyleveragingtaxonomicinformationoftrainingsequences AT jiangtao improvingprobesetselectionformicrobialcommunityanalysisbyleveragingtaxonomicinformationoftrainingsequences AT bornemanjames improvingprobesetselectionformicrobialcommunityanalysisbyleveragingtaxonomicinformationoftrainingsequences

Improving probe set selection for microbial community analysis by leveraging taxonomic information of training sequences

Ejemplares similares