
Strainer: software for analysis of population variation in community genomic datasets

BACKGROUND: Metagenomic analyses of microbial communities that are comprehensive enough to provide multiple samples of most loci in the genomes of the dominant organism types will also reveal patterns of genetic variation within natural populations. New bioinformatic tools will enable visualization...


Bibliographic Details
Main authors: Eppley, John M; Tyson, Gene W; Getz, Wayne M; Banfield, Jillian F
Format: Text
Language: English
Published: BioMed Central 2007
Subjects: Software
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2110895/
https://www.ncbi.nlm.nih.gov/pubmed/17941997
http://dx.doi.org/10.1186/1471-2105-8-398
author Eppley, John M
Tyson, Gene W
Getz, Wayne M
Banfield, Jillian F
collection PubMed
description BACKGROUND: Metagenomic analyses of microbial communities that are comprehensive enough to provide multiple samples of most loci in the genomes of the dominant organism types will also reveal patterns of genetic variation within natural populations. New bioinformatic tools will enable visualization and comprehensive analysis of this sequence variation and inference of recent evolutionary and ecological processes. RESULTS: We have developed a software package for analysis and visualization of genetic variation in populations and reconstruction of strain variants from otherwise co-assembled sequences. Sequencing reads can be clustered by matching patterns of single nucleotide polymorphisms to generate predicted gene and protein variant sequences, identify conserved intergenic regulatory sequences, and determine the quantity and distribution of recombination events. CONCLUSION: The Strainer software, a first generation metagenomic bioinformatics tool, facilitates comprehension and analysis of heterogeneity intrinsic in natural communities. The program reveals the degree of clustering among closely related sequence variants and provides a rapid means to generate gene and protein sequences for functional, ecological, and evolutionary analyses.
format Text
id pubmed-2110895
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2110895 2007-12-05 Strainer: software for analysis of population variation in community genomic datasets Eppley, John M Tyson, Gene W Getz, Wayne M Banfield, Jillian F BMC Bioinformatics Software BioMed Central 2007-10-17 /pmc/articles/PMC2110895/ /pubmed/17941997 http://dx.doi.org/10.1186/1471-2105-8-398 Text en Copyright © 2007 Eppley et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Strainer: software for analysis of population variation in community genomic datasets
topic Software
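The abstract states that sequencing reads can be clustered by matching patterns of single nucleotide polymorphisms. The sketch below illustrates that general idea only; it is not Strainer's actual algorithm or API, which this record does not describe. It assumes gap-free reads already aligned to a shared coordinate system, and the function names, the (start, sequence) read representation, and the `min_minor` threshold are all hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def variable_positions(reads, min_minor=2):
    """Return columns where a second base occurs at least min_minor times."""
    columns = defaultdict(Counter)
    for start, seq in reads:
        for offset, base in enumerate(seq):
            columns[start + offset][base] += 1
    snps = set()
    for pos, counts in columns.items():
        ranked = counts.most_common(2)
        if len(ranked) == 2 and ranked[1][1] >= min_minor:
            snps.add(pos)
    return snps


def snp_pattern(read, snps):
    """Map each variable position covered by the read to the base it carries."""
    start, seq = read
    return {start + i: base for i, base in enumerate(seq) if start + i in snps}


def cluster_reads(reads, min_minor=2):
    """Greedily group reads whose SNP patterns agree wherever they overlap.

    Assumption: a read joins the first cluster with which it shares at least
    one variable position and has no conflicting base. The result depends on
    read order; this is a simplification, not the published method.
    """
    snps = variable_positions(reads, min_minor)
    clusters = []    # each cluster: {"pattern": {pos: base}, "members": [read index]}
    unassigned = []  # reads covering no variable position carry no strain signal
    for idx, read in enumerate(reads):
        pattern = snp_pattern(read, snps)
        if not pattern:
            unassigned.append(idx)
            continue
        for cluster in clusters:
            shared = pattern.keys() & cluster["pattern"].keys()
            if shared and all(pattern[p] == cluster["pattern"][p] for p in shared):
                cluster["pattern"].update(pattern)
                cluster["members"].append(idx)
                break
        else:
            clusters.append({"pattern": dict(pattern), "members": [idx]})
    return clusters, unassigned


# Toy data (invented for illustration): two variants differing at columns 2 and 6.
example_reads = [
    (0, "AAAAAA"), (0, "AACAAA"), (2, "AAAAAA"), (2, "CAAAGA"),
    (0, "AACAAA"), (2, "AAAAAA"), (4, "AAGA"),
]
example_clusters, example_unassigned = cluster_reads(example_reads)
print([c["members"] for c in example_clusters])  # [[0, 2, 5], [1, 3, 4, 6]]
```

In the toy data, read 6 covers only column 6, yet it joins the second cluster because read 3 links the C at column 2 to the G at column 6. This linkage across overlapping reads is what allows variant sequences to be separated from a co-assembled alignment.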