Cargando…
Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance
Motivation: Metagenomic sequencing of clinical samples provides a promising technique for direct pathogen detection and characterization in biosurveillance. Taxonomic analysis at the strain level can be used to resolve serotypes of a pathogen in biosurveillance. Sigma was developed for strain-level...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287953/ https://www.ncbi.nlm.nih.gov/pubmed/25266224 http://dx.doi.org/10.1093/bioinformatics/btu641 |
_version_ | 1782351888438525952 |
---|---|
author | Ahn, Tae-Hyuk Chai, Juanjuan Pan, Chongle |
author_facet | Ahn, Tae-Hyuk Chai, Juanjuan Pan, Chongle |
author_sort | Ahn, Tae-Hyuk |
collection | PubMed |
description | Motivation: Metagenomic sequencing of clinical samples provides a promising technique for direct pathogen detection and characterization in biosurveillance. Taxonomic analysis at the strain level can be used to resolve serotypes of a pathogen in biosurveillance. Sigma was developed for strain-level identification and quantification of pathogens using their reference genomes based on metagenomic analysis. Results: Sigma provides not only accurate strain-level inferences, but also three unique capabilities: (i) Sigma quantifies the statistical uncertainty of its inferences, which includes hypothesis testing of identified genomes and confidence interval estimation of their relative abundances; (ii) Sigma enables strain variant calling by assigning metagenomic reads to their most likely reference genomes; and (iii) Sigma supports parallel computing for fast analysis of large datasets. The algorithm performance was evaluated using simulated mock communities and fecal samples with spike-in pathogen strains. Availability and Implementation: Sigma was implemented in C++ with source codes and binaries freely available at http://sigma.omicsbio.org. Contact: panc@ornl.gov Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4287953 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-42879532015-01-30 Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance Ahn, Tae-Hyuk Chai, Juanjuan Pan, Chongle Bioinformatics Original Papers Motivation: Metagenomic sequencing of clinical samples provides a promising technique for direct pathogen detection and characterization in biosurveillance. Taxonomic analysis at the strain level can be used to resolve serotypes of a pathogen in biosurveillance. Sigma was developed for strain-level identification and quantification of pathogens using their reference genomes based on metagenomic analysis. Results: Sigma provides not only accurate strain-level inferences, but also three unique capabilities: (i) Sigma quantifies the statistical uncertainty of its inferences, which includes hypothesis testing of identified genomes and confidence interval estimation of their relative abundances; (ii) Sigma enables strain variant calling by assigning metagenomic reads to their most likely reference genomes; and (iii) Sigma supports parallel computing for fast analysis of large datasets. The algorithm performance was evaluated using simulated mock communities and fecal samples with spike-in pathogen strains. Availability and Implementation: Sigma was implemented in C++ with source codes and binaries freely available at http://sigma.omicsbio.org. Contact: panc@ornl.gov Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-01-15 2014-09-29 /pmc/articles/PMC4287953/ /pubmed/25266224 http://dx.doi.org/10.1093/bioinformatics/btu641 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Ahn, Tae-Hyuk Chai, Juanjuan Pan, Chongle Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance |
title | Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance |
title_full | Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance |
title_fullStr | Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance |
title_full_unstemmed | Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance |
title_short | Sigma: Strain-level inference of genomes from metagenomic analysis for biosurveillance |
title_sort | sigma: strain-level inference of genomes from metagenomic analysis for biosurveillance |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4287953/ https://www.ncbi.nlm.nih.gov/pubmed/25266224 http://dx.doi.org/10.1093/bioinformatics/btu641 |
work_keys_str_mv | AT ahntaehyuk sigmastrainlevelinferenceofgenomesfrommetagenomicanalysisforbiosurveillance AT chaijuanjuan sigmastrainlevelinferenceofgenomesfrommetagenomicanalysisforbiosurveillance AT panchongle sigmastrainlevelinferenceofgenomesfrommetagenomicanalysisforbiosurveillance |