Cargando…

Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles

BACKGROUND: Microbial communities can be location specific, and the abundance of species within locations can influence our ability to determine whether a sample belongs to one city or another. As part of the 2017 CAMDA MetaSUB Inter-City Challenge, next generation sequencing (NGS) data was generate...

Descripción completa

Detalles Bibliográficos
Autores principales: Walker, Alejandro R., Grimes, Tyler L., Datta, Somnath, Datta, Susmita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5964687/
https://www.ncbi.nlm.nih.gov/pubmed/29789016
http://dx.doi.org/10.1186/s13062-018-0215-8
_version_ 1783325227122425856
author Walker, Alejandro R.
Grimes, Tyler L.
Datta, Somnath
Datta, Susmita
author_facet Walker, Alejandro R.
Grimes, Tyler L.
Datta, Somnath
Datta, Susmita
author_sort Walker, Alejandro R.
collection PubMed
description BACKGROUND: Microbial communities can be location specific, and the abundance of species within locations can influence our ability to determine whether a sample belongs to one city or another. As part of the 2017 CAMDA MetaSUB Inter-City Challenge, next generation sequencing (NGS) data was generated from swipe samples collected from subway stations in Boston, New York City hereafter New York, and Sacramento. DNA was extracted and Illumina sequenced. Sequencing data was provided for all cities as part of 2017 CAMDA contest challenge dataset. RESULTS: Principal component analysis (PCA) showed clear clustering of the samples for the three cities, with a substantial proportion of the variance explained by the first three components. We ran two different classifiers and results were robust for error rate (< 6%) and accuracy (> 95%). The analysis of variance (ANOVA) demonstrated that overall, bacterial composition across the three cities is significantly different. A similar conclusion was reached using a novel bootstrap based test using diversity indices. Last but not least, a co-abundance association network analyses for the taxonomic levels “order”, “family”, and “genus” found different patterns of bacterial networks for the three cities. CONCLUSIONS: Bacterial fingerprint can be useful to predict sample provenance. In this work prediction of provenance reported with over 95% accuracy. Association based network analysis, emphasized similarities between the closest cities sharing common bacterial composition. ANOVA showed different patterns of bacterial amongst cities, and these findings strongly suggest that bacterial signature across multiple cities are different. This work advocates a data analysis pipeline which could be followed in order to get biological insight from this data. However, the biological conclusions from this analysis is just an early indication out of a pilot microbiome data provided to us through CAMDA 2017 challenge and will be subject to change as we get more complete data sets in the near future. This microbiome data can have potential applications in forensics, ecology, and other sciences. REVIEWERS: This article was reviewed by Klas Udekwu, Alexandra Graf, and Rafal Mostowy.
format Online
Article
Text
id pubmed-5964687
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59646872018-05-24 Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles Walker, Alejandro R. Grimes, Tyler L. Datta, Somnath Datta, Susmita Biol Direct Research BACKGROUND: Microbial communities can be location specific, and the abundance of species within locations can influence our ability to determine whether a sample belongs to one city or another. As part of the 2017 CAMDA MetaSUB Inter-City Challenge, next generation sequencing (NGS) data was generated from swipe samples collected from subway stations in Boston, New York City hereafter New York, and Sacramento. DNA was extracted and Illumina sequenced. Sequencing data was provided for all cities as part of 2017 CAMDA contest challenge dataset. RESULTS: Principal component analysis (PCA) showed clear clustering of the samples for the three cities, with a substantial proportion of the variance explained by the first three components. We ran two different classifiers and results were robust for error rate (< 6%) and accuracy (> 95%). The analysis of variance (ANOVA) demonstrated that overall, bacterial composition across the three cities is significantly different. A similar conclusion was reached using a novel bootstrap based test using diversity indices. Last but not least, a co-abundance association network analyses for the taxonomic levels “order”, “family”, and “genus” found different patterns of bacterial networks for the three cities. CONCLUSIONS: Bacterial fingerprint can be useful to predict sample provenance. In this work prediction of provenance reported with over 95% accuracy. Association based network analysis, emphasized similarities between the closest cities sharing common bacterial composition. ANOVA showed different patterns of bacterial amongst cities, and these findings strongly suggest that bacterial signature across multiple cities are different. This work advocates a data analysis pipeline which could be followed in order to get biological insight from this data. However, the biological conclusions from this analysis is just an early indication out of a pilot microbiome data provided to us through CAMDA 2017 challenge and will be subject to change as we get more complete data sets in the near future. This microbiome data can have potential applications in forensics, ecology, and other sciences. REVIEWERS: This article was reviewed by Klas Udekwu, Alexandra Graf, and Rafal Mostowy. BioMed Central 2018-05-22 /pmc/articles/PMC5964687/ /pubmed/29789016 http://dx.doi.org/10.1186/s13062-018-0215-8 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Walker, Alejandro R.
Grimes, Tyler L.
Datta, Somnath
Datta, Susmita
Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles
title Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles
title_full Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles
title_fullStr Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles
title_full_unstemmed Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles
title_short Unraveling bacterial fingerprints of city subways from microbiome 16S gene profiles
title_sort unraveling bacterial fingerprints of city subways from microbiome 16s gene profiles
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5964687/
https://www.ncbi.nlm.nih.gov/pubmed/29789016
http://dx.doi.org/10.1186/s13062-018-0215-8
work_keys_str_mv AT walkeralejandror unravelingbacterialfingerprintsofcitysubwaysfrommicrobiome16sgeneprofiles
AT grimestylerl unravelingbacterialfingerprintsofcitysubwaysfrommicrobiome16sgeneprofiles
AT dattasomnath unravelingbacterialfingerprintsofcitysubwaysfrommicrobiome16sgeneprofiles
AT dattasusmita unravelingbacterialfingerprintsofcitysubwaysfrommicrobiome16sgeneprofiles