Cargando…

Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors

An increasing body of literature suggests that both individual and collections of bacteria are associated with the progression of colorectal cancer. As the number of studies investigating these associations increases and the number of subjects in each study increases, a meta-analysis to identify the...

Descripción completa

Detalles Bibliográficos
Autores principales: Sze, Marc A., Schloss, Patrick D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5989068/
https://www.ncbi.nlm.nih.gov/pubmed/29871916
http://dx.doi.org/10.1128/mBio.00630-18
_version_ 1783329388772720640
author Sze, Marc A.
Schloss, Patrick D.
author_facet Sze, Marc A.
Schloss, Patrick D.
author_sort Sze, Marc A.
collection PubMed
description An increasing body of literature suggests that both individual and collections of bacteria are associated with the progression of colorectal cancer. As the number of studies investigating these associations increases and the number of subjects in each study increases, a meta-analysis to identify the associations that are the most predictive of disease progression is warranted. We analyzed previously published 16S rRNA gene sequencing data collected from feces and colon tissue. We quantified the odds ratios (ORs) for individual bacterial taxa that were associated with an individual having tumors relative to a normal colon. Among the fecal samples, there were no taxa that had significant ORs associated with adenoma and there were 8 taxa with significant ORs associated with carcinoma. Similarly, among the tissue samples, there were no taxa that had a significant OR associated with adenoma and there were 3 taxa with significant ORs associated with carcinoma. Among the significant ORs, the association between individual taxa and tumor diagnosis was equal to or below 7.11. Because individual taxa had limited association with tumor diagnosis, we trained Random Forest classification models using only the taxa that had significant ORs, using the entire collection of taxa found in each study, and using operational taxonomic units defined based on a 97% similarity threshold. All training approaches yielded similar classification success as measured using the area under the curve. The ability to correctly classify individuals with adenomas was poor, and the ability to classify individuals with carcinomas was considerably better using sequences from feces or tissue.
format Online
Article
Text
id pubmed-5989068
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-59890682018-06-12 Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors Sze, Marc A. Schloss, Patrick D. mBio Research Article An increasing body of literature suggests that both individual and collections of bacteria are associated with the progression of colorectal cancer. As the number of studies investigating these associations increases and the number of subjects in each study increases, a meta-analysis to identify the associations that are the most predictive of disease progression is warranted. We analyzed previously published 16S rRNA gene sequencing data collected from feces and colon tissue. We quantified the odds ratios (ORs) for individual bacterial taxa that were associated with an individual having tumors relative to a normal colon. Among the fecal samples, there were no taxa that had significant ORs associated with adenoma and there were 8 taxa with significant ORs associated with carcinoma. Similarly, among the tissue samples, there were no taxa that had a significant OR associated with adenoma and there were 3 taxa with significant ORs associated with carcinoma. Among the significant ORs, the association between individual taxa and tumor diagnosis was equal to or below 7.11. Because individual taxa had limited association with tumor diagnosis, we trained Random Forest classification models using only the taxa that had significant ORs, using the entire collection of taxa found in each study, and using operational taxonomic units defined based on a 97% similarity threshold. All training approaches yielded similar classification success as measured using the area under the curve. The ability to correctly classify individuals with adenomas was poor, and the ability to classify individuals with carcinomas was considerably better using sequences from feces or tissue. American Society for Microbiology 2018-06-05 /pmc/articles/PMC5989068/ /pubmed/29871916 http://dx.doi.org/10.1128/mBio.00630-18 Text en Copyright © 2018 Sze and Schloss. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Sze, Marc A.
Schloss, Patrick D.
Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors
title Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors
title_full Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors
title_fullStr Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors
title_full_unstemmed Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors
title_short Leveraging Existing 16S rRNA Gene Surveys To Identify Reproducible Biomarkers in Individuals with Colorectal Tumors
title_sort leveraging existing 16s rrna gene surveys to identify reproducible biomarkers in individuals with colorectal tumors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5989068/
https://www.ncbi.nlm.nih.gov/pubmed/29871916
http://dx.doi.org/10.1128/mBio.00630-18
work_keys_str_mv AT szemarca leveragingexisting16srrnagenesurveystoidentifyreproduciblebiomarkersinindividualswithcolorectaltumors
AT schlosspatrickd leveragingexisting16srrnagenesurveystoidentifyreproduciblebiomarkersinindividualswithcolorectaltumors