Cargando…

Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison

BACKGROUND: Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifyi...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Fang, Chia, Nicholas, White, Bryan A, Schook, Lawrence B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660234/
https://www.ncbi.nlm.nih.gov/pubmed/23617892
http://dx.doi.org/10.1186/1471-2105-14-136
_version_ 1782270522955923456
author Yang, Fang
Chia, Nicholas
White, Bryan A
Schook, Lawrence B
author_facet Yang, Fang
Chia, Nicholas
White, Bryan A
Schook, Lawrence B
author_sort Yang, Fang
collection PubMed
description BACKGROUND: Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. RESULTS: We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CONCLUSION: CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets.
format Online
Article
Text
id pubmed-3660234
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36602342013-05-23 Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison Yang, Fang Chia, Nicholas White, Bryan A Schook, Lawrence B BMC Bioinformatics Research Article BACKGROUND: Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. RESULTS: We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CONCLUSION: CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. BioMed Central 2013-04-23 /pmc/articles/PMC3660234/ /pubmed/23617892 http://dx.doi.org/10.1186/1471-2105-14-136 Text en Copyright © 2013 Yang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yang, Fang
Chia, Nicholas
White, Bryan A
Schook, Lawrence B
Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
title Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
title_full Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
title_fullStr Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
title_full_unstemmed Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
title_short Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
title_sort compression-based distance (cbd): a simple, rapid, and accurate method for microbiota composition comparison
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660234/
https://www.ncbi.nlm.nih.gov/pubmed/23617892
http://dx.doi.org/10.1186/1471-2105-14-136
work_keys_str_mv AT yangfang compressionbaseddistancecbdasimplerapidandaccuratemethodformicrobiotacompositioncomparison
AT chianicholas compressionbaseddistancecbdasimplerapidandaccuratemethodformicrobiotacompositioncomparison
AT whitebryana compressionbaseddistancecbdasimplerapidandaccuratemethodformicrobiotacompositioncomparison
AT schooklawrenceb compressionbaseddistancecbdasimplerapidandaccuratemethodformicrobiotacompositioncomparison