Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison
BACKGROUND: Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifyi...
Main authors: | Yang, Fang; Chia, Nicholas; White, Bryan A; Schook, Lawrence B |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2013 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660234/ https://www.ncbi.nlm.nih.gov/pubmed/23617892 http://dx.doi.org/10.1186/1471-2105-14-136 |
_version_ | 1782270522955923456 |
---|---|
author | Yang, Fang; Chia, Nicholas; White, Bryan A; Schook, Lawrence B |
author_sort | Yang, Fang |
collection | PubMed |
description | BACKGROUND: Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. RESULTS: We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CONCLUSION: CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. |
format | Online Article Text |
id | pubmed-3660234 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3660234 2013-05-23 Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison Yang, Fang Chia, Nicholas White, Bryan A Schook, Lawrence B BMC Bioinformatics Research Article BACKGROUND: Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. RESULTS: We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CONCLUSION: CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. BioMed Central 2013-04-23 /pmc/articles/PMC3660234/ /pubmed/23617892 http://dx.doi.org/10.1186/1471-2105-14-136 Text en Copyright © 2013 Yang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660234/ https://www.ncbi.nlm.nih.gov/pubmed/23617892 http://dx.doi.org/10.1186/1471-2105-14-136 |
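
The description field says CBD uses well-established compression algorithms to approximate the information shared between two tag datasets, but this record does not give the formula. As a rough illustration only, the sketch below uses the normalized compression distance (NCD), a standard compression-based measure, as a stand-in; it is not confirmed to be the authors' CBD definition. The choice of `zlib`, the helper names, and the synthetic reads are all assumptions made for the example.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of the input after zlib compression at level 9."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance (a stand-in, not the paper's formula).

    Datasets that share many sequences compress well when concatenated,
    so the value is small; unrelated datasets give values near 1.
    """
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def make_pool(rng: random.Random, n_reads: int, read_len: int) -> list:
    """Hypothetical 'community': a pool of random DNA reads."""
    return ["".join(rng.choice("ACGT") for _ in range(read_len))
            for _ in range(n_reads)]


def make_sample(rng: random.Random, pool: list, n_draws: int) -> bytes:
    """Hypothetical tag dataset: reads drawn with replacement from a pool."""
    return "\n".join(rng.choice(pool) for _ in range(n_draws)).encode("ascii")


rng = random.Random(0)            # fixed seed so the toy data is repeatable
pool_1 = make_pool(rng, 100, 50)  # synthetic community 1
pool_2 = make_pool(rng, 100, 50)  # synthetic community 2, unrelated reads

sample_a = make_sample(rng, pool_1, 200)  # two samples from community 1
sample_b = make_sample(rng, pool_1, 200)
sample_c = make_sample(rng, pool_2, 200)  # one sample from community 2

same_community = ncd(sample_a, sample_b)
different_community = ncd(sample_a, sample_c)
```

In this toy setup, `same_community` should come out smaller than `different_community`, because the two samples drawn from one pool share most of their reads. Samples are kept small enough that the concatenated input fits inside zlib's 32 KB window; larger datasets would call for a compressor with a longer context.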