Cargando…

DISCo-microbe: design of an identifiable synthetic community of microbes

BACKGROUND: Microbiomes are extremely important for their host organisms, providing many vital functions and extending their hosts’ phenotypes. Natural studies of host-associated microbiomes can be difficult to interpret due to the high complexity of microbial communities, which hinders our ability...

Descripción completa

Detalles Bibliográficos
Autores principales: Carper, Dana L., Lawrence, Travis J., Carrell, Alyssa A., Pelletier, Dale A., Weston, David J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049465/
https://www.ncbi.nlm.nih.gov/pubmed/32149021
http://dx.doi.org/10.7717/peerj.8534
_version_ 1783502446853619712
author Carper, Dana L.
Lawrence, Travis J.
Carrell, Alyssa A.
Pelletier, Dale A.
Weston, David J.
author_facet Carper, Dana L.
Lawrence, Travis J.
Carrell, Alyssa A.
Pelletier, Dale A.
Weston, David J.
author_sort Carper, Dana L.
collection PubMed
description BACKGROUND: Microbiomes are extremely important for their host organisms, providing many vital functions and extending their hosts’ phenotypes. Natural studies of host-associated microbiomes can be difficult to interpret due to the high complexity of microbial communities, which hinders our ability to track and identify individual members along with the many factors that structure or perturb those communities. For this reason, researchers have turned to synthetic or constructed communities in which the identities of all members are known. However, due to the lack of tracking methods and the difficulty of creating a more diverse and identifiable community that can be distinguished through next-generation sequencing, most such in vivo studies have used only a few strains. RESULTS: To address this issue, we developed DISCo-microbe, a program for the design of an identifiable synthetic community of microbes for use in in vivo experimentation. The program is composed of two modules; (1) create, which allows the user to generate a highly diverse community list from an input DNA sequence alignment using a custom nucleotide distance algorithm, and (2) subsample, which subsamples the community list to either represent a number of grouping variables, including taxonomic proportions, or to reach a user-specified maximum number of community members. As an example, we demonstrate the generation of a synthetic microbial community that can be distinguished through amplicon sequencing. The synthetic microbial community in this example consisted of 2,122 members from a starting DNA sequence alignment of 10,000 16S rRNA sequences from the Ribosomal Database Project. We generated simulated Illumina sequencing data from the constructed community and demonstrate that DISCo-microbe is capable of designing diverse communities with members distinguishable by amplicon sequencing. Using the simulated data we were able to recover sequences from between 97–100% of community members using two different post-processing workflows. Furthermore, 97–99% of sequences were assigned to a community member with zero sequences being misidentified. We then subsampled the community list using taxonomic proportions to mimic a natural plant host–associated microbiome, ultimately yielding a diverse community of 784 members. CONCLUSIONS: DISCo-microbe can create a highly diverse community list of microbes that can be distinguished through 16S rRNA gene sequencing, and has the ability to subsample (i.e., design) the community for the desired number of members and taxonomic proportions. Although developed for bacteria, the program allows for any alignment input from any taxonomic group, making it broadly applicable. The software and data are freely available from GitHub (https://github.com/dlcarper/DISCo-microbe) and Python Package Index (PYPI).
format Online
Article
Text
id pubmed-7049465
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-70494652020-03-06 DISCo-microbe: design of an identifiable synthetic community of microbes Carper, Dana L. Lawrence, Travis J. Carrell, Alyssa A. Pelletier, Dale A. Weston, David J. PeerJ Bioinformatics BACKGROUND: Microbiomes are extremely important for their host organisms, providing many vital functions and extending their hosts’ phenotypes. Natural studies of host-associated microbiomes can be difficult to interpret due to the high complexity of microbial communities, which hinders our ability to track and identify individual members along with the many factors that structure or perturb those communities. For this reason, researchers have turned to synthetic or constructed communities in which the identities of all members are known. However, due to the lack of tracking methods and the difficulty of creating a more diverse and identifiable community that can be distinguished through next-generation sequencing, most such in vivo studies have used only a few strains. RESULTS: To address this issue, we developed DISCo-microbe, a program for the design of an identifiable synthetic community of microbes for use in in vivo experimentation. The program is composed of two modules; (1) create, which allows the user to generate a highly diverse community list from an input DNA sequence alignment using a custom nucleotide distance algorithm, and (2) subsample, which subsamples the community list to either represent a number of grouping variables, including taxonomic proportions, or to reach a user-specified maximum number of community members. As an example, we demonstrate the generation of a synthetic microbial community that can be distinguished through amplicon sequencing. The synthetic microbial community in this example consisted of 2,122 members from a starting DNA sequence alignment of 10,000 16S rRNA sequences from the Ribosomal Database Project. We generated simulated Illumina sequencing data from the constructed community and demonstrate that DISCo-microbe is capable of designing diverse communities with members distinguishable by amplicon sequencing. Using the simulated data we were able to recover sequences from between 97–100% of community members using two different post-processing workflows. Furthermore, 97–99% of sequences were assigned to a community member with zero sequences being misidentified. We then subsampled the community list using taxonomic proportions to mimic a natural plant host–associated microbiome, ultimately yielding a diverse community of 784 members. CONCLUSIONS: DISCo-microbe can create a highly diverse community list of microbes that can be distinguished through 16S rRNA gene sequencing, and has the ability to subsample (i.e., design) the community for the desired number of members and taxonomic proportions. Although developed for bacteria, the program allows for any alignment input from any taxonomic group, making it broadly applicable. The software and data are freely available from GitHub (https://github.com/dlcarper/DISCo-microbe) and Python Package Index (PYPI). PeerJ Inc. 2020-02-27 /pmc/articles/PMC7049465/ /pubmed/32149021 http://dx.doi.org/10.7717/peerj.8534 Text en ©2020 Carper et al. https://creativecommons.org/licenses/by-nc/4.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by-nc/4.0) , which permits using, remixing, and building upon the work non-commercially, as long as it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Carper, Dana L.
Lawrence, Travis J.
Carrell, Alyssa A.
Pelletier, Dale A.
Weston, David J.
DISCo-microbe: design of an identifiable synthetic community of microbes
title DISCo-microbe: design of an identifiable synthetic community of microbes
title_full DISCo-microbe: design of an identifiable synthetic community of microbes
title_fullStr DISCo-microbe: design of an identifiable synthetic community of microbes
title_full_unstemmed DISCo-microbe: design of an identifiable synthetic community of microbes
title_short DISCo-microbe: design of an identifiable synthetic community of microbes
title_sort disco-microbe: design of an identifiable synthetic community of microbes
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049465/
https://www.ncbi.nlm.nih.gov/pubmed/32149021
http://dx.doi.org/10.7717/peerj.8534
work_keys_str_mv AT carperdanal discomicrobedesignofanidentifiablesyntheticcommunityofmicrobes
AT lawrencetravisj discomicrobedesignofanidentifiablesyntheticcommunityofmicrobes
AT carrellalyssaa discomicrobedesignofanidentifiablesyntheticcommunityofmicrobes
AT pelletierdalea discomicrobedesignofanidentifiablesyntheticcommunityofmicrobes
AT westondavidj discomicrobedesignofanidentifiablesyntheticcommunityofmicrobes