Cargando…

Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data

Metagenomic sequencing is a powerful tool for examining the diversity and complexity of microbial communities. Most widely used tools for taxonomic profiling of metagenomic sequence data allow for a species-level overview of the composition. However, individual strains within a species can differ gr...

Descripción completa

Detalles Bibliográficos
Autores principales: Anyansi, Christine, Straub, Timothy J., Manson, Abigail L., Earl, Ashlee M., Abeel, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507117/
https://www.ncbi.nlm.nih.gov/pubmed/33013732
http://dx.doi.org/10.3389/fmicb.2020.01925
_version_ 1783585165689225216
author Anyansi, Christine
Straub, Timothy J.
Manson, Abigail L.
Earl, Ashlee M.
Abeel, Thomas
author_facet Anyansi, Christine
Straub, Timothy J.
Manson, Abigail L.
Earl, Ashlee M.
Abeel, Thomas
author_sort Anyansi, Christine
collection PubMed
description Metagenomic sequencing is a powerful tool for examining the diversity and complexity of microbial communities. Most widely used tools for taxonomic profiling of metagenomic sequence data allow for a species-level overview of the composition. However, individual strains within a species can differ greatly in key genotypic and phenotypic characteristics, such as drug resistance, virulence and growth rate. Therefore, the ability to resolve microbial communities down to the level of individual strains within a species is critical to interpreting metagenomic data for clinical and environmental applications, where identifying a particular strain, or tracking a particular strain across a set of samples, can help aid in clinical diagnosis and treatment, or in characterizing yet unstudied strains across novel environmental locations. Recently published approaches have begun to tackle the problem of resolving strains within a particular species in metagenomic samples. In this review, we present an overview of these new algorithms and their uses, including methods based on assembly reconstruction and methods operating with or without a reference database. While existing metagenomic analysis methods show reasonable performance at the species and higher taxonomic levels, identifying closely related strains within a species presents a bigger challenge, due to the diversity of databases, genetic relatedness, and goals when conducting these analyses. Selection of which metagenomic tool to employ for a specific application should be performed on a case-by case basis as these tools have strengths and weaknesses that affect their performance on specific tasks. A comprehensive benchmark across different use case scenarios is vital to validate performance of these tools on microbial samples. Because strain-level metagenomic analysis is still in its infancy, development of more fine-grained, high-resolution algorithms will continue to be in demand for the future.
format Online
Article
Text
id pubmed-7507117
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-75071172020-10-02 Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data Anyansi, Christine Straub, Timothy J. Manson, Abigail L. Earl, Ashlee M. Abeel, Thomas Front Microbiol Microbiology Metagenomic sequencing is a powerful tool for examining the diversity and complexity of microbial communities. Most widely used tools for taxonomic profiling of metagenomic sequence data allow for a species-level overview of the composition. However, individual strains within a species can differ greatly in key genotypic and phenotypic characteristics, such as drug resistance, virulence and growth rate. Therefore, the ability to resolve microbial communities down to the level of individual strains within a species is critical to interpreting metagenomic data for clinical and environmental applications, where identifying a particular strain, or tracking a particular strain across a set of samples, can help aid in clinical diagnosis and treatment, or in characterizing yet unstudied strains across novel environmental locations. Recently published approaches have begun to tackle the problem of resolving strains within a particular species in metagenomic samples. In this review, we present an overview of these new algorithms and their uses, including methods based on assembly reconstruction and methods operating with or without a reference database. While existing metagenomic analysis methods show reasonable performance at the species and higher taxonomic levels, identifying closely related strains within a species presents a bigger challenge, due to the diversity of databases, genetic relatedness, and goals when conducting these analyses. Selection of which metagenomic tool to employ for a specific application should be performed on a case-by case basis as these tools have strengths and weaknesses that affect their performance on specific tasks. A comprehensive benchmark across different use case scenarios is vital to validate performance of these tools on microbial samples. Because strain-level metagenomic analysis is still in its infancy, development of more fine-grained, high-resolution algorithms will continue to be in demand for the future. Frontiers Media S.A. 2020-08-18 /pmc/articles/PMC7507117/ /pubmed/33013732 http://dx.doi.org/10.3389/fmicb.2020.01925 Text en Copyright © 2020 Anyansi, Straub, Manson, Earl and Abeel. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Anyansi, Christine
Straub, Timothy J.
Manson, Abigail L.
Earl, Ashlee M.
Abeel, Thomas
Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data
title Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data
title_full Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data
title_fullStr Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data
title_full_unstemmed Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data
title_short Computational Methods for Strain-Level Microbial Detection in Colony and Metagenome Sequencing Data
title_sort computational methods for strain-level microbial detection in colony and metagenome sequencing data
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507117/
https://www.ncbi.nlm.nih.gov/pubmed/33013732
http://dx.doi.org/10.3389/fmicb.2020.01925
work_keys_str_mv AT anyansichristine computationalmethodsforstrainlevelmicrobialdetectionincolonyandmetagenomesequencingdata
AT straubtimothyj computationalmethodsforstrainlevelmicrobialdetectionincolonyandmetagenomesequencingdata
AT mansonabigaill computationalmethodsforstrainlevelmicrobialdetectionincolonyandmetagenomesequencingdata
AT earlashleem computationalmethodsforstrainlevelmicrobialdetectionincolonyandmetagenomesequencingdata
AT abeelthomas computationalmethodsforstrainlevelmicrobialdetectionincolonyandmetagenomesequencingdata