Cargando…

Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics

One of main steps in a study of microbial communities is resolving their composition, diversity and function. In the past, these issues were mostly addressed by the use of amplicon sequencing of a target gene because of reasonable price and easier computational postprocessing of the bioinformatic da...

Descripción completa

Detalles Bibliográficos
Autores principales: Sedlar, Karel, Kupkova, Kristyna, Provaznik, Ivo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5148923/
https://www.ncbi.nlm.nih.gov/pubmed/27980708
http://dx.doi.org/10.1016/j.csbj.2016.11.005
_version_ 1782473911113351168
author Sedlar, Karel
Kupkova, Kristyna
Provaznik, Ivo
author_facet Sedlar, Karel
Kupkova, Kristyna
Provaznik, Ivo
author_sort Sedlar, Karel
collection PubMed
description One of main steps in a study of microbial communities is resolving their composition, diversity and function. In the past, these issues were mostly addressed by the use of amplicon sequencing of a target gene because of reasonable price and easier computational postprocessing of the bioinformatic data. With the advancement of sequencing techniques, the main focus shifted to the whole metagenome shotgun sequencing, which allows much more detailed analysis of the metagenomic data, including reconstruction of novel microbial genomes and to gain knowledge about genetic potential and metabolic capacities of whole environments. On the other hand, the output of whole metagenomic shotgun sequencing is mixture of short DNA fragments belonging to various genomes, therefore this approach requires more sophisticated computational algorithms for clustering of related sequences, commonly referred to as sequence binning. There are currently two types of binning methods: taxonomy dependent and taxonomy independent. The first type classifies the DNA fragments by performing a standard homology inference against a reference database, while the latter performs the reference-free binning by applying clustering techniques on features extracted from the sequences. In this review, we describe the strategies within the second approach. Although these strategies do not require prior knowledge, they have higher demands on the length of sequences. Besides their basic principle, an overview of particular methods and tools is provided. Furthermore, the review covers the utilization of the methods in context with the length of sequences and discusses the needs for metagenomic data preprocessing in form of initial assembly prior to binning.
format Online
Article
Text
id pubmed-5148923
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-51489232016-12-15 Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics Sedlar, Karel Kupkova, Kristyna Provaznik, Ivo Comput Struct Biotechnol J Mini Review One of main steps in a study of microbial communities is resolving their composition, diversity and function. In the past, these issues were mostly addressed by the use of amplicon sequencing of a target gene because of reasonable price and easier computational postprocessing of the bioinformatic data. With the advancement of sequencing techniques, the main focus shifted to the whole metagenome shotgun sequencing, which allows much more detailed analysis of the metagenomic data, including reconstruction of novel microbial genomes and to gain knowledge about genetic potential and metabolic capacities of whole environments. On the other hand, the output of whole metagenomic shotgun sequencing is mixture of short DNA fragments belonging to various genomes, therefore this approach requires more sophisticated computational algorithms for clustering of related sequences, commonly referred to as sequence binning. There are currently two types of binning methods: taxonomy dependent and taxonomy independent. The first type classifies the DNA fragments by performing a standard homology inference against a reference database, while the latter performs the reference-free binning by applying clustering techniques on features extracted from the sequences. In this review, we describe the strategies within the second approach. Although these strategies do not require prior knowledge, they have higher demands on the length of sequences. Besides their basic principle, an overview of particular methods and tools is provided. Furthermore, the review covers the utilization of the methods in context with the length of sequences and discusses the needs for metagenomic data preprocessing in form of initial assembly prior to binning. Research Network of Computational and Structural Biotechnology 2016-12-05 /pmc/articles/PMC5148923/ /pubmed/27980708 http://dx.doi.org/10.1016/j.csbj.2016.11.005 Text en © 2016 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Mini Review
Sedlar, Karel
Kupkova, Kristyna
Provaznik, Ivo
Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
title Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
title_full Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
title_fullStr Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
title_full_unstemmed Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
title_short Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
title_sort bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics
topic Mini Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5148923/
https://www.ncbi.nlm.nih.gov/pubmed/27980708
http://dx.doi.org/10.1016/j.csbj.2016.11.005
work_keys_str_mv AT sedlarkarel bioinformaticsstrategiesfortaxonomyindependentbinningandvisualizationofsequencesinshotgunmetagenomics
AT kupkovakristyna bioinformaticsstrategiesfortaxonomyindependentbinningandvisualizationofsequencesinshotgunmetagenomics
AT provaznikivo bioinformaticsstrategiesfortaxonomyindependentbinningandvisualizationofsequencesinshotgunmetagenomics