Cargando…

RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets

BACKGROUND: Fuelled by the advent and subsequent development of next generation sequencing technologies, metagenomics became a powerful tool for the analysis of microbial communities both scientifically and diagnostically. The biggest challenge is the extraction of relevant information from the huge...

Descripción completa

Detalles Bibliográficos
Autores principales: Scheuch, Matthias, Höper, Dirk, Beer, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4351923/
https://www.ncbi.nlm.nih.gov/pubmed/25886935
http://dx.doi.org/10.1186/s12859-015-0503-6
_version_ 1782360377964625920
author Scheuch, Matthias
Höper, Dirk
Beer, Martin
author_facet Scheuch, Matthias
Höper, Dirk
Beer, Martin
author_sort Scheuch, Matthias
collection PubMed
description BACKGROUND: Fuelled by the advent and subsequent development of next generation sequencing technologies, metagenomics became a powerful tool for the analysis of microbial communities both scientifically and diagnostically. The biggest challenge is the extraction of relevant information from the huge sequence datasets generated for metagenomics studies. Although a plethora of tools are available, data analysis is still a bottleneck. RESULTS: To overcome the bottleneck of data analysis, we developed an automated computational workflow called RIEMS – Reliable Information Extraction from Metagenomic Sequence datasets. RIEMS assigns every individual read sequence within a dataset taxonomically by cascading different sequence analyses with decreasing stringency of the assignments using various software applications. After completion of the analyses, the results are summarised in a clearly structured result protocol organised taxonomically. The high accuracy and performance of RIEMS analyses were proven in comparison with other tools for metagenomics data analysis using simulated sequencing read datasets. CONCLUSIONS: RIEMS has the potential to fill the gap that still exists with regard to data analysis for metagenomics studies. The usefulness and power of RIEMS for the analysis of genuine sequencing datasets was demonstrated with an early version of RIEMS in 2011 when it was used to detect the orthobunyavirus sequences leading to the discovery of Schmallenberg virus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0503-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4351923
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43519232015-03-07 RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets Scheuch, Matthias Höper, Dirk Beer, Martin BMC Bioinformatics Methodology Article BACKGROUND: Fuelled by the advent and subsequent development of next generation sequencing technologies, metagenomics became a powerful tool for the analysis of microbial communities both scientifically and diagnostically. The biggest challenge is the extraction of relevant information from the huge sequence datasets generated for metagenomics studies. Although a plethora of tools are available, data analysis is still a bottleneck. RESULTS: To overcome the bottleneck of data analysis, we developed an automated computational workflow called RIEMS – Reliable Information Extraction from Metagenomic Sequence datasets. RIEMS assigns every individual read sequence within a dataset taxonomically by cascading different sequence analyses with decreasing stringency of the assignments using various software applications. After completion of the analyses, the results are summarised in a clearly structured result protocol organised taxonomically. The high accuracy and performance of RIEMS analyses were proven in comparison with other tools for metagenomics data analysis using simulated sequencing read datasets. CONCLUSIONS: RIEMS has the potential to fill the gap that still exists with regard to data analysis for metagenomics studies. The usefulness and power of RIEMS for the analysis of genuine sequencing datasets was demonstrated with an early version of RIEMS in 2011 when it was used to detect the orthobunyavirus sequences leading to the discovery of Schmallenberg virus. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0503-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-03-03 /pmc/articles/PMC4351923/ /pubmed/25886935 http://dx.doi.org/10.1186/s12859-015-0503-6 Text en © Scheuch et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Scheuch, Matthias
Höper, Dirk
Beer, Martin
RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
title RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
title_full RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
title_fullStr RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
title_full_unstemmed RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
title_short RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
title_sort riems: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4351923/
https://www.ncbi.nlm.nih.gov/pubmed/25886935
http://dx.doi.org/10.1186/s12859-015-0503-6
work_keys_str_mv AT scheuchmatthias riemsasoftwarepipelineforsensitiveandcomprehensivetaxonomicclassificationofreadsfrommetagenomicsdatasets
AT hoperdirk riemsasoftwarepipelineforsensitiveandcomprehensivetaxonomicclassificationofreadsfrommetagenomicsdatasets
AT beermartin riemsasoftwarepipelineforsensitiveandcomprehensivetaxonomicclassificationofreadsfrommetagenomicsdatasets