Cargando…

MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis

MOTIVATION: Ribosomal RNA profiling has become crucial to studying microbial communities, but meaningful taxonomic analysis and inter-comparison of such data are still hampered by technical limitations, between-study design variability and inconsistencies between taxonomies used. RESULTS: Here we pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Matias Rodrigues, João F, Schmidt, Thomas S B, Tackmann, Janko, von Mering, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860325/
https://www.ncbi.nlm.nih.gov/pubmed/28961926
http://dx.doi.org/10.1093/bioinformatics/btx517
_version_ 1783307966869405696
author Matias Rodrigues, João F
Schmidt, Thomas S B
Tackmann, Janko
von Mering, Christian
author_facet Matias Rodrigues, João F
Schmidt, Thomas S B
Tackmann, Janko
von Mering, Christian
author_sort Matias Rodrigues, João F
collection PubMed
description MOTIVATION: Ribosomal RNA profiling has become crucial to studying microbial communities, but meaningful taxonomic analysis and inter-comparison of such data are still hampered by technical limitations, between-study design variability and inconsistencies between taxonomies used. RESULTS: Here we present MAPseq, a framework for reference-based rRNA sequence analysis that is up to 30% more accurate (F(½) score) and up to one hundred times faster than existing solutions, providing in a single run multiple taxonomy classifications and hierarchical operational taxonomic unit mappings, for rRNA sequences in both amplicon and shotgun sequencing strategies, and for datasets of virtually any size. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available at https://github.com/jfmrod/mapseq SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860325
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58603252018-03-21 MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis Matias Rodrigues, João F Schmidt, Thomas S B Tackmann, Janko von Mering, Christian Bioinformatics Applications Notes MOTIVATION: Ribosomal RNA profiling has become crucial to studying microbial communities, but meaningful taxonomic analysis and inter-comparison of such data are still hampered by technical limitations, between-study design variability and inconsistencies between taxonomies used. RESULTS: Here we present MAPseq, a framework for reference-based rRNA sequence analysis that is up to 30% more accurate (F(½) score) and up to one hundred times faster than existing solutions, providing in a single run multiple taxonomy classifications and hierarchical operational taxonomic unit mappings, for rRNA sequences in both amplicon and shotgun sequencing strategies, and for datasets of virtually any size. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available at https://github.com/jfmrod/mapseq SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-12-01 2017-08-14 /pmc/articles/PMC5860325/ /pubmed/28961926 http://dx.doi.org/10.1093/bioinformatics/btx517 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Matias Rodrigues, João F
Schmidt, Thomas S B
Tackmann, Janko
von Mering, Christian
MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
title MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
title_full MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
title_fullStr MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
title_full_unstemmed MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
title_short MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
title_sort mapseq: highly efficient k-mer search with confidence estimates, for rrna sequence analysis
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860325/
https://www.ncbi.nlm.nih.gov/pubmed/28961926
http://dx.doi.org/10.1093/bioinformatics/btx517
work_keys_str_mv AT matiasrodriguesjoaof mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis
AT schmidtthomassb mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis
AT tackmannjanko mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis
AT vonmeringchristian mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis