Cargando…
MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis
MOTIVATION: Ribosomal RNA profiling has become crucial to studying microbial communities, but meaningful taxonomic analysis and inter-comparison of such data are still hampered by technical limitations, between-study design variability and inconsistencies between taxonomies used. RESULTS: Here we pr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860325/ https://www.ncbi.nlm.nih.gov/pubmed/28961926 http://dx.doi.org/10.1093/bioinformatics/btx517 |
_version_ | 1783307966869405696 |
---|---|
author | Matias Rodrigues, João F Schmidt, Thomas S B Tackmann, Janko von Mering, Christian |
author_facet | Matias Rodrigues, João F Schmidt, Thomas S B Tackmann, Janko von Mering, Christian |
author_sort | Matias Rodrigues, João F |
collection | PubMed |
description | MOTIVATION: Ribosomal RNA profiling has become crucial to studying microbial communities, but meaningful taxonomic analysis and inter-comparison of such data are still hampered by technical limitations, between-study design variability and inconsistencies between taxonomies used. RESULTS: Here we present MAPseq, a framework for reference-based rRNA sequence analysis that is up to 30% more accurate (F(½) score) and up to one hundred times faster than existing solutions, providing in a single run multiple taxonomy classifications and hierarchical operational taxonomic unit mappings, for rRNA sequences in both amplicon and shotgun sequencing strategies, and for datasets of virtually any size. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available at https://github.com/jfmrod/mapseq SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5860325 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58603252018-03-21 MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis Matias Rodrigues, João F Schmidt, Thomas S B Tackmann, Janko von Mering, Christian Bioinformatics Applications Notes MOTIVATION: Ribosomal RNA profiling has become crucial to studying microbial communities, but meaningful taxonomic analysis and inter-comparison of such data are still hampered by technical limitations, between-study design variability and inconsistencies between taxonomies used. RESULTS: Here we present MAPseq, a framework for reference-based rRNA sequence analysis that is up to 30% more accurate (F(½) score) and up to one hundred times faster than existing solutions, providing in a single run multiple taxonomy classifications and hierarchical operational taxonomic unit mappings, for rRNA sequences in both amplicon and shotgun sequencing strategies, and for datasets of virtually any size. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available at https://github.com/jfmrod/mapseq SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-12-01 2017-08-14 /pmc/articles/PMC5860325/ /pubmed/28961926 http://dx.doi.org/10.1093/bioinformatics/btx517 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Matias Rodrigues, João F Schmidt, Thomas S B Tackmann, Janko von Mering, Christian MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis |
title | MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis |
title_full | MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis |
title_fullStr | MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis |
title_full_unstemmed | MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis |
title_short | MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis |
title_sort | mapseq: highly efficient k-mer search with confidence estimates, for rrna sequence analysis |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860325/ https://www.ncbi.nlm.nih.gov/pubmed/28961926 http://dx.doi.org/10.1093/bioinformatics/btx517 |
work_keys_str_mv | AT matiasrodriguesjoaof mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis AT schmidtthomassb mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis AT tackmannjanko mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis AT vonmeringchristian mapseqhighlyefficientkmersearchwithconfidenceestimatesforrrnasequenceanalysis |