Cargando…

RTK: efficient rarefaction analysis of large datasets

MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate f...

Descripción completa

Detalles Bibliográficos
Autores principales: Saary, Paul, Forslund, Kristoffer, Bork, Peer, Hildebrand, Falk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870771/
https://www.ncbi.nlm.nih.gov/pubmed/28398468
http://dx.doi.org/10.1093/bioinformatics/btx206
_version_ 1783309546824925184
author Saary, Paul
Forslund, Kristoffer
Bork, Peer
Hildebrand, Falk
author_facet Saary, Paul
Forslund, Kristoffer
Bork, Peer
Hildebrand, Falk
author_sort Saary, Paul
collection PubMed
description MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. RESULTS: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. AVAILABILITY AND IMPLEMENTATION: C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5870771
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58707712018-03-29 RTK: efficient rarefaction analysis of large datasets Saary, Paul Forslund, Kristoffer Bork, Peer Hildebrand, Falk Bioinformatics Applications Notes MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. RESULTS: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. AVAILABILITY AND IMPLEMENTATION: C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-08-15 2017-04-07 /pmc/articles/PMC5870771/ /pubmed/28398468 http://dx.doi.org/10.1093/bioinformatics/btx206 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Saary, Paul
Forslund, Kristoffer
Bork, Peer
Hildebrand, Falk
RTK: efficient rarefaction analysis of large datasets
title RTK: efficient rarefaction analysis of large datasets
title_full RTK: efficient rarefaction analysis of large datasets
title_fullStr RTK: efficient rarefaction analysis of large datasets
title_full_unstemmed RTK: efficient rarefaction analysis of large datasets
title_short RTK: efficient rarefaction analysis of large datasets
title_sort rtk: efficient rarefaction analysis of large datasets
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870771/
https://www.ncbi.nlm.nih.gov/pubmed/28398468
http://dx.doi.org/10.1093/bioinformatics/btx206
work_keys_str_mv AT saarypaul rtkefficientrarefactionanalysisoflargedatasets
AT forslundkristoffer rtkefficientrarefactionanalysisoflargedatasets
AT borkpeer rtkefficientrarefactionanalysisoflargedatasets
AT hildebrandfalk rtkefficientrarefactionanalysisoflargedatasets