Cargando…
RTK: efficient rarefaction analysis of large datasets
MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate f...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870771/ https://www.ncbi.nlm.nih.gov/pubmed/28398468 http://dx.doi.org/10.1093/bioinformatics/btx206 |
_version_ | 1783309546824925184 |
---|---|
author | Saary, Paul Forslund, Kristoffer Bork, Peer Hildebrand, Falk |
author_facet | Saary, Paul Forslund, Kristoffer Bork, Peer Hildebrand, Falk |
author_sort | Saary, Paul |
collection | PubMed |
description | MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. RESULTS: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. AVAILABILITY AND IMPLEMENTATION: C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5870771 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58707712018-03-29 RTK: efficient rarefaction analysis of large datasets Saary, Paul Forslund, Kristoffer Bork, Peer Hildebrand, Falk Bioinformatics Applications Notes MOTIVATION: The rapidly expanding microbiomics field is generating increasingly larger datasets, characterizing the microbiota in diverse environments. Although classical numerical ecology methods provide a robust statistical framework for their analysis, software currently available is inadequate for large datasets and some computationally intensive tasks, like rarefaction and associated analysis. RESULTS: Here we present a software package for rarefaction analysis of large count matrices, as well as estimation and visualization of diversity, richness and evenness. Our software is designed for ease of use, operating at least 7x faster than existing solutions, despite requiring 10x less memory. AVAILABILITY AND IMPLEMENTATION: C ++ and R source code (GPL v.2) as well as binaries are available from https://github.com/hildebra/Rarefaction and from CRAN (https://cran.r-project.org/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-08-15 2017-04-07 /pmc/articles/PMC5870771/ /pubmed/28398468 http://dx.doi.org/10.1093/bioinformatics/btx206 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Saary, Paul Forslund, Kristoffer Bork, Peer Hildebrand, Falk RTK: efficient rarefaction analysis of large datasets |
title | RTK: efficient rarefaction analysis of large datasets |
title_full | RTK: efficient rarefaction analysis of large datasets |
title_fullStr | RTK: efficient rarefaction analysis of large datasets |
title_full_unstemmed | RTK: efficient rarefaction analysis of large datasets |
title_short | RTK: efficient rarefaction analysis of large datasets |
title_sort | rtk: efficient rarefaction analysis of large datasets |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870771/ https://www.ncbi.nlm.nih.gov/pubmed/28398468 http://dx.doi.org/10.1093/bioinformatics/btx206 |
work_keys_str_mv | AT saarypaul rtkefficientrarefactionanalysisoflargedatasets AT forslundkristoffer rtkefficientrarefactionanalysisoflargedatasets AT borkpeer rtkefficientrarefactionanalysisoflargedatasets AT hildebrandfalk rtkefficientrarefactionanalysisoflargedatasets |