Cargando…

Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase

UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another (beta diversity). Striped UniFrac recently added the ability to split the problem into many independent subproblems, exhibiting nearly linear scaling but suffering from...

Descripción completa

Detalles Bibliográficos
Autores principales: Sfiligoi, Igor, Armstrong, George, Gonzalez, Antonio, McDonald, Daniel, Knight, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239203/
https://www.ncbi.nlm.nih.gov/pubmed/35638356
http://dx.doi.org/10.1128/msystems.00028-22
_version_ 1784737242777911296
author Sfiligoi, Igor
Armstrong, George
Gonzalez, Antonio
McDonald, Daniel
Knight, Rob
author_facet Sfiligoi, Igor
Armstrong, George
Gonzalez, Antonio
McDonald, Daniel
Knight, Rob
author_sort Sfiligoi, Igor
collection PubMed
description UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another (beta diversity). Striped UniFrac recently added the ability to split the problem into many independent subproblems, exhibiting nearly linear scaling but suffering from memory contention. Here, we adapt UniFrac to graphics processing units using OpenACC, enabling greater than 1,000× computational improvement, and apply it to 307,237 samples, the largest 16S rRNA V4 uniformly preprocessed microbiome data set analyzed to date. IMPORTANCE UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another. Here, we adapt UniFrac to operate on graphics processing units, enabling a 1,000× computational improvement. To highlight this advance, we perform what may be the largest microbiome analysis to date, applying UniFrac to 307,237 16S rRNA V4 microbiome samples preprocessed with Deblur. These scaling improvements turn UniFrac into a real-time tool for common data sets and unlock new research questions as more microbiome data are collected.
format Online
Article
Text
id pubmed-9239203
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-92392032022-06-29 Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase Sfiligoi, Igor Armstrong, George Gonzalez, Antonio McDonald, Daniel Knight, Rob mSystems Observation UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another (beta diversity). Striped UniFrac recently added the ability to split the problem into many independent subproblems, exhibiting nearly linear scaling but suffering from memory contention. Here, we adapt UniFrac to graphics processing units using OpenACC, enabling greater than 1,000× computational improvement, and apply it to 307,237 samples, the largest 16S rRNA V4 uniformly preprocessed microbiome data set analyzed to date. IMPORTANCE UniFrac is an important tool in microbiome research that is used for phylogenetically comparing microbiome profiles to one another. Here, we adapt UniFrac to operate on graphics processing units, enabling a 1,000× computational improvement. To highlight this advance, we perform what may be the largest microbiome analysis to date, applying UniFrac to 307,237 16S rRNA V4 microbiome samples preprocessed with Deblur. These scaling improvements turn UniFrac into a real-time tool for common data sets and unlock new research questions as more microbiome data are collected. American Society for Microbiology 2022-05-31 /pmc/articles/PMC9239203/ /pubmed/35638356 http://dx.doi.org/10.1128/msystems.00028-22 Text en Copyright © 2022 Sfiligoi et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Observation
Sfiligoi, Igor
Armstrong, George
Gonzalez, Antonio
McDonald, Daniel
Knight, Rob
Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase
title Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase
title_full Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase
title_fullStr Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase
title_full_unstemmed Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase
title_short Optimizing UniFrac with OpenACC Yields Greater Than One Thousand Times Speed Increase
title_sort optimizing unifrac with openacc yields greater than one thousand times speed increase
topic Observation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9239203/
https://www.ncbi.nlm.nih.gov/pubmed/35638356
http://dx.doi.org/10.1128/msystems.00028-22
work_keys_str_mv AT sfiligoiigor optimizingunifracwithopenaccyieldsgreaterthanonethousandtimesspeedincrease
AT armstronggeorge optimizingunifracwithopenaccyieldsgreaterthanonethousandtimesspeedincrease
AT gonzalezantonio optimizingunifracwithopenaccyieldsgreaterthanonethousandtimesspeedincrease
AT mcdonalddaniel optimizingunifracwithopenaccyieldsgreaterthanonethousandtimesspeedincrease
AT knightrob optimizingunifracwithopenaccyieldsgreaterthanonethousandtimesspeedincrease