Cargando…

Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings

Next-generation sequencing (NGS) technologies have revolutionized biological research by generating genomic data that were once unaffordable by traditional first-generation sequencing technologies. These sequencing methodologies provide an opportunity for in-depth analyses of host and pathogen genom...

Descripción completa

Detalles Bibliográficos
Autores principales: Mabvakure, Batsirai M, Rott, Raymond, Dobrowsky, Leslie, Van Heusden, Peter, Morris, Lynn, Scheepers, Cathrine, Moore, Penny L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8842485/
https://www.ncbi.nlm.nih.gov/pubmed/35173421
http://dx.doi.org/10.1177/1177932219882347
_version_ 1784651060157087744
author Mabvakure, Batsirai M
Rott, Raymond
Dobrowsky, Leslie
Van Heusden, Peter
Morris, Lynn
Scheepers, Cathrine
Moore, Penny L
author_facet Mabvakure, Batsirai M
Rott, Raymond
Dobrowsky, Leslie
Van Heusden, Peter
Morris, Lynn
Scheepers, Cathrine
Moore, Penny L
author_sort Mabvakure, Batsirai M
collection PubMed
description Next-generation sequencing (NGS) technologies have revolutionized biological research by generating genomic data that were once unaffordable by traditional first-generation sequencing technologies. These sequencing methodologies provide an opportunity for in-depth analyses of host and pathogen genomes as they are able to sequence millions of templates at a time. However, these large datasets can only be efficiently explored using bioinformatics analyses requiring huge data storage and computational resources adapted for high-performance processing. High-performance computing allows for efficient handling of large data and tasks that may require multi-threading and prolonged computational times, which is not feasible with ordinary computers. However, high-performance computing resources are costly and therefore not always readily available in low-income settings. We describe the establishment of an affordable high-performance computing bioinformatics cluster consisting of 3 nodes, constructed using ordinary desktop computers and open-source software including Linux Fedora, SLURM Workload Manager, and the Conda package manager. For the analysis of large antibody sequence datasets and for complex viral phylodynamic analyses, the cluster out-performed desktop computers. This has demonstrated that it is possible to construct high-performance computing capacity capable of analyzing large NGS data from relatively low-cost hardware and entirely free (open-source) software, even in resource-limited settings. Such a cluster design has broad utility beyond bioinformatics to other studies that require high-performance computing.
format Online
Article
Text
id pubmed-8842485
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-88424852022-02-15 Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings Mabvakure, Batsirai M Rott, Raymond Dobrowsky, Leslie Van Heusden, Peter Morris, Lynn Scheepers, Cathrine Moore, Penny L Bioinform Biol Insights Original Research Next-generation sequencing (NGS) technologies have revolutionized biological research by generating genomic data that were once unaffordable by traditional first-generation sequencing technologies. These sequencing methodologies provide an opportunity for in-depth analyses of host and pathogen genomes as they are able to sequence millions of templates at a time. However, these large datasets can only be efficiently explored using bioinformatics analyses requiring huge data storage and computational resources adapted for high-performance processing. High-performance computing allows for efficient handling of large data and tasks that may require multi-threading and prolonged computational times, which is not feasible with ordinary computers. However, high-performance computing resources are costly and therefore not always readily available in low-income settings. We describe the establishment of an affordable high-performance computing bioinformatics cluster consisting of 3 nodes, constructed using ordinary desktop computers and open-source software including Linux Fedora, SLURM Workload Manager, and the Conda package manager. For the analysis of large antibody sequence datasets and for complex viral phylodynamic analyses, the cluster out-performed desktop computers. This has demonstrated that it is possible to construct high-performance computing capacity capable of analyzing large NGS data from relatively low-cost hardware and entirely free (open-source) software, even in resource-limited settings. Such a cluster design has broad utility beyond bioinformatics to other studies that require high-performance computing. SAGE Publications 2019-11-07 /pmc/articles/PMC8842485/ /pubmed/35173421 http://dx.doi.org/10.1177/1177932219882347 Text en © The Author(s) 2019 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Mabvakure, Batsirai M
Rott, Raymond
Dobrowsky, Leslie
Van Heusden, Peter
Morris, Lynn
Scheepers, Cathrine
Moore, Penny L
Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings
title Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings
title_full Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings
title_fullStr Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings
title_full_unstemmed Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings
title_short Advancing HIV Vaccine Research With Low-Cost High-Performance Computing Infrastructure: An Alternative Approach for Resource-Limited Settings
title_sort advancing hiv vaccine research with low-cost high-performance computing infrastructure: an alternative approach for resource-limited settings
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8842485/
https://www.ncbi.nlm.nih.gov/pubmed/35173421
http://dx.doi.org/10.1177/1177932219882347
work_keys_str_mv AT mabvakurebatsiraim advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings
AT rottraymond advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings
AT dobrowskyleslie advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings
AT vanheusdenpeter advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings
AT morrislynn advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings
AT scheeperscathrine advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings
AT moorepennyl advancinghivvaccineresearchwithlowcosthighperformancecomputinginfrastructureanalternativeapproachforresourcelimitedsettings