Cargando…

GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics

BACKGROUND: A large number of sensitive homology searches are required for mapping DNA sequence fragments to known protein sequences in public and private databases during metagenomic analysis. BLAST is currently used for this purpose, but its calculation speed is insufficient, especially for analyz...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Shuji, Ishida, Takashi, Kurokawa, Ken, Akiyama, Yutaka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3344842/
https://www.ncbi.nlm.nih.gov/pubmed/22574135
http://dx.doi.org/10.1371/journal.pone.0036060
_version_ 1782232082873843712
author Suzuki, Shuji
Ishida, Takashi
Kurokawa, Ken
Akiyama, Yutaka
author_facet Suzuki, Shuji
Ishida, Takashi
Kurokawa, Ken
Akiyama, Yutaka
author_sort Suzuki, Shuji
collection PubMed
description BACKGROUND: A large number of sensitive homology searches are required for mapping DNA sequence fragments to known protein sequences in public and private databases during metagenomic analysis. BLAST is currently used for this purpose, but its calculation speed is insufficient, especially for analyzing the large quantities of sequence data obtained from a next-generation sequencer. However, faster search tools, such as BLAT, do not have sufficient search sensitivity for metagenomic analysis. Thus, a sensitive and efficient homology search tool is in high demand for this type of analysis. METHODOLOGY/PRINCIPAL FINDINGS: We developed a new, highly efficient homology search algorithm suitable for graphics processing unit (GPU) calculations that was implemented as a GPU system that we called GHOSTM. The system first searches for candidate alignment positions for a sequence from the database using pre-calculated indexes and then calculates local alignments around the candidate positions before calculating alignment scores. We implemented both of these processes on GPUs. The system achieved calculation speeds that were 130 and 407 times faster than BLAST with 1 GPU and 4 GPUs, respectively. The system also showed higher search sensitivity and had a calculation speed that was 4 and 15 times faster than BLAT with 1 GPU and 4 GPUs. CONCLUSIONS: We developed a GPU-optimized algorithm to perform sensitive sequence homology searches and implemented the system as GHOSTM. Currently, sequencing technology continues to improve, and sequencers are increasingly producing larger and larger quantities of data. This explosion of sequence data makes computational analysis with contemporary tools more difficult. We developed GHOSTM, which is a cost-efficient tool, and offer this tool as a potential solution to this problem.
format Online
Article
Text
id pubmed-3344842
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33448422012-05-09 GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics Suzuki, Shuji Ishida, Takashi Kurokawa, Ken Akiyama, Yutaka PLoS One Research Article BACKGROUND: A large number of sensitive homology searches are required for mapping DNA sequence fragments to known protein sequences in public and private databases during metagenomic analysis. BLAST is currently used for this purpose, but its calculation speed is insufficient, especially for analyzing the large quantities of sequence data obtained from a next-generation sequencer. However, faster search tools, such as BLAT, do not have sufficient search sensitivity for metagenomic analysis. Thus, a sensitive and efficient homology search tool is in high demand for this type of analysis. METHODOLOGY/PRINCIPAL FINDINGS: We developed a new, highly efficient homology search algorithm suitable for graphics processing unit (GPU) calculations that was implemented as a GPU system that we called GHOSTM. The system first searches for candidate alignment positions for a sequence from the database using pre-calculated indexes and then calculates local alignments around the candidate positions before calculating alignment scores. We implemented both of these processes on GPUs. The system achieved calculation speeds that were 130 and 407 times faster than BLAST with 1 GPU and 4 GPUs, respectively. The system also showed higher search sensitivity and had a calculation speed that was 4 and 15 times faster than BLAT with 1 GPU and 4 GPUs. CONCLUSIONS: We developed a GPU-optimized algorithm to perform sensitive sequence homology searches and implemented the system as GHOSTM. Currently, sequencing technology continues to improve, and sequencers are increasingly producing larger and larger quantities of data. This explosion of sequence data makes computational analysis with contemporary tools more difficult. We developed GHOSTM, which is a cost-efficient tool, and offer this tool as a potential solution to this problem. Public Library of Science 2012-05-04 /pmc/articles/PMC3344842/ /pubmed/22574135 http://dx.doi.org/10.1371/journal.pone.0036060 Text en Suzuki et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Suzuki, Shuji
Ishida, Takashi
Kurokawa, Ken
Akiyama, Yutaka
GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
title GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
title_full GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
title_fullStr GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
title_full_unstemmed GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
title_short GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
title_sort ghostm: a gpu-accelerated homology search tool for metagenomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3344842/
https://www.ncbi.nlm.nih.gov/pubmed/22574135
http://dx.doi.org/10.1371/journal.pone.0036060
work_keys_str_mv AT suzukishuji ghostmagpuacceleratedhomologysearchtoolformetagenomics
AT ishidatakashi ghostmagpuacceleratedhomologysearchtoolformetagenomics
AT kurokawaken ghostmagpuacceleratedhomologysearchtoolformetagenomics
AT akiyamayutaka ghostmagpuacceleratedhomologysearchtoolformetagenomics