
DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data


Bibliographic Details
Main authors: Nagasaki, Hideki, Mochizuki, Takako, Kodama, Yuichi, Saruhashi, Satoshi, Morizaki, Shota, Sugawara, Hideaki, Ohyanagi, Hajime, Kurata, Nori, Okubo, Kousaku, Takagi, Toshihisa, Kaminuma, Eli, Nakamura, Yasukazu
Format: Online Article Text
Language: English
Published: Oxford University Press 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3738164/
https://www.ncbi.nlm.nih.gov/pubmed/23657089
http://dx.doi.org/10.1093/dnares/dst017
author Nagasaki, Hideki
Mochizuki, Takako
Kodama, Yuichi
Saruhashi, Satoshi
Morizaki, Shota
Sugawara, Hideaki
Ohyanagi, Hajime
Kurata, Nori
Okubo, Kousaku
Takagi, Toshihisa
Kaminuma, Eli
Nakamura, Yasukazu
collection PubMed
description High-performance next-generation sequencing (NGS) technologies are advancing genomics and molecular biological research. However, the immense amount of sequence data requires computational skills and suitable hardware resources that are a challenge to molecular biologists. The DNA Data Bank of Japan (DDBJ) of the National Institute of Genetics (NIG) has initiated a cloud computing-based analytical pipeline, the DDBJ Read Annotation Pipeline (DDBJ Pipeline), for a high-throughput annotation of NGS reads. The DDBJ Pipeline offers a user-friendly graphical web interface and processes massive NGS datasets using decentralized processing by NIG supercomputers currently free of charge. The proposed pipeline consists of two analysis components: basic analysis for reference genome mapping and de novo assembly and subsequent high-level analysis of structural and functional annotations. Users may smoothly switch between the two components in the pipeline, facilitating web-based operations on a supercomputer for high-throughput data analysis. Moreover, public NGS reads of the DDBJ Sequence Read Archive located on the same supercomputer can be imported into the pipeline through the input of only an accession number. This proposed pipeline will facilitate research by utilizing unified analytical workflows applied to the NGS data. The DDBJ Pipeline is accessible at http://p.ddbj.nig.ac.jp/.
format Online
Article
Text
id pubmed-3738164
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3738164 2013-08-08 DNA Res Full Papers Oxford University Press 2013-08 2013-05-08 /pmc/articles/PMC3738164/ /pubmed/23657089 http://dx.doi.org/10.1093/dnares/dst017 Text en © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title DDBJ Read Annotation Pipeline: A Cloud Computing-Based Pipeline for High-Throughput Analysis of Next-Generation Sequencing Data
topic Full Papers