DDBJ Database updates and computational infrastructure enhancement

The Bioinformation and DDBJ Center (https://www.ddbj.nig.ac.jp) in the National Institute of Genetics (NIG) maintains a primary nucleotide sequence database as a member of the International Nucleotide Sequence Database Collaboration (INSDC) in partnership with the US National Center for Biotechnolog...

Bibliographic Details
Main Authors: Ogasawara, Osamu, Kodama, Yuichi, Mashima, Jun, Kosuge, Takehide, Fujisawa, Takatomo
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7145692/
https://www.ncbi.nlm.nih.gov/pubmed/31724722
http://dx.doi.org/10.1093/nar/gkz982
author Ogasawara, Osamu
Kodama, Yuichi
Mashima, Jun
Kosuge, Takehide
Fujisawa, Takatomo
author_sort Ogasawara, Osamu
collection PubMed
description The Bioinformation and DDBJ Center (https://www.ddbj.nig.ac.jp) in the National Institute of Genetics (NIG) maintains a primary nucleotide sequence database as a member of the International Nucleotide Sequence Database Collaboration (INSDC) in partnership with the US National Center for Biotechnology Information and the European Bioinformatics Institute. The NIG operates the NIG supercomputer as a computational basis for the construction of DDBJ databases and as a large-scale computational resource for Japanese biologists and medical researchers. In order to accommodate the rapidly growing amount of deoxyribonucleic acid (DNA) nucleotide sequence data, NIG replaced its supercomputer system, which is designed for big data analysis of genome data, in early 2019. The new system is equipped with 30 PB of DNA data archiving storage; large-scale parallel distributed file systems (13.8 PB in total) and 1.1 PFLOPS computation nodes and graphics processing units (GPUs). Moreover, as a starting point of developing multi-cloud infrastructure of bioinformatics, we have also installed an automatic file transfer system that allows users to prevent data lock-in and to achieve cost/performance balance by exploiting the most suitable environment from among the supercomputer and public clouds for different workloads.
format Online
Article
Text
id pubmed-7145692
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-71456922020-04-13 DDBJ Database updates and computational infrastructure enhancement Ogasawara, Osamu Kodama, Yuichi Mashima, Jun Kosuge, Takehide Fujisawa, Takatomo Nucleic Acids Res Database Issue The Bioinformation and DDBJ Center (https://www.ddbj.nig.ac.jp) in the National Institute of Genetics (NIG) maintains a primary nucleotide sequence database as a member of the International Nucleotide Sequence Database Collaboration (INSDC) in partnership with the US National Center for Biotechnology Information and the European Bioinformatics Institute. The NIG operates the NIG supercomputer as a computational basis for the construction of DDBJ databases and as a large-scale computational resource for Japanese biologists and medical researchers. In order to accommodate the rapidly growing amount of deoxyribonucleic acid (DNA) nucleotide sequence data, NIG replaced its supercomputer system, which is designed for big data analysis of genome data, in early 2019. The new system is equipped with 30 PB of DNA data archiving storage; large-scale parallel distributed file systems (13.8 PB in total) and 1.1 PFLOPS computation nodes and graphics processing units (GPUs). Moreover, as a starting point of developing multi-cloud infrastructure of bioinformatics, we have also installed an automatic file transfer system that allows users to prevent data lock-in and to achieve cost/performance balance by exploiting the most suitable environment from among the supercomputer and public clouds for different workloads. Oxford University Press 2020-01-08 2019-11-14 /pmc/articles/PMC7145692/ /pubmed/31724722 http://dx.doi.org/10.1093/nar/gkz982 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. 
http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title DDBJ Database updates and computational infrastructure enhancement
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7145692/
https://www.ncbi.nlm.nih.gov/pubmed/31724722
http://dx.doi.org/10.1093/nar/gkz982