Cargando…

DNApod: DNA polymorphism annotation database from next-generation sequence read archives

With the rapid advances in next-generation sequencing (NGS), datasets for DNA polymorphisms among various species and strains have been produced, stored, and distributed. However, reliability varies among these datasets because the experimental and analytical conditions used differ among assays. Fur...

Descripción completa

Detalles Bibliográficos
Autores principales: Mochizuki, Takako, Tanizawa, Yasuhiro, Fujisawa, Takatomo, Ohta, Tazro, Nikoh, Naruo, Shimizu, Tokurou, Toyoda, Atsushi, Fujiyama, Asao, Kurata, Nori, Nagasaki, Hideki, Kaminuma, Eli, Nakamura, Yasukazu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5325239/
https://www.ncbi.nlm.nih.gov/pubmed/28234924
http://dx.doi.org/10.1371/journal.pone.0172269
_version_ 1782510342319898624
author Mochizuki, Takako
Tanizawa, Yasuhiro
Fujisawa, Takatomo
Ohta, Tazro
Nikoh, Naruo
Shimizu, Tokurou
Toyoda, Atsushi
Fujiyama, Asao
Kurata, Nori
Nagasaki, Hideki
Kaminuma, Eli
Nakamura, Yasukazu
author_facet Mochizuki, Takako
Tanizawa, Yasuhiro
Fujisawa, Takatomo
Ohta, Tazro
Nikoh, Naruo
Shimizu, Tokurou
Toyoda, Atsushi
Fujiyama, Asao
Kurata, Nori
Nagasaki, Hideki
Kaminuma, Eli
Nakamura, Yasukazu
author_sort Mochizuki, Takako
collection PubMed
description With the rapid advances in next-generation sequencing (NGS), datasets for DNA polymorphisms among various species and strains have been produced, stored, and distributed. However, reliability varies among these datasets because the experimental and analytical conditions used differ among assays. Furthermore, such datasets have been frequently distributed from the websites of individual sequencing projects. It is desirable to integrate DNA polymorphism data into one database featuring uniform quality control that is distributed from a single platform at a single place. DNA polymorphism annotation database (DNApod; http://tga.nig.ac.jp/dnapod/) is an integrated database that stores genome-wide DNA polymorphism datasets acquired under uniform analytical conditions, and this includes uniformity in the quality of the raw data, the reference genome version, and evaluation algorithms. DNApod genotypic data are re-analyzed whole-genome shotgun datasets extracted from sequence read archives, and DNApod distributes genome-wide DNA polymorphism datasets and known-gene annotations for each DNA polymorphism. This new database was developed for storing genome-wide DNA polymorphism datasets of plants, with crops being the first priority. Here, we describe our analyzed data for 679, 404, and 66 strains of rice, maize, and sorghum, respectively. The analytical methods are available as a DNApod workflow in an NGS annotation system of the DNA Data Bank of Japan and a virtual machine image. Furthermore, DNApod provides tables of links of identifiers between DNApod genotypic data and public phenotypic data. To advance the sharing of organism knowledge, DNApod offers basic and ubiquitous functions for multiple alignment and phylogenetic tree construction by using orthologous gene information.
format Online
Article
Text
id pubmed-5325239
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53252392017-03-09 DNApod: DNA polymorphism annotation database from next-generation sequence read archives Mochizuki, Takako Tanizawa, Yasuhiro Fujisawa, Takatomo Ohta, Tazro Nikoh, Naruo Shimizu, Tokurou Toyoda, Atsushi Fujiyama, Asao Kurata, Nori Nagasaki, Hideki Kaminuma, Eli Nakamura, Yasukazu PLoS One Research Article With the rapid advances in next-generation sequencing (NGS), datasets for DNA polymorphisms among various species and strains have been produced, stored, and distributed. However, reliability varies among these datasets because the experimental and analytical conditions used differ among assays. Furthermore, such datasets have been frequently distributed from the websites of individual sequencing projects. It is desirable to integrate DNA polymorphism data into one database featuring uniform quality control that is distributed from a single platform at a single place. DNA polymorphism annotation database (DNApod; http://tga.nig.ac.jp/dnapod/) is an integrated database that stores genome-wide DNA polymorphism datasets acquired under uniform analytical conditions, and this includes uniformity in the quality of the raw data, the reference genome version, and evaluation algorithms. DNApod genotypic data are re-analyzed whole-genome shotgun datasets extracted from sequence read archives, and DNApod distributes genome-wide DNA polymorphism datasets and known-gene annotations for each DNA polymorphism. This new database was developed for storing genome-wide DNA polymorphism datasets of plants, with crops being the first priority. Here, we describe our analyzed data for 679, 404, and 66 strains of rice, maize, and sorghum, respectively. The analytical methods are available as a DNApod workflow in an NGS annotation system of the DNA Data Bank of Japan and a virtual machine image. Furthermore, DNApod provides tables of links of identifiers between DNApod genotypic data and public phenotypic data. To advance the sharing of organism knowledge, DNApod offers basic and ubiquitous functions for multiple alignment and phylogenetic tree construction by using orthologous gene information. Public Library of Science 2017-02-24 /pmc/articles/PMC5325239/ /pubmed/28234924 http://dx.doi.org/10.1371/journal.pone.0172269 Text en © 2017 Mochizuki et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mochizuki, Takako
Tanizawa, Yasuhiro
Fujisawa, Takatomo
Ohta, Tazro
Nikoh, Naruo
Shimizu, Tokurou
Toyoda, Atsushi
Fujiyama, Asao
Kurata, Nori
Nagasaki, Hideki
Kaminuma, Eli
Nakamura, Yasukazu
DNApod: DNA polymorphism annotation database from next-generation sequence read archives
title DNApod: DNA polymorphism annotation database from next-generation sequence read archives
title_full DNApod: DNA polymorphism annotation database from next-generation sequence read archives
title_fullStr DNApod: DNA polymorphism annotation database from next-generation sequence read archives
title_full_unstemmed DNApod: DNA polymorphism annotation database from next-generation sequence read archives
title_short DNApod: DNA polymorphism annotation database from next-generation sequence read archives
title_sort dnapod: dna polymorphism annotation database from next-generation sequence read archives
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5325239/
https://www.ncbi.nlm.nih.gov/pubmed/28234924
http://dx.doi.org/10.1371/journal.pone.0172269
work_keys_str_mv AT mochizukitakako dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT tanizawayasuhiro dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT fujisawatakatomo dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT ohtatazro dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT nikohnaruo dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT shimizutokurou dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT toyodaatsushi dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT fujiyamaasao dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT kuratanori dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT nagasakihideki dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT kaminumaeli dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives
AT nakamurayasukazu dnapoddnapolymorphismannotationdatabasefromnextgenerationsequencereadarchives