Cargando…

VCGDB: a dynamic genome database of the Chinese population

BACKGROUND: The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The c...

Descripción completa

Detalles Bibliográficos
Autores principales: Ling, Yunchao, Jin, Zhong, Su, Mingming, Zhong, Jun, Zhao, Yongbing, Yu, Jun, Wu, Jiayan, Xiao, Jingfa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4028056/
https://www.ncbi.nlm.nih.gov/pubmed/24708222
http://dx.doi.org/10.1186/1471-2164-15-265
_version_ 1782317029907234816
author Ling, Yunchao
Jin, Zhong
Su, Mingming
Zhong, Jun
Zhao, Yongbing
Yu, Jun
Wu, Jiayan
Xiao, Jingfa
author_facet Ling, Yunchao
Jin, Zhong
Su, Mingming
Zhong, Jun
Zhao, Yongbing
Yu, Jun
Wu, Jiayan
Xiao, Jingfa
author_sort Ling, Yunchao
collection PubMed
description BACKGROUND: The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome wide association studies. DESCRIPTION: We used the massive amount of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, which contains 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive user-friendly virtual Chinese genome browser (VCGBrowser) with functions like seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software. CONCLUSIONS: VCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases.
format Online
Article
Text
id pubmed-4028056
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40280562014-05-30 VCGDB: a dynamic genome database of the Chinese population Ling, Yunchao Jin, Zhong Su, Mingming Zhong, Jun Zhao, Yongbing Yu, Jun Wu, Jiayan Xiao, Jingfa BMC Genomics Database BACKGROUND: The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome wide association studies. DESCRIPTION: We used the massive amount of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, which contains 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive user-friendly virtual Chinese genome browser (VCGBrowser) with functions like seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software. CONCLUSIONS: VCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases. BioMed Central 2014-04-05 /pmc/articles/PMC4028056/ /pubmed/24708222 http://dx.doi.org/10.1186/1471-2164-15-265 Text en Copyright © 2014 Ling et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Database
Ling, Yunchao
Jin, Zhong
Su, Mingming
Zhong, Jun
Zhao, Yongbing
Yu, Jun
Wu, Jiayan
Xiao, Jingfa
VCGDB: a dynamic genome database of the Chinese population
title VCGDB: a dynamic genome database of the Chinese population
title_full VCGDB: a dynamic genome database of the Chinese population
title_fullStr VCGDB: a dynamic genome database of the Chinese population
title_full_unstemmed VCGDB: a dynamic genome database of the Chinese population
title_short VCGDB: a dynamic genome database of the Chinese population
title_sort vcgdb: a dynamic genome database of the chinese population
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4028056/
https://www.ncbi.nlm.nih.gov/pubmed/24708222
http://dx.doi.org/10.1186/1471-2164-15-265
work_keys_str_mv AT lingyunchao vcgdbadynamicgenomedatabaseofthechinesepopulation
AT jinzhong vcgdbadynamicgenomedatabaseofthechinesepopulation
AT sumingming vcgdbadynamicgenomedatabaseofthechinesepopulation
AT zhongjun vcgdbadynamicgenomedatabaseofthechinesepopulation
AT zhaoyongbing vcgdbadynamicgenomedatabaseofthechinesepopulation
AT yujun vcgdbadynamicgenomedatabaseofthechinesepopulation
AT wujiayan vcgdbadynamicgenomedatabaseofthechinesepopulation
AT xiaojingfa vcgdbadynamicgenomedatabaseofthechinesepopulation