Cargando…

A scalable, aggregated genotypic–phenotypic database for human disease variation

Next generation sequencing multi-gene panels have greatly improved the diagnostic yield and cost effectiveness of genetic testing and are rapidly being integrated into the clinic for hereditary cancer risk. With this technology comes a dramatic increase in the volume, type and complexity of data. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Barrett, Ryan, Neben, Cynthia L, Zimmer, Anjali D, Mishne, Gilad, McKennon, Wendy, Zhou, Alicia Y, Ginsberg, Jeremy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6372842/
https://www.ncbi.nlm.nih.gov/pubmed/30759220
http://dx.doi.org/10.1093/database/baz013
_version_ 1783394843966308352
author Barrett, Ryan
Neben, Cynthia L
Zimmer, Anjali D
Mishne, Gilad
McKennon, Wendy
Zhou, Alicia Y
Ginsberg, Jeremy
author_facet Barrett, Ryan
Neben, Cynthia L
Zimmer, Anjali D
Mishne, Gilad
McKennon, Wendy
Zhou, Alicia Y
Ginsberg, Jeremy
author_sort Barrett, Ryan
collection PubMed
description Next generation sequencing multi-gene panels have greatly improved the diagnostic yield and cost effectiveness of genetic testing and are rapidly being integrated into the clinic for hereditary cancer risk. With this technology comes a dramatic increase in the volume, type and complexity of data. This invaluable data though is too often buried or inaccessible to researchers, especially to those without strong analytical or programming skills. To effectively share comprehensive, integrated genotypic–phenotypic data, we built Color Data, a publicly available, cloud-based database that supports broad access and data literacy. The database is composed of 50 000 individuals who were sequenced for 30 genes associated with hereditary cancer risk and provides useful information on allele frequency and variant classification, as well as associated phenotypic information such as demographics and personal and family history. Our user-friendly interface allows researchers to easily execute their own queries with filtering, and the results of queries can be shared and/or downloaded. The rapid and broad dissemination of these research results will help increase the value of, and reduce the waste in, scientific resources and data. Furthermore, the database is able to quickly scale and support integration of additional genes and human hereditary conditions. We hope that this database will help researchers and scientists explore genotype–phenotype correlations in hereditary cancer, identify novel variants for functional analysis and enable data-driven drug discovery and development.
format Online
Article
Text
id pubmed-6372842
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63728422019-02-21 A scalable, aggregated genotypic–phenotypic database for human disease variation Barrett, Ryan Neben, Cynthia L Zimmer, Anjali D Mishne, Gilad McKennon, Wendy Zhou, Alicia Y Ginsberg, Jeremy Database (Oxford) Original Article Next generation sequencing multi-gene panels have greatly improved the diagnostic yield and cost effectiveness of genetic testing and are rapidly being integrated into the clinic for hereditary cancer risk. With this technology comes a dramatic increase in the volume, type and complexity of data. This invaluable data though is too often buried or inaccessible to researchers, especially to those without strong analytical or programming skills. To effectively share comprehensive, integrated genotypic–phenotypic data, we built Color Data, a publicly available, cloud-based database that supports broad access and data literacy. The database is composed of 50 000 individuals who were sequenced for 30 genes associated with hereditary cancer risk and provides useful information on allele frequency and variant classification, as well as associated phenotypic information such as demographics and personal and family history. Our user-friendly interface allows researchers to easily execute their own queries with filtering, and the results of queries can be shared and/or downloaded. The rapid and broad dissemination of these research results will help increase the value of, and reduce the waste in, scientific resources and data. Furthermore, the database is able to quickly scale and support integration of additional genes and human hereditary conditions. We hope that this database will help researchers and scientists explore genotype–phenotype correlations in hereditary cancer, identify novel variants for functional analysis and enable data-driven drug discovery and development. Oxford University Press 2019-02-13 /pmc/articles/PMC6372842/ /pubmed/30759220 http://dx.doi.org/10.1093/database/baz013 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Barrett, Ryan
Neben, Cynthia L
Zimmer, Anjali D
Mishne, Gilad
McKennon, Wendy
Zhou, Alicia Y
Ginsberg, Jeremy
A scalable, aggregated genotypic–phenotypic database for human disease variation
title A scalable, aggregated genotypic–phenotypic database for human disease variation
title_full A scalable, aggregated genotypic–phenotypic database for human disease variation
title_fullStr A scalable, aggregated genotypic–phenotypic database for human disease variation
title_full_unstemmed A scalable, aggregated genotypic–phenotypic database for human disease variation
title_short A scalable, aggregated genotypic–phenotypic database for human disease variation
title_sort scalable, aggregated genotypic–phenotypic database for human disease variation
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6372842/
https://www.ncbi.nlm.nih.gov/pubmed/30759220
http://dx.doi.org/10.1093/database/baz013
work_keys_str_mv AT barrettryan ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT nebencynthial ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT zimmeranjalid ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT mishnegilad ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT mckennonwendy ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT zhoualiciay ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT ginsbergjeremy ascalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT barrettryan scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT nebencynthial scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT zimmeranjalid scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT mishnegilad scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT mckennonwendy scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT zhoualiciay scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation
AT ginsbergjeremy scalableaggregatedgenotypicphenotypicdatabaseforhumandiseasevariation