Cargando…

Limitations of the Human Reference Genome for Personalized Genomics

Data from the 1000 genomes project (1KGP) and Complete Genomics (CG) have dramatically increased the numbers of known genetic variants and challenge several assumptions about the reference genome and its uses in both clinical and research settings. Specifically, 34% of published array-based GWAS stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Rosenfeld, Jeffrey A., Mason, Christopher E., Smith, Todd M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3394790/
https://www.ncbi.nlm.nih.gov/pubmed/22811759
http://dx.doi.org/10.1371/journal.pone.0040294
_version_ 1782237899232641024
author Rosenfeld, Jeffrey A.
Mason, Christopher E.
Smith, Todd M.
author_facet Rosenfeld, Jeffrey A.
Mason, Christopher E.
Smith, Todd M.
author_sort Rosenfeld, Jeffrey A.
collection PubMed
description Data from the 1000 genomes project (1KGP) and Complete Genomics (CG) have dramatically increased the numbers of known genetic variants and challenge several assumptions about the reference genome and its uses in both clinical and research settings. Specifically, 34% of published array-based GWAS studies for a variety of diseases utilize probes that overlap unanticipated single nucleotide polymorphisms (SNPs), indels, or structural variants. Linkage disequilibrium (LD) block length depends on the numbers of markers used, and the mean LD block size decreases from 16 kb to 7 kb,when HapMap-based calculations are compared to blocks computed from1KGP data. Additionally, when 1KGP and CG variants are compared, 19% of the single nucleotide variants (SNVs) reported from common genomes are unique to one dataset; likely a result of differences in data collection methodology, alignment of reads to the reference genome, and variant-calling algorithms. Together these observations indicate that current research resources and informatics methods do not adequately account for the high level of variation that already exists in the human population and significant efforts are needed to create resources that can accurately assess personal genomics for health, disease, and predict treatment outcomes.
format Online
Article
Text
id pubmed-3394790
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33947902012-07-18 Limitations of the Human Reference Genome for Personalized Genomics Rosenfeld, Jeffrey A. Mason, Christopher E. Smith, Todd M. PLoS One Research Article Data from the 1000 genomes project (1KGP) and Complete Genomics (CG) have dramatically increased the numbers of known genetic variants and challenge several assumptions about the reference genome and its uses in both clinical and research settings. Specifically, 34% of published array-based GWAS studies for a variety of diseases utilize probes that overlap unanticipated single nucleotide polymorphisms (SNPs), indels, or structural variants. Linkage disequilibrium (LD) block length depends on the numbers of markers used, and the mean LD block size decreases from 16 kb to 7 kb,when HapMap-based calculations are compared to blocks computed from1KGP data. Additionally, when 1KGP and CG variants are compared, 19% of the single nucleotide variants (SNVs) reported from common genomes are unique to one dataset; likely a result of differences in data collection methodology, alignment of reads to the reference genome, and variant-calling algorithms. Together these observations indicate that current research resources and informatics methods do not adequately account for the high level of variation that already exists in the human population and significant efforts are needed to create resources that can accurately assess personal genomics for health, disease, and predict treatment outcomes. Public Library of Science 2012-07-11 /pmc/articles/PMC3394790/ /pubmed/22811759 http://dx.doi.org/10.1371/journal.pone.0040294 Text en Rosenfeld et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Rosenfeld, Jeffrey A.
Mason, Christopher E.
Smith, Todd M.
Limitations of the Human Reference Genome for Personalized Genomics
title Limitations of the Human Reference Genome for Personalized Genomics
title_full Limitations of the Human Reference Genome for Personalized Genomics
title_fullStr Limitations of the Human Reference Genome for Personalized Genomics
title_full_unstemmed Limitations of the Human Reference Genome for Personalized Genomics
title_short Limitations of the Human Reference Genome for Personalized Genomics
title_sort limitations of the human reference genome for personalized genomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3394790/
https://www.ncbi.nlm.nih.gov/pubmed/22811759
http://dx.doi.org/10.1371/journal.pone.0040294
work_keys_str_mv AT rosenfeldjeffreya limitationsofthehumanreferencegenomeforpersonalizedgenomics
AT masonchristophere limitationsofthehumanreferencegenomeforpersonalizedgenomics
AT smithtoddm limitationsofthehumanreferencegenomeforpersonalizedgenomics