Cargando…
Limitations of the Human Reference Genome for Personalized Genomics
Data from the 1000 genomes project (1KGP) and Complete Genomics (CG) have dramatically increased the numbers of known genetic variants and challenge several assumptions about the reference genome and its uses in both clinical and research settings. Specifically, 34% of published array-based GWAS stu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3394790/ https://www.ncbi.nlm.nih.gov/pubmed/22811759 http://dx.doi.org/10.1371/journal.pone.0040294 |
_version_ | 1782237899232641024 |
---|---|
author | Rosenfeld, Jeffrey A. Mason, Christopher E. Smith, Todd M. |
author_facet | Rosenfeld, Jeffrey A. Mason, Christopher E. Smith, Todd M. |
author_sort | Rosenfeld, Jeffrey A. |
collection | PubMed |
description | Data from the 1000 genomes project (1KGP) and Complete Genomics (CG) have dramatically increased the numbers of known genetic variants and challenge several assumptions about the reference genome and its uses in both clinical and research settings. Specifically, 34% of published array-based GWAS studies for a variety of diseases utilize probes that overlap unanticipated single nucleotide polymorphisms (SNPs), indels, or structural variants. Linkage disequilibrium (LD) block length depends on the numbers of markers used, and the mean LD block size decreases from 16 kb to 7 kb,when HapMap-based calculations are compared to blocks computed from1KGP data. Additionally, when 1KGP and CG variants are compared, 19% of the single nucleotide variants (SNVs) reported from common genomes are unique to one dataset; likely a result of differences in data collection methodology, alignment of reads to the reference genome, and variant-calling algorithms. Together these observations indicate that current research resources and informatics methods do not adequately account for the high level of variation that already exists in the human population and significant efforts are needed to create resources that can accurately assess personal genomics for health, disease, and predict treatment outcomes. |
format | Online Article Text |
id | pubmed-3394790 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-33947902012-07-18 Limitations of the Human Reference Genome for Personalized Genomics Rosenfeld, Jeffrey A. Mason, Christopher E. Smith, Todd M. PLoS One Research Article Data from the 1000 genomes project (1KGP) and Complete Genomics (CG) have dramatically increased the numbers of known genetic variants and challenge several assumptions about the reference genome and its uses in both clinical and research settings. Specifically, 34% of published array-based GWAS studies for a variety of diseases utilize probes that overlap unanticipated single nucleotide polymorphisms (SNPs), indels, or structural variants. Linkage disequilibrium (LD) block length depends on the numbers of markers used, and the mean LD block size decreases from 16 kb to 7 kb,when HapMap-based calculations are compared to blocks computed from1KGP data. Additionally, when 1KGP and CG variants are compared, 19% of the single nucleotide variants (SNVs) reported from common genomes are unique to one dataset; likely a result of differences in data collection methodology, alignment of reads to the reference genome, and variant-calling algorithms. Together these observations indicate that current research resources and informatics methods do not adequately account for the high level of variation that already exists in the human population and significant efforts are needed to create resources that can accurately assess personal genomics for health, disease, and predict treatment outcomes. Public Library of Science 2012-07-11 /pmc/articles/PMC3394790/ /pubmed/22811759 http://dx.doi.org/10.1371/journal.pone.0040294 Text en Rosenfeld et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Rosenfeld, Jeffrey A. Mason, Christopher E. Smith, Todd M. Limitations of the Human Reference Genome for Personalized Genomics |
title | Limitations of the Human Reference Genome for Personalized Genomics |
title_full | Limitations of the Human Reference Genome for Personalized Genomics |
title_fullStr | Limitations of the Human Reference Genome for Personalized Genomics |
title_full_unstemmed | Limitations of the Human Reference Genome for Personalized Genomics |
title_short | Limitations of the Human Reference Genome for Personalized Genomics |
title_sort | limitations of the human reference genome for personalized genomics |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3394790/ https://www.ncbi.nlm.nih.gov/pubmed/22811759 http://dx.doi.org/10.1371/journal.pone.0040294 |
work_keys_str_mv | AT rosenfeldjeffreya limitationsofthehumanreferencegenomeforpersonalizedgenomics AT masonchristophere limitationsofthehumanreferencegenomeforpersonalizedgenomics AT smithtoddm limitationsofthehumanreferencegenomeforpersonalizedgenomics |