Cargando…

Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research

BACKGROUND: Here we present convergent methodologies using theoretical calculations, empirical assessment on in-house and publicly available datasets as well as in silico simulations, that validate a panel of SNPs for a variety of necessary tasks in human genetics disease research before resources a...

Descripción completa

Detalles Bibliográficos
Autores principales: Hou, Liping, Phillips, Christopher, Azaro, Marco, Brzustowicz, Linda M., Bartlett, Christopher W.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3096622/
https://www.ncbi.nlm.nih.gov/pubmed/21611176
http://dx.doi.org/10.1371/journal.pone.0019699
_version_ 1782203737108905984
author Hou, Liping
Phillips, Christopher
Azaro, Marco
Brzustowicz, Linda M.
Bartlett, Christopher W.
author_facet Hou, Liping
Phillips, Christopher
Azaro, Marco
Brzustowicz, Linda M.
Bartlett, Christopher W.
author_sort Hou, Liping
collection PubMed
description BACKGROUND: Here we present convergent methodologies using theoretical calculations, empirical assessment on in-house and publicly available datasets as well as in silico simulations, that validate a panel of SNPs for a variety of necessary tasks in human genetics disease research before resources are committed to larger-scale genotyping studies on those samples. While large-scale well-funded human genetic studies routinely have up to a million SNP genotypes, samples in a human genetics laboratory that are not yet part of such studies may be productively utilized in pilot projects or as part of targeted follow-up work though such smaller scale applications require at least some genome-wide genotype data for quality control purposes such as DNA “barcoding” to detect swaps or contamination issues, determining familial relationships between samples and correcting biases due to population effects such as population stratification in pilot studies. PRINCIPAL FINDINGS: Empirical performance in classification of relative types for any two given DNA samples (e.g., full siblings, parental, etc) indicated that for outbred populations the panel performs sufficiently to classify relationship in extended families and therefore also for smaller structures such as trios and for twin zygosity testing. Additionally, familial relationships do not significantly diminish the (mean match) probability of sharing SNP genotypes in pedigrees, further indicating the uniqueness of the “barcode.” Simulation using these SNPs for an African American case-control disease association study demonstrated that population stratification, even in complex admixed samples, can be adequately corrected under a range of disease models using the SNP panel. CONCLUSION: The panel has been validated for use in a variety of human disease genetics research tasks including sample barcoding, relationship verification, population substructure detection and statistical correction. Given the ease of genotyping our specific assay contained herein, this panel represents a useful and economical panel for human geneticists.
format Text
id pubmed-3096622
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30966222011-05-24 Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research Hou, Liping Phillips, Christopher Azaro, Marco Brzustowicz, Linda M. Bartlett, Christopher W. PLoS One Research Article BACKGROUND: Here we present convergent methodologies using theoretical calculations, empirical assessment on in-house and publicly available datasets as well as in silico simulations, that validate a panel of SNPs for a variety of necessary tasks in human genetics disease research before resources are committed to larger-scale genotyping studies on those samples. While large-scale well-funded human genetic studies routinely have up to a million SNP genotypes, samples in a human genetics laboratory that are not yet part of such studies may be productively utilized in pilot projects or as part of targeted follow-up work though such smaller scale applications require at least some genome-wide genotype data for quality control purposes such as DNA “barcoding” to detect swaps or contamination issues, determining familial relationships between samples and correcting biases due to population effects such as population stratification in pilot studies. PRINCIPAL FINDINGS: Empirical performance in classification of relative types for any two given DNA samples (e.g., full siblings, parental, etc) indicated that for outbred populations the panel performs sufficiently to classify relationship in extended families and therefore also for smaller structures such as trios and for twin zygosity testing. Additionally, familial relationships do not significantly diminish the (mean match) probability of sharing SNP genotypes in pedigrees, further indicating the uniqueness of the “barcode.” Simulation using these SNPs for an African American case-control disease association study demonstrated that population stratification, even in complex admixed samples, can be adequately corrected under a range of disease models using the SNP panel. CONCLUSION: The panel has been validated for use in a variety of human disease genetics research tasks including sample barcoding, relationship verification, population substructure detection and statistical correction. Given the ease of genotyping our specific assay contained herein, this panel represents a useful and economical panel for human geneticists. Public Library of Science 2011-05-17 /pmc/articles/PMC3096622/ /pubmed/21611176 http://dx.doi.org/10.1371/journal.pone.0019699 Text en Hou et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Hou, Liping
Phillips, Christopher
Azaro, Marco
Brzustowicz, Linda M.
Bartlett, Christopher W.
Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
title Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
title_full Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
title_fullStr Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
title_full_unstemmed Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
title_short Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
title_sort validation of a cost-efficient multi-purpose snp panel for disease based research
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3096622/
https://www.ncbi.nlm.nih.gov/pubmed/21611176
http://dx.doi.org/10.1371/journal.pone.0019699
work_keys_str_mv AT houliping validationofacostefficientmultipurposesnppanelfordiseasebasedresearch
AT phillipschristopher validationofacostefficientmultipurposesnppanelfordiseasebasedresearch
AT azaromarco validationofacostefficientmultipurposesnppanelfordiseasebasedresearch
AT brzustowiczlindam validationofacostefficientmultipurposesnppanelfordiseasebasedresearch
AT bartlettchristopherw validationofacostefficientmultipurposesnppanelfordiseasebasedresearch