Cargando…

Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation

BACKGROUND: The recent discovery of widespread copy number variation in humans has forced a shift away from the assumption of two copies per locus per cell throughout the autosomal genome. In particular, a SNP site can no longer always be accurately assigned one of three genotypes in an individual....

Descripción completa

Detalles Bibliográficos
Autores principales: MacConaill, Laura E, Aldred, Micheala A, Lu, Xincheng, LaFramboise, Thomas
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1934372/
https://www.ncbi.nlm.nih.gov/pubmed/17608949
http://dx.doi.org/10.1186/1471-2164-8-211
_version_ 1782134345627074560
author MacConaill, Laura E
Aldred, Micheala A
Lu, Xincheng
LaFramboise, Thomas
author_facet MacConaill, Laura E
Aldred, Micheala A
Lu, Xincheng
LaFramboise, Thomas
author_sort MacConaill, Laura E
collection PubMed
description BACKGROUND: The recent discovery of widespread copy number variation in humans has forced a shift away from the assumption of two copies per locus per cell throughout the autosomal genome. In particular, a SNP site can no longer always be accurately assigned one of three genotypes in an individual. In the presence of copy number variability, the individual may theoretically harbor any number of copies of each of the two SNP alleles. RESULTS: To address this issue, we have developed a method to infer a "generalized genotype" from raw SNP microarray data. Here we apply our approach to data from 48 individuals and uncover thousands of aberrant SNPs, most in regions that were previously unreported as copy number variants. We show that our allele-specific copy numbers follow Mendelian inheritance patterns that would be obscured in the absence of SNP allele information. The interplay between duplication and point mutation in our data shed light on the relative frequencies of these events in human history, showing that at least some of the duplication events were recurrent. CONCLUSION: This new multi-allelic view of SNPs has a complicated role in disease association studies, and further work will be necessary in order to accurately assess its importance. Software to perform generalized genotyping from SNP array data is freely available online [1].
format Text
id pubmed-1934372
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19343722007-07-28 Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation MacConaill, Laura E Aldred, Micheala A Lu, Xincheng LaFramboise, Thomas BMC Genomics Research Article BACKGROUND: The recent discovery of widespread copy number variation in humans has forced a shift away from the assumption of two copies per locus per cell throughout the autosomal genome. In particular, a SNP site can no longer always be accurately assigned one of three genotypes in an individual. In the presence of copy number variability, the individual may theoretically harbor any number of copies of each of the two SNP alleles. RESULTS: To address this issue, we have developed a method to infer a "generalized genotype" from raw SNP microarray data. Here we apply our approach to data from 48 individuals and uncover thousands of aberrant SNPs, most in regions that were previously unreported as copy number variants. We show that our allele-specific copy numbers follow Mendelian inheritance patterns that would be obscured in the absence of SNP allele information. The interplay between duplication and point mutation in our data shed light on the relative frequencies of these events in human history, showing that at least some of the duplication events were recurrent. CONCLUSION: This new multi-allelic view of SNPs has a complicated role in disease association studies, and further work will be necessary in order to accurately assess its importance. Software to perform generalized genotyping from SNP array data is freely available online [1]. BioMed Central 2007-07-03 /pmc/articles/PMC1934372/ /pubmed/17608949 http://dx.doi.org/10.1186/1471-2164-8-211 Text en Copyright © 2007 MacConaill et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
MacConaill, Laura E
Aldred, Micheala A
Lu, Xincheng
LaFramboise, Thomas
Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
title Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
title_full Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
title_fullStr Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
title_full_unstemmed Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
title_short Toward accurate high-throughput SNP genotyping in the presence of inherited copy number variation
title_sort toward accurate high-throughput snp genotyping in the presence of inherited copy number variation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1934372/
https://www.ncbi.nlm.nih.gov/pubmed/17608949
http://dx.doi.org/10.1186/1471-2164-8-211
work_keys_str_mv AT macconailllaurae towardaccuratehighthroughputsnpgenotypinginthepresenceofinheritedcopynumbervariation
AT aldredmichealaa towardaccuratehighthroughputsnpgenotypinginthepresenceofinheritedcopynumbervariation
AT luxincheng towardaccuratehighthroughputsnpgenotypinginthepresenceofinheritedcopynumbervariation
AT laframboisethomas towardaccuratehighthroughputsnpgenotypinginthepresenceofinheritedcopynumbervariation