Cargando…

Database mining for selection of SNP markers useful in admixture mapping

BACKGROUND: New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying comp...

Descripción completa

Detalles Bibliográficos
Autores principales: Baye, Tesfaye M, Tiwari, Hemant K, Allison, David B, Go, Rodney C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2649128/
https://www.ncbi.nlm.nih.gov/pubmed/19216798
http://dx.doi.org/10.1186/1756-0381-2-1
_version_ 1782165024400211968
author Baye, Tesfaye M
Tiwari, Hemant K
Allison, David B
Go, Rodney C
author_facet Baye, Tesfaye M
Tiwari, Hemant K
Allison, David B
Go, Rodney C
author_sort Baye, Tesfaye M
collection PubMed
description BACKGROUND: New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a reasonable approach to investigating the number of SNPs that are informative for ancestry information. METHODS: The distribution and density of SNPs across the genome of African and European populations were extensively investigated by using the HapMap, Affymetrix, and Illumina SNP databases. We exploited these resources by mining the data available from each of these databases to prioritize potential candidate SNPs useful for admixture mapping in complex human diseases and traits. Over 4 million SNPs were compared between Africans and Europeans on the basis of a pre-specified recommended allele frequency difference (delta) value of ≥ 0.3. RESULTS: The method identified 15% of HapMap, 11% of Affymetrix, and 14% of Illumina SNP sets as candidate SNPs, termed ancestry informative markers (AIMs). These AIM panels with assigned rs numbers, allele frequencies in each ethnic group, delta value, and map positions are all posted on our website . All marker information in this data set is freely and publicly available without restriction. CONCLUSION: The selected SNP sets represent valuable resources for admixture mapping studies. The overlap between selected AIMs by this single measure of marker informativeness in the different platforms is discussed.
format Text
id pubmed-2649128
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26491282009-02-28 Database mining for selection of SNP markers useful in admixture mapping Baye, Tesfaye M Tiwari, Hemant K Allison, David B Go, Rodney C BioData Min Research BACKGROUND: New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a reasonable approach to investigating the number of SNPs that are informative for ancestry information. METHODS: The distribution and density of SNPs across the genome of African and European populations were extensively investigated by using the HapMap, Affymetrix, and Illumina SNP databases. We exploited these resources by mining the data available from each of these databases to prioritize potential candidate SNPs useful for admixture mapping in complex human diseases and traits. Over 4 million SNPs were compared between Africans and Europeans on the basis of a pre-specified recommended allele frequency difference (delta) value of ≥ 0.3. RESULTS: The method identified 15% of HapMap, 11% of Affymetrix, and 14% of Illumina SNP sets as candidate SNPs, termed ancestry informative markers (AIMs). These AIM panels with assigned rs numbers, allele frequencies in each ethnic group, delta value, and map positions are all posted on our website . All marker information in this data set is freely and publicly available without restriction. CONCLUSION: The selected SNP sets represent valuable resources for admixture mapping studies. The overlap between selected AIMs by this single measure of marker informativeness in the different platforms is discussed. BioMed Central 2009-02-14 /pmc/articles/PMC2649128/ /pubmed/19216798 http://dx.doi.org/10.1186/1756-0381-2-1 Text en Copyright © 2009 Baye et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Baye, Tesfaye M
Tiwari, Hemant K
Allison, David B
Go, Rodney C
Database mining for selection of SNP markers useful in admixture mapping
title Database mining for selection of SNP markers useful in admixture mapping
title_full Database mining for selection of SNP markers useful in admixture mapping
title_fullStr Database mining for selection of SNP markers useful in admixture mapping
title_full_unstemmed Database mining for selection of SNP markers useful in admixture mapping
title_short Database mining for selection of SNP markers useful in admixture mapping
title_sort database mining for selection of snp markers useful in admixture mapping
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2649128/
https://www.ncbi.nlm.nih.gov/pubmed/19216798
http://dx.doi.org/10.1186/1756-0381-2-1
work_keys_str_mv AT bayetesfayem databaseminingforselectionofsnpmarkersusefulinadmixturemapping
AT tiwarihemantk databaseminingforselectionofsnpmarkersusefulinadmixturemapping
AT allisondavidb databaseminingforselectionofsnpmarkersusefulinadmixturemapping
AT gorodneyc databaseminingforselectionofsnpmarkersusefulinadmixturemapping