Cargando…
Database mining for selection of SNP markers useful in admixture mapping
BACKGROUND: New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying comp...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2649128/ https://www.ncbi.nlm.nih.gov/pubmed/19216798 http://dx.doi.org/10.1186/1756-0381-2-1 |
_version_ | 1782165024400211968 |
---|---|
author | Baye, Tesfaye M Tiwari, Hemant K Allison, David B Go, Rodney C |
author_facet | Baye, Tesfaye M Tiwari, Hemant K Allison, David B Go, Rodney C |
author_sort | Baye, Tesfaye M |
collection | PubMed |
description | BACKGROUND: New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a reasonable approach to investigating the number of SNPs that are informative for ancestry information. METHODS: The distribution and density of SNPs across the genome of African and European populations were extensively investigated by using the HapMap, Affymetrix, and Illumina SNP databases. We exploited these resources by mining the data available from each of these databases to prioritize potential candidate SNPs useful for admixture mapping in complex human diseases and traits. Over 4 million SNPs were compared between Africans and Europeans on the basis of a pre-specified recommended allele frequency difference (delta) value of ≥ 0.3. RESULTS: The method identified 15% of HapMap, 11% of Affymetrix, and 14% of Illumina SNP sets as candidate SNPs, termed ancestry informative markers (AIMs). These AIM panels with assigned rs numbers, allele frequencies in each ethnic group, delta value, and map positions are all posted on our website . All marker information in this data set is freely and publicly available without restriction. CONCLUSION: The selected SNP sets represent valuable resources for admixture mapping studies. The overlap between selected AIMs by this single measure of marker informativeness in the different platforms is discussed. |
format | Text |
id | pubmed-2649128 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26491282009-02-28 Database mining for selection of SNP markers useful in admixture mapping Baye, Tesfaye M Tiwari, Hemant K Allison, David B Go, Rodney C BioData Min Research BACKGROUND: New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a reasonable approach to investigating the number of SNPs that are informative for ancestry information. METHODS: The distribution and density of SNPs across the genome of African and European populations were extensively investigated by using the HapMap, Affymetrix, and Illumina SNP databases. We exploited these resources by mining the data available from each of these databases to prioritize potential candidate SNPs useful for admixture mapping in complex human diseases and traits. Over 4 million SNPs were compared between Africans and Europeans on the basis of a pre-specified recommended allele frequency difference (delta) value of ≥ 0.3. RESULTS: The method identified 15% of HapMap, 11% of Affymetrix, and 14% of Illumina SNP sets as candidate SNPs, termed ancestry informative markers (AIMs). These AIM panels with assigned rs numbers, allele frequencies in each ethnic group, delta value, and map positions are all posted on our website . All marker information in this data set is freely and publicly available without restriction. CONCLUSION: The selected SNP sets represent valuable resources for admixture mapping studies. The overlap between selected AIMs by this single measure of marker informativeness in the different platforms is discussed. BioMed Central 2009-02-14 /pmc/articles/PMC2649128/ /pubmed/19216798 http://dx.doi.org/10.1186/1756-0381-2-1 Text en Copyright © 2009 Baye et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Baye, Tesfaye M Tiwari, Hemant K Allison, David B Go, Rodney C Database mining for selection of SNP markers useful in admixture mapping |
title | Database mining for selection of SNP markers useful in admixture mapping |
title_full | Database mining for selection of SNP markers useful in admixture mapping |
title_fullStr | Database mining for selection of SNP markers useful in admixture mapping |
title_full_unstemmed | Database mining for selection of SNP markers useful in admixture mapping |
title_short | Database mining for selection of SNP markers useful in admixture mapping |
title_sort | database mining for selection of snp markers useful in admixture mapping |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2649128/ https://www.ncbi.nlm.nih.gov/pubmed/19216798 http://dx.doi.org/10.1186/1756-0381-2-1 |
work_keys_str_mv | AT bayetesfayem databaseminingforselectionofsnpmarkersusefulinadmixturemapping AT tiwarihemantk databaseminingforselectionofsnpmarkersusefulinadmixturemapping AT allisondavidb databaseminingforselectionofsnpmarkersusefulinadmixturemapping AT gorodneyc databaseminingforselectionofsnpmarkersusefulinadmixturemapping |