Cargando…
A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
BACKGROUND: One of the challenges of the analysis of pooling-based genome wide association studies is to identify authentic associations among potentially thousands of false positive associations. RESULTS: We present a hierarchical and modular approach to the analysis of genome wide genotype data th...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2248205/ https://www.ncbi.nlm.nih.gov/pubmed/18194558 http://dx.doi.org/10.1186/1471-2156-9-6 |
_version_ | 1782150979943137280 |
---|---|
author | Sebastiani, Paola Zhao, Zhenming Abad-Grau, Maria M Riva, Alberto Hartley, Stephen W Sedgewick, Amanda E Doria, Alessandro Montano, Monty Melista, Efthymia Terry, Dellara Perls, Thomas T Steinberg, Martin H Baldwin, Clinton T |
author_facet | Sebastiani, Paola Zhao, Zhenming Abad-Grau, Maria M Riva, Alberto Hartley, Stephen W Sedgewick, Amanda E Doria, Alessandro Montano, Monty Melista, Efthymia Terry, Dellara Perls, Thomas T Steinberg, Martin H Baldwin, Clinton T |
author_sort | Sebastiani, Paola |
collection | PubMed |
description | BACKGROUND: One of the challenges of the analysis of pooling-based genome wide association studies is to identify authentic associations among potentially thousands of false positive associations. RESULTS: We present a hierarchical and modular approach to the analysis of genome wide genotype data that incorporates quality control, linkage disequilibrium, physical distance and gene ontology to identify authentic associations among those found by statistical association tests. The method is developed for the allelic association analysis of pooled DNA samples, but it can be easily generalized to the analysis of individually genotyped samples. We evaluate the approach using data sets from diverse genome wide association studies including fetal hemoglobin levels in sickle cell anemia and a sample of centenarians and show that the approach is highly reproducible and allows for discovery at different levels of synthesis. CONCLUSION: Results from the integration of Bayesian tests and other machine learning techniques with linkage disequilibrium data suggest that we do not need to use too stringent thresholds to reduce the number of false positive associations. This method yields increased power even with relatively small samples. In fact, our evaluation shows that the method can reach almost 70% sensitivity with samples of only 100 subjects. |
format | Text |
id | pubmed-2248205 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22482052008-02-20 A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples Sebastiani, Paola Zhao, Zhenming Abad-Grau, Maria M Riva, Alberto Hartley, Stephen W Sedgewick, Amanda E Doria, Alessandro Montano, Monty Melista, Efthymia Terry, Dellara Perls, Thomas T Steinberg, Martin H Baldwin, Clinton T BMC Genet Methodology Article BACKGROUND: One of the challenges of the analysis of pooling-based genome wide association studies is to identify authentic associations among potentially thousands of false positive associations. RESULTS: We present a hierarchical and modular approach to the analysis of genome wide genotype data that incorporates quality control, linkage disequilibrium, physical distance and gene ontology to identify authentic associations among those found by statistical association tests. The method is developed for the allelic association analysis of pooled DNA samples, but it can be easily generalized to the analysis of individually genotyped samples. We evaluate the approach using data sets from diverse genome wide association studies including fetal hemoglobin levels in sickle cell anemia and a sample of centenarians and show that the approach is highly reproducible and allows for discovery at different levels of synthesis. CONCLUSION: Results from the integration of Bayesian tests and other machine learning techniques with linkage disequilibrium data suggest that we do not need to use too stringent thresholds to reduce the number of false positive associations. This method yields increased power even with relatively small samples. In fact, our evaluation shows that the method can reach almost 70% sensitivity with samples of only 100 subjects. BioMed Central 2008-01-14 /pmc/articles/PMC2248205/ /pubmed/18194558 http://dx.doi.org/10.1186/1471-2156-9-6 Text en Copyright © 2008 Sebastiani et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Sebastiani, Paola Zhao, Zhenming Abad-Grau, Maria M Riva, Alberto Hartley, Stephen W Sedgewick, Amanda E Doria, Alessandro Montano, Monty Melista, Efthymia Terry, Dellara Perls, Thomas T Steinberg, Martin H Baldwin, Clinton T A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples |
title | A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples |
title_full | A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples |
title_fullStr | A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples |
title_full_unstemmed | A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples |
title_short | A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples |
title_sort | hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled dna samples |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2248205/ https://www.ncbi.nlm.nih.gov/pubmed/18194558 http://dx.doi.org/10.1186/1471-2156-9-6 |
work_keys_str_mv | AT sebastianipaola ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT zhaozhenming ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT abadgraumariam ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT rivaalberto ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT hartleystephenw ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT sedgewickamandae ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT doriaalessandro ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT montanomonty ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT melistaefthymia ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT terrydellara ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT perlsthomast ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT steinbergmartinh ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT baldwinclintont ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT sebastianipaola hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT zhaozhenming hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT abadgraumariam hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT rivaalberto hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT hartleystephenw hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT sedgewickamandae hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT doriaalessandro hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT montanomonty hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT melistaefthymia hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT terrydellara hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT perlsthomast hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT steinbergmartinh hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples AT baldwinclintont hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples |