Cargando…

A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples

BACKGROUND: One of the challenges of the analysis of pooling-based genome wide association studies is to identify authentic associations among potentially thousands of false positive associations. RESULTS: We present a hierarchical and modular approach to the analysis of genome wide genotype data th...

Descripción completa

Detalles Bibliográficos
Autores principales: Sebastiani, Paola, Zhao, Zhenming, Abad-Grau, Maria M, Riva, Alberto, Hartley, Stephen W, Sedgewick, Amanda E, Doria, Alessandro, Montano, Monty, Melista, Efthymia, Terry, Dellara, Perls, Thomas T, Steinberg, Martin H, Baldwin, Clinton T
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2248205/
https://www.ncbi.nlm.nih.gov/pubmed/18194558
http://dx.doi.org/10.1186/1471-2156-9-6
_version_ 1782150979943137280
author Sebastiani, Paola
Zhao, Zhenming
Abad-Grau, Maria M
Riva, Alberto
Hartley, Stephen W
Sedgewick, Amanda E
Doria, Alessandro
Montano, Monty
Melista, Efthymia
Terry, Dellara
Perls, Thomas T
Steinberg, Martin H
Baldwin, Clinton T
author_facet Sebastiani, Paola
Zhao, Zhenming
Abad-Grau, Maria M
Riva, Alberto
Hartley, Stephen W
Sedgewick, Amanda E
Doria, Alessandro
Montano, Monty
Melista, Efthymia
Terry, Dellara
Perls, Thomas T
Steinberg, Martin H
Baldwin, Clinton T
author_sort Sebastiani, Paola
collection PubMed
description BACKGROUND: One of the challenges of the analysis of pooling-based genome wide association studies is to identify authentic associations among potentially thousands of false positive associations. RESULTS: We present a hierarchical and modular approach to the analysis of genome wide genotype data that incorporates quality control, linkage disequilibrium, physical distance and gene ontology to identify authentic associations among those found by statistical association tests. The method is developed for the allelic association analysis of pooled DNA samples, but it can be easily generalized to the analysis of individually genotyped samples. We evaluate the approach using data sets from diverse genome wide association studies including fetal hemoglobin levels in sickle cell anemia and a sample of centenarians and show that the approach is highly reproducible and allows for discovery at different levels of synthesis. CONCLUSION: Results from the integration of Bayesian tests and other machine learning techniques with linkage disequilibrium data suggest that we do not need to use too stringent thresholds to reduce the number of false positive associations. This method yields increased power even with relatively small samples. In fact, our evaluation shows that the method can reach almost 70% sensitivity with samples of only 100 subjects.
format Text
id pubmed-2248205
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22482052008-02-20 A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples Sebastiani, Paola Zhao, Zhenming Abad-Grau, Maria M Riva, Alberto Hartley, Stephen W Sedgewick, Amanda E Doria, Alessandro Montano, Monty Melista, Efthymia Terry, Dellara Perls, Thomas T Steinberg, Martin H Baldwin, Clinton T BMC Genet Methodology Article BACKGROUND: One of the challenges of the analysis of pooling-based genome wide association studies is to identify authentic associations among potentially thousands of false positive associations. RESULTS: We present a hierarchical and modular approach to the analysis of genome wide genotype data that incorporates quality control, linkage disequilibrium, physical distance and gene ontology to identify authentic associations among those found by statistical association tests. The method is developed for the allelic association analysis of pooled DNA samples, but it can be easily generalized to the analysis of individually genotyped samples. We evaluate the approach using data sets from diverse genome wide association studies including fetal hemoglobin levels in sickle cell anemia and a sample of centenarians and show that the approach is highly reproducible and allows for discovery at different levels of synthesis. CONCLUSION: Results from the integration of Bayesian tests and other machine learning techniques with linkage disequilibrium data suggest that we do not need to use too stringent thresholds to reduce the number of false positive associations. This method yields increased power even with relatively small samples. In fact, our evaluation shows that the method can reach almost 70% sensitivity with samples of only 100 subjects. BioMed Central 2008-01-14 /pmc/articles/PMC2248205/ /pubmed/18194558 http://dx.doi.org/10.1186/1471-2156-9-6 Text en Copyright © 2008 Sebastiani et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Sebastiani, Paola
Zhao, Zhenming
Abad-Grau, Maria M
Riva, Alberto
Hartley, Stephen W
Sedgewick, Amanda E
Doria, Alessandro
Montano, Monty
Melista, Efthymia
Terry, Dellara
Perls, Thomas T
Steinberg, Martin H
Baldwin, Clinton T
A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
title A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
title_full A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
title_fullStr A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
title_full_unstemmed A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
title_short A hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled DNA samples
title_sort hierarchical and modular approach to the discovery of robust associations in genome-wide association studies from pooled dna samples
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2248205/
https://www.ncbi.nlm.nih.gov/pubmed/18194558
http://dx.doi.org/10.1186/1471-2156-9-6
work_keys_str_mv AT sebastianipaola ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT zhaozhenming ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT abadgraumariam ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT rivaalberto ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT hartleystephenw ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT sedgewickamandae ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT doriaalessandro ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT montanomonty ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT melistaefthymia ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT terrydellara ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT perlsthomast ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT steinbergmartinh ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT baldwinclintont ahierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT sebastianipaola hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT zhaozhenming hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT abadgraumariam hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT rivaalberto hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT hartleystephenw hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT sedgewickamandae hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT doriaalessandro hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT montanomonty hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT melistaefthymia hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT terrydellara hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT perlsthomast hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT steinbergmartinh hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples
AT baldwinclintont hierarchicalandmodularapproachtothediscoveryofrobustassociationsingenomewideassociationstudiesfrompooleddnasamples