Cargando…

Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors

In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false...

Descripción completa

Detalles Bibliográficos
Autores principales: Ciesielski, Timothy H, Pendergrass, Sarah A, White, Marquitta J, Kodaman, Nuri, Sobota, Rafal S, Huang, Minjun, Bartlett, Jacquelaine, Li, Jing, Pan, Qinxin, Gui, Jiang, Selleck, Scott B, Amos, Christopher I, Ritchie, Marylyn D, Moore, Jason H, Williams, Scott M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4112852/
https://www.ncbi.nlm.nih.gov/pubmed/25071867
http://dx.doi.org/10.1186/1756-0381-7-10
_version_ 1782328221763633152
author Ciesielski, Timothy H
Pendergrass, Sarah A
White, Marquitta J
Kodaman, Nuri
Sobota, Rafal S
Huang, Minjun
Bartlett, Jacquelaine
Li, Jing
Pan, Qinxin
Gui, Jiang
Selleck, Scott B
Amos, Christopher I
Ritchie, Marylyn D
Moore, Jason H
Williams, Scott M
author_facet Ciesielski, Timothy H
Pendergrass, Sarah A
White, Marquitta J
Kodaman, Nuri
Sobota, Rafal S
Huang, Minjun
Bartlett, Jacquelaine
Li, Jing
Pan, Qinxin
Gui, Jiang
Selleck, Scott B
Amos, Christopher I
Ritchie, Marylyn D
Moore, Jason H
Williams, Scott M
author_sort Ciesielski, Timothy H
collection PubMed
description In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions.
format Online
Article
Text
id pubmed-4112852
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41128522014-07-29 Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors Ciesielski, Timothy H Pendergrass, Sarah A White, Marquitta J Kodaman, Nuri Sobota, Rafal S Huang, Minjun Bartlett, Jacquelaine Li, Jing Pan, Qinxin Gui, Jiang Selleck, Scott B Amos, Christopher I Ritchie, Marylyn D Moore, Jason H Williams, Scott M BioData Min Methodology In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions. BioMed Central 2014-06-30 /pmc/articles/PMC4112852/ /pubmed/25071867 http://dx.doi.org/10.1186/1756-0381-7-10 Text en Copyright © 2014 Ciesielski et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Ciesielski, Timothy H
Pendergrass, Sarah A
White, Marquitta J
Kodaman, Nuri
Sobota, Rafal S
Huang, Minjun
Bartlett, Jacquelaine
Li, Jing
Pan, Qinxin
Gui, Jiang
Selleck, Scott B
Amos, Christopher I
Ritchie, Marylyn D
Moore, Jason H
Williams, Scott M
Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
title Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
title_full Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
title_fullStr Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
title_full_unstemmed Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
title_short Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
title_sort diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4112852/
https://www.ncbi.nlm.nih.gov/pubmed/25071867
http://dx.doi.org/10.1186/1756-0381-7-10
work_keys_str_mv AT ciesielskitimothyh diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT pendergrasssaraha diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT whitemarquittaj diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT kodamannuri diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT sobotarafals diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT huangminjun diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT bartlettjacquelaine diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT lijing diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT panqinxin diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT guijiang diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT selleckscottb diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT amoschristopheri diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT ritchiemarylynd diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT moorejasonh diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors
AT williamsscottm diverseconvergentevidenceinthegeneticanalysisofcomplexdiseasecoordinatingomicinformaticandexperimentalevidencetobetteridentifyandvalidateriskfactors