Cargando…

Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes

Parallel meta-analysis is a popular approach for increasing the power to detect genetic effects in genome-wide association studies across multiple cohorts. Consortia studying the genetics of behavioral phenotypes are oftentimes faced with systematic differences in phenotype measurement across cohort...

Descripción completa

Detalles Bibliográficos
Autores principales: Luningham, Justin M., McArtor, Daniel B., Hendriks, Anne M., van Beijsterveldt, Catharina E. M., Lichtenstein, Paul, Lundström, Sebastian, Larsson, Henrik, Bartels, Meike, Boomsma, Dorret I., Lubke, Gitta H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6914843/
https://www.ncbi.nlm.nih.gov/pubmed/31921287
http://dx.doi.org/10.3389/fgene.2019.01227
_version_ 1783479894200549376
author Luningham, Justin M.
McArtor, Daniel B.
Hendriks, Anne M.
van Beijsterveldt, Catharina E. M.
Lichtenstein, Paul
Lundström, Sebastian
Larsson, Henrik
Bartels, Meike
Boomsma, Dorret I.
Lubke, Gitta H.
author_facet Luningham, Justin M.
McArtor, Daniel B.
Hendriks, Anne M.
van Beijsterveldt, Catharina E. M.
Lichtenstein, Paul
Lundström, Sebastian
Larsson, Henrik
Bartels, Meike
Boomsma, Dorret I.
Lubke, Gitta H.
author_sort Luningham, Justin M.
collection PubMed
description Parallel meta-analysis is a popular approach for increasing the power to detect genetic effects in genome-wide association studies across multiple cohorts. Consortia studying the genetics of behavioral phenotypes are oftentimes faced with systematic differences in phenotype measurement across cohorts, introducing heterogeneity into the meta-analysis and reducing statistical power. This study investigated integrative data analysis (IDA) as an approach for jointly modeling the phenotype across multiple datasets. We put forth a bi-factor integration model (BFIM) that provides a single common phenotype score and accounts for sources of study-specific variability in the phenotype. In order to capitalize on this modeling strategy, a phenotype reference panel was utilized as a supplemental sample with complete data on all behavioral measures. A simulation study showed that a mega-analysis of genetic variant effects in a BFIM were more powerful than meta-analysis of genetic effects on a cohort-specific sum score of items. Saving the factor scores from the BFIM and using those as the outcome in meta-analysis was also more powerful than the sum score in most simulation conditions, but a small degree of bias was introduced by this approach. The reference panel was necessary to realize these power gains. An empirical demonstration used the BFIM to harmonize aggression scores in 9-year old children across the Netherlands Twin Register and the Child and Adolescent Twin Study in Sweden, providing a template for application of the BFIM to a range of different phenotypes. A supplemental data collection in the Netherlands Twin Register served as a reference panel for phenotype modeling across both cohorts. Our results indicate that model-based harmonization for the study of complex traits is a useful step within genetic consortia.
format Online
Article
Text
id pubmed-6914843
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-69148432020-01-09 Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes Luningham, Justin M. McArtor, Daniel B. Hendriks, Anne M. van Beijsterveldt, Catharina E. M. Lichtenstein, Paul Lundström, Sebastian Larsson, Henrik Bartels, Meike Boomsma, Dorret I. Lubke, Gitta H. Front Genet Genetics Parallel meta-analysis is a popular approach for increasing the power to detect genetic effects in genome-wide association studies across multiple cohorts. Consortia studying the genetics of behavioral phenotypes are oftentimes faced with systematic differences in phenotype measurement across cohorts, introducing heterogeneity into the meta-analysis and reducing statistical power. This study investigated integrative data analysis (IDA) as an approach for jointly modeling the phenotype across multiple datasets. We put forth a bi-factor integration model (BFIM) that provides a single common phenotype score and accounts for sources of study-specific variability in the phenotype. In order to capitalize on this modeling strategy, a phenotype reference panel was utilized as a supplemental sample with complete data on all behavioral measures. A simulation study showed that a mega-analysis of genetic variant effects in a BFIM were more powerful than meta-analysis of genetic effects on a cohort-specific sum score of items. Saving the factor scores from the BFIM and using those as the outcome in meta-analysis was also more powerful than the sum score in most simulation conditions, but a small degree of bias was introduced by this approach. The reference panel was necessary to realize these power gains. An empirical demonstration used the BFIM to harmonize aggression scores in 9-year old children across the Netherlands Twin Register and the Child and Adolescent Twin Study in Sweden, providing a template for application of the BFIM to a range of different phenotypes. A supplemental data collection in the Netherlands Twin Register served as a reference panel for phenotype modeling across both cohorts. Our results indicate that model-based harmonization for the study of complex traits is a useful step within genetic consortia. Frontiers Media S.A. 2019-12-10 /pmc/articles/PMC6914843/ /pubmed/31921287 http://dx.doi.org/10.3389/fgene.2019.01227 Text en Copyright © 2019 Luningham, McArtor, Hendriks, van Beijsterveldt, Lichtenstein, Lundström, Larsson, Bartels, Boomsma and Lubke http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Luningham, Justin M.
McArtor, Daniel B.
Hendriks, Anne M.
van Beijsterveldt, Catharina E. M.
Lichtenstein, Paul
Lundström, Sebastian
Larsson, Henrik
Bartels, Meike
Boomsma, Dorret I.
Lubke, Gitta H.
Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes
title Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes
title_full Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes
title_fullStr Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes
title_full_unstemmed Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes
title_short Data Integration Methods for Phenotype Harmonization in Multi-Cohort Genome-Wide Association Studies With Behavioral Outcomes
title_sort data integration methods for phenotype harmonization in multi-cohort genome-wide association studies with behavioral outcomes
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6914843/
https://www.ncbi.nlm.nih.gov/pubmed/31921287
http://dx.doi.org/10.3389/fgene.2019.01227
work_keys_str_mv AT luninghamjustinm dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT mcartordanielb dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT hendriksannem dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT vanbeijsterveldtcatharinaem dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT lichtensteinpaul dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT lundstromsebastian dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT larssonhenrik dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT bartelsmeike dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT boomsmadorreti dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes
AT lubkegittah dataintegrationmethodsforphenotypeharmonizationinmulticohortgenomewideassociationstudieswithbehavioraloutcomes