Cargando…

Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results

Human geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer m...

Descripción completa

Detalles Bibliográficos
Autores principales: Walters, Kimberly A., Huang, Yungui, Azaro, Marco, Tobin, Kathleen, Lehner, Thomas, Brzustowicz, Linda M., Vieland, Veronica J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3891773/
https://www.ncbi.nlm.nih.gov/pubmed/24454738
http://dx.doi.org/10.1371/journal.pone.0084696
_version_ 1782299422985551872
author Walters, Kimberly A.
Huang, Yungui
Azaro, Marco
Tobin, Kathleen
Lehner, Thomas
Brzustowicz, Linda M.
Vieland, Veronica J.
author_facet Walters, Kimberly A.
Huang, Yungui
Azaro, Marco
Tobin, Kathleen
Lehner, Thomas
Brzustowicz, Linda M.
Vieland, Veronica J.
author_sort Walters, Kimberly A.
collection PubMed
description Human geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer many advantages, including the ability to return to previously collected data to apply new analytic techniques, they also have some limitations. To illustrate, we reviewed data from seven older schizophrenia studies available from the NIMH-funded Center for Collaborative Genomic Studies on Mental Disorders, also known as the Human Genetics Initiative (HGI), and assessed the impact of data cleaning and regularization on linkage analyses. Extensive data regularization protocols were developed and applied to both genotypic and phenotypic data. Genome-wide nonparametric linkage (NPL) statistics were computed for each study, over various stages of data processing. To assess the impact of data processing on aggregate results, Genome-Scan Meta-Analysis (GSMA) was performed. Examples of increased, reduced and shifted linkage peaks were found when comparing linkage results based on original HGI data to results using post-processed data within the same set of pedigrees. Interestingly, reducing the number of affected individuals tended to increase rather than decrease linkage peaks. But most importantly, while the effects of data regularization within individual data sets were small, GSMA applied to the data in aggregate yielded a substantially different picture after data regularization. These results have implications for analyses based on other types of data (e.g., case-control GWAS or sequencing data) as well as data obtained from other repositories.
format Online
Article
Text
id pubmed-3891773
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38917732014-01-21 Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results Walters, Kimberly A. Huang, Yungui Azaro, Marco Tobin, Kathleen Lehner, Thomas Brzustowicz, Linda M. Vieland, Veronica J. PLoS One Research Article Human geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer many advantages, including the ability to return to previously collected data to apply new analytic techniques, they also have some limitations. To illustrate, we reviewed data from seven older schizophrenia studies available from the NIMH-funded Center for Collaborative Genomic Studies on Mental Disorders, also known as the Human Genetics Initiative (HGI), and assessed the impact of data cleaning and regularization on linkage analyses. Extensive data regularization protocols were developed and applied to both genotypic and phenotypic data. Genome-wide nonparametric linkage (NPL) statistics were computed for each study, over various stages of data processing. To assess the impact of data processing on aggregate results, Genome-Scan Meta-Analysis (GSMA) was performed. Examples of increased, reduced and shifted linkage peaks were found when comparing linkage results based on original HGI data to results using post-processed data within the same set of pedigrees. Interestingly, reducing the number of affected individuals tended to increase rather than decrease linkage peaks. But most importantly, while the effects of data regularization within individual data sets were small, GSMA applied to the data in aggregate yielded a substantially different picture after data regularization. These results have implications for analyses based on other types of data (e.g., case-control GWAS or sequencing data) as well as data obtained from other repositories. Public Library of Science 2014-01-14 /pmc/articles/PMC3891773/ /pubmed/24454738 http://dx.doi.org/10.1371/journal.pone.0084696 Text en © 2014 Walters et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Walters, Kimberly A.
Huang, Yungui
Azaro, Marco
Tobin, Kathleen
Lehner, Thomas
Brzustowicz, Linda M.
Vieland, Veronica J.
Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results
title Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results
title_full Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results
title_fullStr Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results
title_full_unstemmed Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results
title_short Meta-Analysis of Repository Data: Impact of Data Regularization on NIMH Schizophrenia Linkage Results
title_sort meta-analysis of repository data: impact of data regularization on nimh schizophrenia linkage results
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3891773/
https://www.ncbi.nlm.nih.gov/pubmed/24454738
http://dx.doi.org/10.1371/journal.pone.0084696
work_keys_str_mv AT walterskimberlya metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT huangyungui metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT azaromarco metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT tobinkathleen metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT lehnerthomas metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT brzustowiczlindam metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT vielandveronicaj metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults