Cargando…

HASE: Framework for efficient high-dimensional association analyses

High-throughput technology can now provide rich information on a person’s biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigation...

Descripción completa

Detalles Bibliográficos
Autores principales: Roshchupkin, G. V., Adams, H. H. H., Vernooij, M. W., Hofman, A., Van Duijn, C. M., Ikram, M. A., Niessen, W. J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5080584/
https://www.ncbi.nlm.nih.gov/pubmed/27782180
http://dx.doi.org/10.1038/srep36076
_version_ 1782462748445114368
author Roshchupkin, G. V.
Adams, H. H. H.
Vernooij, M. W.
Hofman, A.
Van Duijn, C. M.
Ikram, M. A.
Niessen, W. J.
author_facet Roshchupkin, G. V.
Adams, H. H. H.
Vernooij, M. W.
Hofman, A.
Van Duijn, C. M.
Ikram, M. A.
Niessen, W. J.
author_sort Roshchupkin, G. V.
collection PubMed
description High-throughput technology can now provide rich information on a person’s biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigations between several high-throughput technologies remain impractical due to demanding computational requirements (hundreds of years of computing resources) and unsuitability for collaborative settings (terabytes of data to share). Here we introduce the HASE framework that overcomes both of these issues. Our approach dramatically reduces computational time from years to only hours and also requires several gigabytes to be exchanged between collaborators. We implemented a novel meta-analytical method that yields identical power as pooled analyses without the need of sharing individual participant data. The efficiency of the framework is illustrated by associating 9 million genetic variants with 1.5 million brain imaging voxels in three cohorts (total N = 4,034) followed by meta-analysis, on a standard computational infrastructure. These experiments indicate that HASE facilitates high-dimensional association studies enabling large multicenter association studies for future discoveries.
format Online
Article
Text
id pubmed-5080584
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-50805842016-10-31 HASE: Framework for efficient high-dimensional association analyses Roshchupkin, G. V. Adams, H. H. H. Vernooij, M. W. Hofman, A. Van Duijn, C. M. Ikram, M. A. Niessen, W. J. Sci Rep Article High-throughput technology can now provide rich information on a person’s biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigations between several high-throughput technologies remain impractical due to demanding computational requirements (hundreds of years of computing resources) and unsuitability for collaborative settings (terabytes of data to share). Here we introduce the HASE framework that overcomes both of these issues. Our approach dramatically reduces computational time from years to only hours and also requires several gigabytes to be exchanged between collaborators. We implemented a novel meta-analytical method that yields identical power as pooled analyses without the need of sharing individual participant data. The efficiency of the framework is illustrated by associating 9 million genetic variants with 1.5 million brain imaging voxels in three cohorts (total N = 4,034) followed by meta-analysis, on a standard computational infrastructure. These experiments indicate that HASE facilitates high-dimensional association studies enabling large multicenter association studies for future discoveries. Nature Publishing Group 2016-10-26 /pmc/articles/PMC5080584/ /pubmed/27782180 http://dx.doi.org/10.1038/srep36076 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Roshchupkin, G. V.
Adams, H. H. H.
Vernooij, M. W.
Hofman, A.
Van Duijn, C. M.
Ikram, M. A.
Niessen, W. J.
HASE: Framework for efficient high-dimensional association analyses
title HASE: Framework for efficient high-dimensional association analyses
title_full HASE: Framework for efficient high-dimensional association analyses
title_fullStr HASE: Framework for efficient high-dimensional association analyses
title_full_unstemmed HASE: Framework for efficient high-dimensional association analyses
title_short HASE: Framework for efficient high-dimensional association analyses
title_sort hase: framework for efficient high-dimensional association analyses
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5080584/
https://www.ncbi.nlm.nih.gov/pubmed/27782180
http://dx.doi.org/10.1038/srep36076
work_keys_str_mv AT roshchupkingv haseframeworkforefficienthighdimensionalassociationanalyses
AT adamshhh haseframeworkforefficienthighdimensionalassociationanalyses
AT vernooijmw haseframeworkforefficienthighdimensionalassociationanalyses
AT hofmana haseframeworkforefficienthighdimensionalassociationanalyses
AT vanduijncm haseframeworkforefficienthighdimensionalassociationanalyses
AT ikramma haseframeworkforefficienthighdimensionalassociationanalyses
AT niessenwj haseframeworkforefficienthighdimensionalassociationanalyses