Cargando…
HASE: Framework for efficient high-dimensional association analyses
High-throughput technology can now provide rich information on a person’s biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigation...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5080584/ https://www.ncbi.nlm.nih.gov/pubmed/27782180 http://dx.doi.org/10.1038/srep36076 |
_version_ | 1782462748445114368 |
---|---|
author | Roshchupkin, G. V. Adams, H. H. H. Vernooij, M. W. Hofman, A. Van Duijn, C. M. Ikram, M. A. Niessen, W. J. |
author_facet | Roshchupkin, G. V. Adams, H. H. H. Vernooij, M. W. Hofman, A. Van Duijn, C. M. Ikram, M. A. Niessen, W. J. |
author_sort | Roshchupkin, G. V. |
collection | PubMed |
description | High-throughput technology can now provide rich information on a person’s biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigations between several high-throughput technologies remain impractical due to demanding computational requirements (hundreds of years of computing resources) and unsuitability for collaborative settings (terabytes of data to share). Here we introduce the HASE framework that overcomes both of these issues. Our approach dramatically reduces computational time from years to only hours and also requires several gigabytes to be exchanged between collaborators. We implemented a novel meta-analytical method that yields identical power as pooled analyses without the need of sharing individual participant data. The efficiency of the framework is illustrated by associating 9 million genetic variants with 1.5 million brain imaging voxels in three cohorts (total N = 4,034) followed by meta-analysis, on a standard computational infrastructure. These experiments indicate that HASE facilitates high-dimensional association studies enabling large multicenter association studies for future discoveries. |
format | Online Article Text |
id | pubmed-5080584 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-50805842016-10-31 HASE: Framework for efficient high-dimensional association analyses Roshchupkin, G. V. Adams, H. H. H. Vernooij, M. W. Hofman, A. Van Duijn, C. M. Ikram, M. A. Niessen, W. J. Sci Rep Article High-throughput technology can now provide rich information on a person’s biological makeup and environmental surroundings. Important discoveries have been made by relating these data to various health outcomes in fields such as genomics, proteomics, and medical imaging. However, cross-investigations between several high-throughput technologies remain impractical due to demanding computational requirements (hundreds of years of computing resources) and unsuitability for collaborative settings (terabytes of data to share). Here we introduce the HASE framework that overcomes both of these issues. Our approach dramatically reduces computational time from years to only hours and also requires several gigabytes to be exchanged between collaborators. We implemented a novel meta-analytical method that yields identical power as pooled analyses without the need of sharing individual participant data. The efficiency of the framework is illustrated by associating 9 million genetic variants with 1.5 million brain imaging voxels in three cohorts (total N = 4,034) followed by meta-analysis, on a standard computational infrastructure. These experiments indicate that HASE facilitates high-dimensional association studies enabling large multicenter association studies for future discoveries. Nature Publishing Group 2016-10-26 /pmc/articles/PMC5080584/ /pubmed/27782180 http://dx.doi.org/10.1038/srep36076 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Roshchupkin, G. V. Adams, H. H. H. Vernooij, M. W. Hofman, A. Van Duijn, C. M. Ikram, M. A. Niessen, W. J. HASE: Framework for efficient high-dimensional association analyses |
title | HASE: Framework for efficient high-dimensional association analyses |
title_full | HASE: Framework for efficient high-dimensional association analyses |
title_fullStr | HASE: Framework for efficient high-dimensional association analyses |
title_full_unstemmed | HASE: Framework for efficient high-dimensional association analyses |
title_short | HASE: Framework for efficient high-dimensional association analyses |
title_sort | hase: framework for efficient high-dimensional association analyses |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5080584/ https://www.ncbi.nlm.nih.gov/pubmed/27782180 http://dx.doi.org/10.1038/srep36076 |
work_keys_str_mv | AT roshchupkingv haseframeworkforefficienthighdimensionalassociationanalyses AT adamshhh haseframeworkforefficienthighdimensionalassociationanalyses AT vernooijmw haseframeworkforefficienthighdimensionalassociationanalyses AT hofmana haseframeworkforefficienthighdimensionalassociationanalyses AT vanduijncm haseframeworkforefficienthighdimensionalassociationanalyses AT ikramma haseframeworkforefficienthighdimensionalassociationanalyses AT niessenwj haseframeworkforefficienthighdimensionalassociationanalyses |