Cargando…

ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data

Background: Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Carter, Kim W, Francis, Richard W, Carter, KW, Francis, RW, Bresnahan, M, Gissler, M, Grønborg, TK, Gross, R, Gunnes, N, Hammond, G, Hornig, M, Hultman, CM, Huttunen, J, Langridge, A, Leonard, H, Newman, S, Parner, ET, Petersson, G, Reichenberg, A, Sandin, S, Schendel, DE, Schalkwyk, L, Sourander, A, Steadman, C, Stoltenberg, C, Suominen, A, Surén, P, Susser, E, Sylvester Vethanayagam, A, Yusof, Z
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4864874/
https://www.ncbi.nlm.nih.gov/pubmed/26452388
http://dx.doi.org/10.1093/ije/dyv193
_version_ 1782431692927008768
author Carter, Kim W
Francis, Richard W
Carter, KW
Francis, RW
Bresnahan, M
Gissler, M
Grønborg, TK
Gross, R
Gunnes, N
Hammond, G
Hornig, M
Hultman, CM
Huttunen, J
Langridge, A
Leonard, H
Newman, S
Parner, ET
Petersson, G
Reichenberg, A
Sandin, S
Schendel, DE
Schalkwyk, L
Sourander, A
Steadman, C
Stoltenberg, C
Suominen, A
Surén, P
Susser, E
Sylvester Vethanayagam, A
Yusof, Z
author_facet Carter, Kim W
Francis, Richard W
Carter, KW
Francis, RW
Bresnahan, M
Gissler, M
Grønborg, TK
Gross, R
Gunnes, N
Hammond, G
Hornig, M
Hultman, CM
Huttunen, J
Langridge, A
Leonard, H
Newman, S
Parner, ET
Petersson, G
Reichenberg, A
Sandin, S
Schendel, DE
Schalkwyk, L
Sourander, A
Steadman, C
Stoltenberg, C
Suominen, A
Surén, P
Susser, E
Sylvester Vethanayagam, A
Yusof, Z
author_sort Carter, Kim W
collection PubMed
description Background: Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this process. Methods that facilitate the sharing of research data that are sympathetic with these issues and which allow flexible and detailed statistical analyses are therefore in critical need. We have created a software platform for the Virtual Pooling and Analysis of Research data (ViPAR), which employs free and open source methods to provide researchers with a web-based platform to analyse datasets housed in disparate locations. Methods: Database federation permits controlled access to remotely located datasets from a central location. The Secure Shell protocol allows data to be securely exchanged between devices over an insecure network. ViPAR combines these free technologies into a solution that facilitates ‘virtual pooling’ where data can be temporarily pooled into computer memory and made available for analysis without the need for permanent central storage. Results: Within the ViPAR infrastructure, remote sites manage their own harmonized research dataset in a database hosted at their site, while a central server hosts the data federation component and a secure analysis portal. When an analysis is initiated, requested data are retrieved from each remote site and virtually pooled at the central site. The data are then analysed by statistical software and, on completion, results of the analysis are returned to the user and the virtually pooled data are removed from memory. Conclusions: ViPAR is a secure, flexible and powerful analysis platform built on open source technology that is currently in use by large international consortia, and is made publicly available at [ http://bioinformatics.childhealthresearch.org.au/software/vipar/ ].
format Online
Article
Text
id pubmed-4864874
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48648742016-05-13 ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data Carter, Kim W Francis, Richard W Carter, KW Francis, RW Bresnahan, M Gissler, M Grønborg, TK Gross, R Gunnes, N Hammond, G Hornig, M Hultman, CM Huttunen, J Langridge, A Leonard, H Newman, S Parner, ET Petersson, G Reichenberg, A Sandin, S Schendel, DE Schalkwyk, L Sourander, A Steadman, C Stoltenberg, C Suominen, A Surén, P Susser, E Sylvester Vethanayagam, A Yusof, Z Int J Epidemiol Miscellaneous Background: Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this process. Methods that facilitate the sharing of research data that are sympathetic with these issues and which allow flexible and detailed statistical analyses are therefore in critical need. We have created a software platform for the Virtual Pooling and Analysis of Research data (ViPAR), which employs free and open source methods to provide researchers with a web-based platform to analyse datasets housed in disparate locations. Methods: Database federation permits controlled access to remotely located datasets from a central location. The Secure Shell protocol allows data to be securely exchanged between devices over an insecure network. ViPAR combines these free technologies into a solution that facilitates ‘virtual pooling’ where data can be temporarily pooled into computer memory and made available for analysis without the need for permanent central storage. Results: Within the ViPAR infrastructure, remote sites manage their own harmonized research dataset in a database hosted at their site, while a central server hosts the data federation component and a secure analysis portal. When an analysis is initiated, requested data are retrieved from each remote site and virtually pooled at the central site. The data are then analysed by statistical software and, on completion, results of the analysis are returned to the user and the virtually pooled data are removed from memory. Conclusions: ViPAR is a secure, flexible and powerful analysis platform built on open source technology that is currently in use by large international consortia, and is made publicly available at [ http://bioinformatics.childhealthresearch.org.au/software/vipar/ ]. Oxford University Press 2016-04 2015-10-08 /pmc/articles/PMC4864874/ /pubmed/26452388 http://dx.doi.org/10.1093/ije/dyv193 Text en © The Author 2015. Published by Oxford University Press on behalf of the International Epidemiological Association http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/4.0/ ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Miscellaneous
Carter, Kim W
Francis, Richard W
Carter, KW
Francis, RW
Bresnahan, M
Gissler, M
Grønborg, TK
Gross, R
Gunnes, N
Hammond, G
Hornig, M
Hultman, CM
Huttunen, J
Langridge, A
Leonard, H
Newman, S
Parner, ET
Petersson, G
Reichenberg, A
Sandin, S
Schendel, DE
Schalkwyk, L
Sourander, A
Steadman, C
Stoltenberg, C
Suominen, A
Surén, P
Susser, E
Sylvester Vethanayagam, A
Yusof, Z
ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data
title ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data
title_full ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data
title_fullStr ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data
title_full_unstemmed ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data
title_short ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data
title_sort vipar: a software platform for the virtual pooling and analysis of research data
topic Miscellaneous
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4864874/
https://www.ncbi.nlm.nih.gov/pubmed/26452388
http://dx.doi.org/10.1093/ije/dyv193
work_keys_str_mv AT carterkimw viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT francisrichardw viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT carterkw viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT francisrw viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT bresnahanm viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT gisslerm viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT grønborgtk viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT grossr viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT gunnesn viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT hammondg viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT hornigm viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT hultmancm viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT huttunenj viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT langridgea viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT leonardh viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT newmans viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT parneret viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT peterssong viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT reichenberga viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT sandins viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT schendelde viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT schalkwykl viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT sourandera viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT steadmanc viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT stoltenbergc viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT suominena viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT surenp viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT sussere viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT sylvestervethanayagama viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata
AT yusofz viparasoftwareplatformforthevirtualpoolingandanalysisofresearchdata