
Fostering population-based cohort data discovery: The Maelstrom Research cataloguing toolkit

BACKGROUND: The lack of accessible and structured documentation creates major barriers for investigators interested in understanding, properly interpreting and analyzing cohort data and biological samples. Providing the scientific community with open information is essential to optimize usage of the...


Bibliographic Details
Main authors: Bergeron, Julie; Doiron, Dany; Marcon, Yannick; Ferretti, Vincent; Fortier, Isabel
Format: Online Article Text
Language: English
Published: Public Library of Science 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6057635/
https://www.ncbi.nlm.nih.gov/pubmed/30040866
http://dx.doi.org/10.1371/journal.pone.0200926
author Bergeron, Julie
Doiron, Dany
Marcon, Yannick
Ferretti, Vincent
Fortier, Isabel
collection PubMed
description BACKGROUND: The lack of accessible and structured documentation creates major barriers for investigators interested in understanding, properly interpreting and analyzing cohort data and biological samples. Providing the scientific community with open information is essential to optimize usage of these resources. A cataloguing toolkit is proposed by Maelstrom Research to answer these needs and support the creation of comprehensive and user-friendly study- and network-specific web-based metadata catalogues.
METHODS: Development of the Maelstrom Research cataloguing toolkit was initiated in 2004. It was supported by the exploration of existing catalogues and standards, and guided by input from partner initiatives having used or pilot tested incremental versions of the toolkit.
RESULTS: The cataloguing toolkit is built upon two main components: a metadata model and a suite of open-source software applications. The model sets out specific fields to describe study profiles; characteristics of the subpopulations of participants; timing and design of data collection events; and datasets/variables collected at each data collection event. It also includes the possibility to annotate variables with different classification schemes. When combined, the model and software support implementation of study and variable catalogues and provide a powerful search engine to facilitate data discovery.
CONCLUSIONS: The Maelstrom Research cataloguing toolkit already serves several national and international initiatives and the suite of software is available to new initiatives through the Maelstrom Research website. With the support of new and existing partners, we hope to ensure regular improvements of the toolkit.
format Online
Article
Text
id pubmed-6057635
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6057635 2018-08-06 Fostering population-based cohort data discovery: The Maelstrom Research cataloguing toolkit. Bergeron, Julie; Doiron, Dany; Marcon, Yannick; Ferretti, Vincent; Fortier, Isabel. PLoS One. Research Article.
Public Library of Science 2018-07-24 /pmc/articles/PMC6057635/ /pubmed/30040866 http://dx.doi.org/10.1371/journal.pone.0200926 Text en © 2018 Bergeron et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title Fostering population-based cohort data discovery: The Maelstrom Research cataloguing toolkit
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6057635/
https://www.ncbi.nlm.nih.gov/pubmed/30040866
http://dx.doi.org/10.1371/journal.pone.0200926