Foundry: a message-oriented, horizontally scalable ETL system for scientific data integration and enhancement

Data generated by scientific research enables further advancement in science through reanalyses and pooling of data for novel analyses. With the increasing amounts of scientific data generated by biomedical research providing researchers with more data than they have ever had access to, finding the...

Bibliographic Details
Main Authors: Ozyurt, Ibrahim Burak; Grethe, Jeffrey S
Format: Online Article Text
Language: English
Published: Oxford University Press 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6301337/
https://www.ncbi.nlm.nih.gov/pubmed/30576493
http://dx.doi.org/10.1093/database/bay130
collection PubMed
description Data generated by scientific research enables further advancement in science through reanalyses and pooling of data for novel analyses. With the increasing amounts of scientific data generated by biomedical research providing researchers with more data than they have ever had access to, finding the data matching the researchers' requirements continues to be a major challenge and will only grow more challenging as more data is produced and shared. In this paper, we introduce a horizontally scalable distributed extract-transform-load system to tackle scientific data aggregation, transformation and enhancement for scientific data discovery and retrieval. We also introduce a data transformation language for biomedical curators allowing for the transformation and combination of data/metadata from heterogeneous data sources. Applicability of the system for scientific data is illustrated in biomedical and earth science domains.
id pubmed-6301337
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-6301337 2018-12-27
Database (Oxford)
Original Article
Oxford University Press 2018-12-17
/pmc/articles/PMC6301337/ /pubmed/30576493 http://dx.doi.org/10.1093/database/bay130
Text en
© The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
topic Original Article