Cargando…

Computational framework to support integration of biomolecular and clinical data within a translational approach

BACKGROUND: The use of the knowledge produced by sciences to promote human health is the main goal of translational medicine. To make it feasible we need computational methods to handle the large amount of information that arises from bench to bedside and to deal with its heterogeneity. A computatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Miyoshi, Newton Shydeo Brandão, Pinheiro, Daniel Guariz, Silva, Wilson Araújo, Felipe, Joaquim Cezar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3688149/
https://www.ncbi.nlm.nih.gov/pubmed/23742129
http://dx.doi.org/10.1186/1471-2105-14-180
_version_ 1782476149049262080
author Miyoshi, Newton Shydeo Brandão
Pinheiro, Daniel Guariz
Silva, Wilson Araújo
Felipe, Joaquim Cezar
author_facet Miyoshi, Newton Shydeo Brandão
Pinheiro, Daniel Guariz
Silva, Wilson Araújo
Felipe, Joaquim Cezar
author_sort Miyoshi, Newton Shydeo Brandão
collection PubMed
description BACKGROUND: The use of the knowledge produced by sciences to promote human health is the main goal of translational medicine. To make it feasible we need computational methods to handle the large amount of information that arises from bench to bedside and to deal with its heterogeneity. A computational challenge that must be faced is to promote the integration of clinical, socio-demographic and biological data. In this effort, ontologies play an essential role as a powerful artifact for knowledge representation. Chado is a modular ontology-oriented database model that gained popularity due to its robustness and flexibility as a generic platform to store biological data; however it lacks supporting representation of clinical and socio-demographic information. RESULTS: We have implemented an extension of Chado – the Clinical Module - to allow the representation of this kind of information. Our approach consists of a framework for data integration through the use of a common reference ontology. The design of this framework has four levels: data level, to store the data; semantic level, to integrate and standardize the data by the use of ontologies; application level, to manage clinical databases, ontologies and data integration process; and web interface level, to allow interaction between the user and the system. The clinical module was built based on the Entity-Attribute-Value (EAV) model. We also proposed a methodology to migrate data from legacy clinical databases to the integrative framework. A Chado instance was initialized using a relational database management system. The Clinical Module was implemented and the framework was loaded using data from a factual clinical research database. Clinical and demographic data as well as biomaterial data were obtained from patients with tumors of head and neck. We implemented the IPTrans tool that is a complete environment for data migration, which comprises: the construction of a model to describe the legacy clinical data, based on an ontology; the Extraction, Transformation and Load (ETL) process to extract the data from the source clinical database and load it in the Clinical Module of Chado; the development of a web tool and a Bridge Layer to adapt the web tool to Chado, as well as other applications. CONCLUSIONS: Open-source computational solutions currently available for translational science does not have a model to represent biomolecular information and also are not integrated with the existing bioinformatics tools. On the other hand, existing genomic data models do not represent clinical patient data. A framework was developed to support translational research by integrating biomolecular information coming from different “omics” technologies with patient’s clinical and socio-demographic data. This framework should present some features: flexibility, compression and robustness. The experiments accomplished from a use case demonstrated that the proposed system meets requirements of flexibility and robustness, leading to the desired integration. The Clinical Module can be accessed in http://dcm.ffclrp.usp.br/caib/pg=iptrans.
format Online
Article
Text
id pubmed-3688149
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36881492013-06-21 Computational framework to support integration of biomolecular and clinical data within a translational approach Miyoshi, Newton Shydeo Brandão Pinheiro, Daniel Guariz Silva, Wilson Araújo Felipe, Joaquim Cezar BMC Bioinformatics Research Article BACKGROUND: The use of the knowledge produced by sciences to promote human health is the main goal of translational medicine. To make it feasible we need computational methods to handle the large amount of information that arises from bench to bedside and to deal with its heterogeneity. A computational challenge that must be faced is to promote the integration of clinical, socio-demographic and biological data. In this effort, ontologies play an essential role as a powerful artifact for knowledge representation. Chado is a modular ontology-oriented database model that gained popularity due to its robustness and flexibility as a generic platform to store biological data; however it lacks supporting representation of clinical and socio-demographic information. RESULTS: We have implemented an extension of Chado – the Clinical Module - to allow the representation of this kind of information. Our approach consists of a framework for data integration through the use of a common reference ontology. The design of this framework has four levels: data level, to store the data; semantic level, to integrate and standardize the data by the use of ontologies; application level, to manage clinical databases, ontologies and data integration process; and web interface level, to allow interaction between the user and the system. The clinical module was built based on the Entity-Attribute-Value (EAV) model. We also proposed a methodology to migrate data from legacy clinical databases to the integrative framework. A Chado instance was initialized using a relational database management system. The Clinical Module was implemented and the framework was loaded using data from a factual clinical research database. Clinical and demographic data as well as biomaterial data were obtained from patients with tumors of head and neck. We implemented the IPTrans tool that is a complete environment for data migration, which comprises: the construction of a model to describe the legacy clinical data, based on an ontology; the Extraction, Transformation and Load (ETL) process to extract the data from the source clinical database and load it in the Clinical Module of Chado; the development of a web tool and a Bridge Layer to adapt the web tool to Chado, as well as other applications. CONCLUSIONS: Open-source computational solutions currently available for translational science does not have a model to represent biomolecular information and also are not integrated with the existing bioinformatics tools. On the other hand, existing genomic data models do not represent clinical patient data. A framework was developed to support translational research by integrating biomolecular information coming from different “omics” technologies with patient’s clinical and socio-demographic data. This framework should present some features: flexibility, compression and robustness. The experiments accomplished from a use case demonstrated that the proposed system meets requirements of flexibility and robustness, leading to the desired integration. The Clinical Module can be accessed in http://dcm.ffclrp.usp.br/caib/pg=iptrans. BioMed Central 2013-06-06 /pmc/articles/PMC3688149/ /pubmed/23742129 http://dx.doi.org/10.1186/1471-2105-14-180 Text en Copyright © 2013 Miyoshi et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Miyoshi, Newton Shydeo Brandão
Pinheiro, Daniel Guariz
Silva, Wilson Araújo
Felipe, Joaquim Cezar
Computational framework to support integration of biomolecular and clinical data within a translational approach
title Computational framework to support integration of biomolecular and clinical data within a translational approach
title_full Computational framework to support integration of biomolecular and clinical data within a translational approach
title_fullStr Computational framework to support integration of biomolecular and clinical data within a translational approach
title_full_unstemmed Computational framework to support integration of biomolecular and clinical data within a translational approach
title_short Computational framework to support integration of biomolecular and clinical data within a translational approach
title_sort computational framework to support integration of biomolecular and clinical data within a translational approach
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3688149/
https://www.ncbi.nlm.nih.gov/pubmed/23742129
http://dx.doi.org/10.1186/1471-2105-14-180
work_keys_str_mv AT miyoshinewtonshydeobrandao computationalframeworktosupportintegrationofbiomolecularandclinicaldatawithinatranslationalapproach
AT pinheirodanielguariz computationalframeworktosupportintegrationofbiomolecularandclinicaldatawithinatranslationalapproach
AT silvawilsonaraujo computationalframeworktosupportintegrationofbiomolecularandclinicaldatawithinatranslationalapproach
AT felipejoaquimcezar computationalframeworktosupportintegrationofbiomolecularandclinicaldatawithinatranslationalapproach