
Structuring and extracting knowledge for the support of hypothesis generation in molecular biology

BACKGROUND: Hypothesis generation in molecular and cellular biology is an empirical process in which knowledge derived from prior experiments is distilled into a comprehensible model. The requirement of automated support is exemplified by the difficulty of considering all relevant facts that are con...

Bibliographic Details
Main authors:	Roos, Marco; Marshall, M Scott; Gibson, Andrew P; Schuemie, Martijn; Meij, Edgar; Katrenko, Sophia; van Hage, Willem Robert; Krommydas, Konstantinos; Adriaans, Pieter W
Format:	Text
Language:	English
Published:	BioMed Central 2009
Subjects:	Research
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755830/ https://www.ncbi.nlm.nih.gov/pubmed/19796406 http://dx.doi.org/10.1186/1471-2105-10-S10-S9

_version_	1782172474636500992
author	Roos, Marco; Marshall, M Scott; Gibson, Andrew P; Schuemie, Martijn; Meij, Edgar; Katrenko, Sophia; van Hage, Willem Robert; Krommydas, Konstantinos; Adriaans, Pieter W
author_sort	Roos, Marco
collection	PubMed
description	BACKGROUND: Hypothesis generation in molecular and cellular biology is an empirical process in which knowledge derived from prior experiments is distilled into a comprehensible model. The requirement of automated support is exemplified by the difficulty of considering all relevant facts that are contained in the millions of documents available from PubMed. Semantic Web provides tools for sharing prior knowledge, while information retrieval and information extraction techniques enable its extraction from literature. Their combination makes prior knowledge available for computational analysis and inference. While some tools provide complete solutions that limit the control over the modeling and extraction processes, we seek a methodology that supports control by the experimenter over these critical processes. RESULTS: We describe progress towards automated support for the generation of biomolecular hypotheses. Semantic Web technologies are used to structure and store knowledge, while a workflow extracts knowledge from text. We designed minimal proto-ontologies in OWL for capturing different aspects of a text mining experiment: the biological hypothesis, text and documents, text mining, and workflow provenance. The models fit a methodology that allows focus on the requirements of a single experiment while supporting reuse and posterior analysis of extracted knowledge from multiple experiments. Our workflow is composed of services from the 'Adaptive Information Disclosure Application' (AIDA) toolkit as well as a few others. The output is a semantic model with putative biological relations, with each relation linked to the corresponding evidence. CONCLUSION: We demonstrated a 'do-it-yourself' approach for structuring and extracting knowledge in the context of experimental research on biomolecular mechanisms. The methodology can be used to bootstrap the construction of semantically rich biological models using the results of knowledge extraction processes. Models specific to particular experiments can be constructed that, in turn, link with other semantic models, creating a web of knowledge that spans experiments. Mapping mechanisms can link to other knowledge resources such as OBO ontologies or SKOS vocabularies. AIDA Web Services can be used to design personalized knowledge extraction procedures. In our example experiment, we found three proteins (NF-Kappa B, p21, and Bax) potentially playing a role in the interplay between nutrients and epigenetic gene regulation.
format	Text
id	pubmed-2755830
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-2755830 2009-10-03 BMC Bioinformatics Research BioMed Central 2009-10-01 /pmc/articles/PMC2755830/ /pubmed/19796406 http://dx.doi.org/10.1186/1471-2105-10-S10-S9 Text en © Roos et al; licensee BioMed Central Ltd. 2009 https://creativecommons.org/licenses/by/2.0/ This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title	Structuring and extracting knowledge for the support of hypothesis generation in molecular biology
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755830/ https://www.ncbi.nlm.nih.gov/pubmed/19796406 http://dx.doi.org/10.1186/1471-2105-10-S10-S9
