Cargando…

Semantic text mining support for lignocellulose research

BACKGROUND: Biofuels produced from biomass are considered to be promising sustainable alternatives to fossil fuels. The conversion of lignocellulose into fermentable sugars for biofuels production requires the use of enzyme cocktails that can efficiently and economically hydrolyze lignocellulosic bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Meurs, Marie-Jean, Murphy, Caitlin, Morgenstern, Ingo, Butler, Greg, Powlowski, Justin, Tsang, Adrian, Witte, René
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3339392/
https://www.ncbi.nlm.nih.gov/pubmed/22595090
http://dx.doi.org/10.1186/1472-6947-12-S1-S5
_version_ 1782231349737816064
author Meurs, Marie-Jean
Murphy, Caitlin
Morgenstern, Ingo
Butler, Greg
Powlowski, Justin
Tsang, Adrian
Witte, René
author_facet Meurs, Marie-Jean
Murphy, Caitlin
Morgenstern, Ingo
Butler, Greg
Powlowski, Justin
Tsang, Adrian
Witte, René
author_sort Meurs, Marie-Jean
collection PubMed
description BACKGROUND: Biofuels produced from biomass are considered to be promising sustainable alternatives to fossil fuels. The conversion of lignocellulose into fermentable sugars for biofuels production requires the use of enzyme cocktails that can efficiently and economically hydrolyze lignocellulosic biomass. As many fungi naturally break down lignocellulose, the identification and characterization of the enzymes involved is a key challenge in the research and development of biomass-derived products and fuels. One approach to meeting this challenge is to mine the rapidly-expanding repertoire of microbial genomes for enzymes with the appropriate catalytic properties. RESULTS: Semantic technologies, including natural language processing, ontologies, semantic Web services and Web-based collaboration tools, promise to support users in handling complex data, thereby facilitating knowledge-intensive tasks. An ongoing challenge is to select the appropriate technologies and combine them in a coherent system that brings measurable improvements to the users. We present our ongoing development of a semantic infrastructure in support of genomics-based lignocellulose research. Part of this effort is the automated curation of knowledge from information on fungal enzymes that is available in the literature and genome resources. CONCLUSIONS: Working closely with fungal biology researchers who manually curate the existing literature, we developed ontological natural language processing pipelines integrated in a Web-based interface to assist them in two main tasks: mining the literature for relevant knowledge, and at the same time providing rich and semantically linked information.
format Online
Article
Text
id pubmed-3339392
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33393922012-05-01 Semantic text mining support for lignocellulose research Meurs, Marie-Jean Murphy, Caitlin Morgenstern, Ingo Butler, Greg Powlowski, Justin Tsang, Adrian Witte, René BMC Med Inform Decis Mak Proceedings BACKGROUND: Biofuels produced from biomass are considered to be promising sustainable alternatives to fossil fuels. The conversion of lignocellulose into fermentable sugars for biofuels production requires the use of enzyme cocktails that can efficiently and economically hydrolyze lignocellulosic biomass. As many fungi naturally break down lignocellulose, the identification and characterization of the enzymes involved is a key challenge in the research and development of biomass-derived products and fuels. One approach to meeting this challenge is to mine the rapidly-expanding repertoire of microbial genomes for enzymes with the appropriate catalytic properties. RESULTS: Semantic technologies, including natural language processing, ontologies, semantic Web services and Web-based collaboration tools, promise to support users in handling complex data, thereby facilitating knowledge-intensive tasks. An ongoing challenge is to select the appropriate technologies and combine them in a coherent system that brings measurable improvements to the users. We present our ongoing development of a semantic infrastructure in support of genomics-based lignocellulose research. Part of this effort is the automated curation of knowledge from information on fungal enzymes that is available in the literature and genome resources. CONCLUSIONS: Working closely with fungal biology researchers who manually curate the existing literature, we developed ontological natural language processing pipelines integrated in a Web-based interface to assist them in two main tasks: mining the literature for relevant knowledge, and at the same time providing rich and semantically linked information. BioMed Central 2012-04-30 /pmc/articles/PMC3339392/ /pubmed/22595090 http://dx.doi.org/10.1186/1472-6947-12-S1-S5 Text en Copyright ©2012 Meurs et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Meurs, Marie-Jean
Murphy, Caitlin
Morgenstern, Ingo
Butler, Greg
Powlowski, Justin
Tsang, Adrian
Witte, René
Semantic text mining support for lignocellulose research
title Semantic text mining support for lignocellulose research
title_full Semantic text mining support for lignocellulose research
title_fullStr Semantic text mining support for lignocellulose research
title_full_unstemmed Semantic text mining support for lignocellulose research
title_short Semantic text mining support for lignocellulose research
title_sort semantic text mining support for lignocellulose research
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3339392/
https://www.ncbi.nlm.nih.gov/pubmed/22595090
http://dx.doi.org/10.1186/1472-6947-12-S1-S5
work_keys_str_mv AT meursmariejean semantictextminingsupportforlignocelluloseresearch
AT murphycaitlin semantictextminingsupportforlignocelluloseresearch
AT morgensterningo semantictextminingsupportforlignocelluloseresearch
AT butlergreg semantictextminingsupportforlignocelluloseresearch
AT powlowskijustin semantictextminingsupportforlignocelluloseresearch
AT tsangadrian semantictextminingsupportforlignocelluloseresearch
AT witterene semantictextminingsupportforlignocelluloseresearch