Cargando…

Event extraction for DNA methylation

BACKGROUND: We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction f...

Descripción completa

Detalles Bibliográficos
Autores principales: Ohta, Tomoko, Pyysalo, Sampo, Miwa, Makoto, Tsujii, Jun’ichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239302/
https://www.ncbi.nlm.nih.gov/pubmed/22166595
http://dx.doi.org/10.1186/2041-1480-2-S5-S2
_version_ 1782219163280867328
author Ohta, Tomoko
Pyysalo, Sampo
Miwa, Makoto
Tsujii, Jun’ichi
author_facet Ohta, Tomoko
Pyysalo, Sampo
Miwa, Makoto
Tsujii, Jun’ichi
author_sort Ohta, Tomoko
collection PubMed
description BACKGROUND: We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction for DNA methylation. RESULTS: We present an annotation scheme for DNA methylation following the representation of the BioNLP shared task on event extraction, select a set of 200 abstracts including a representative sample of all PubMed citations relevant to DNA methylation, and introduce manual annotation for this corpus marking nearly 3000 gene/protein mentions and 1500 DNA methylation and demethylation events. We retrain a state-of-the-art event extraction system on the corpus and find that automatic extraction of DNA methylation events, the methylated genes, and their methylation sites can be performed at 78% precision and 76% recall. CONCLUSIONS: Our results demonstrate that reliable extraction methods for DNA methylation events can be created through corpus annotation and straightforward retraining of a general event extraction system. The introduced resources are freely available for use in research from the GENIA project homepage http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA.
format Online
Article
Text
id pubmed-3239302
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32393022011-12-16 Event extraction for DNA methylation Ohta, Tomoko Pyysalo, Sampo Miwa, Makoto Tsujii, Jun’ichi J Biomed Semantics Research BACKGROUND: We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction for DNA methylation. RESULTS: We present an annotation scheme for DNA methylation following the representation of the BioNLP shared task on event extraction, select a set of 200 abstracts including a representative sample of all PubMed citations relevant to DNA methylation, and introduce manual annotation for this corpus marking nearly 3000 gene/protein mentions and 1500 DNA methylation and demethylation events. We retrain a state-of-the-art event extraction system on the corpus and find that automatic extraction of DNA methylation events, the methylated genes, and their methylation sites can be performed at 78% precision and 76% recall. CONCLUSIONS: Our results demonstrate that reliable extraction methods for DNA methylation events can be created through corpus annotation and straightforward retraining of a general event extraction system. The introduced resources are freely available for use in research from the GENIA project homepage http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA. BioMed Central 2011-10-06 /pmc/articles/PMC3239302/ /pubmed/22166595 http://dx.doi.org/10.1186/2041-1480-2-S5-S2 Text en Copyright ©2011 Ohta et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Ohta, Tomoko
Pyysalo, Sampo
Miwa, Makoto
Tsujii, Jun’ichi
Event extraction for DNA methylation
title Event extraction for DNA methylation
title_full Event extraction for DNA methylation
title_fullStr Event extraction for DNA methylation
title_full_unstemmed Event extraction for DNA methylation
title_short Event extraction for DNA methylation
title_sort event extraction for dna methylation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239302/
https://www.ncbi.nlm.nih.gov/pubmed/22166595
http://dx.doi.org/10.1186/2041-1480-2-S5-S2
work_keys_str_mv AT ohtatomoko eventextractionfordnamethylation
AT pyysalosampo eventextractionfordnamethylation
AT miwamakoto eventextractionfordnamethylation
AT tsujiijunichi eventextractionfordnamethylation