Event extraction for DNA methylation
BACKGROUND: We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction f...
Main authors: | Ohta, Tomoko; Pyysalo, Sampo; Miwa, Makoto; Tsujii, Jun’ichi |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2011 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239302/ https://www.ncbi.nlm.nih.gov/pubmed/22166595 http://dx.doi.org/10.1186/2041-1480-2-S5-S2 |
_version_ | 1782219163280867328 |
---|---|
author | Ohta, Tomoko Pyysalo, Sampo Miwa, Makoto Tsujii, Jun’ichi |
author_facet | Ohta, Tomoko Pyysalo, Sampo Miwa, Makoto Tsujii, Jun’ichi |
author_sort | Ohta, Tomoko |
collection | PubMed |
description | BACKGROUND: We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction for DNA methylation. RESULTS: We present an annotation scheme for DNA methylation following the representation of the BioNLP shared task on event extraction, select a set of 200 abstracts including a representative sample of all PubMed citations relevant to DNA methylation, and introduce manual annotation for this corpus marking nearly 3000 gene/protein mentions and 1500 DNA methylation and demethylation events. We retrain a state-of-the-art event extraction system on the corpus and find that automatic extraction of DNA methylation events, the methylated genes, and their methylation sites can be performed at 78% precision and 76% recall. CONCLUSIONS: Our results demonstrate that reliable extraction methods for DNA methylation events can be created through corpus annotation and straightforward retraining of a general event extraction system. The introduced resources are freely available for use in research from the GENIA project homepage http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA. |
format | Online Article Text |
id | pubmed-3239302 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32393022011-12-16 Event extraction for DNA methylation Ohta, Tomoko Pyysalo, Sampo Miwa, Makoto Tsujii, Jun’ichi J Biomed Semantics Research BACKGROUND: We consider the task of automatically extracting DNA methylation events from the biomedical domain literature. DNA methylation is a key mechanism of epigenetic control of gene expression and implicated in many cancers, but there has been little study of automatic information extraction for DNA methylation. RESULTS: We present an annotation scheme for DNA methylation following the representation of the BioNLP shared task on event extraction, select a set of 200 abstracts including a representative sample of all PubMed citations relevant to DNA methylation, and introduce manual annotation for this corpus marking nearly 3000 gene/protein mentions and 1500 DNA methylation and demethylation events. We retrain a state-of-the-art event extraction system on the corpus and find that automatic extraction of DNA methylation events, the methylated genes, and their methylation sites can be performed at 78% precision and 76% recall. CONCLUSIONS: Our results demonstrate that reliable extraction methods for DNA methylation events can be created through corpus annotation and straightforward retraining of a general event extraction system. The introduced resources are freely available for use in research from the GENIA project homepage http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA. BioMed Central 2011-10-06 /pmc/articles/PMC3239302/ /pubmed/22166595 http://dx.doi.org/10.1186/2041-1480-2-S5-S2 Text en Copyright ©2011 Ohta et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Ohta, Tomoko Pyysalo, Sampo Miwa, Makoto Tsujii, Jun’ichi Event extraction for DNA methylation |
title | Event extraction for DNA methylation |
title_full | Event extraction for DNA methylation |
title_fullStr | Event extraction for DNA methylation |
title_full_unstemmed | Event extraction for DNA methylation |
title_short | Event extraction for DNA methylation |
title_sort | event extraction for dna methylation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239302/ https://www.ncbi.nlm.nih.gov/pubmed/22166595 http://dx.doi.org/10.1186/2041-1480-2-S5-S2 |
work_keys_str_mv | AT ohtatomoko eventextractionfordnamethylation AT pyysalosampo eventextractionfordnamethylation AT miwamakoto eventextractionfordnamethylation AT tsujiijunichi eventextractionfordnamethylation |