
Identification of histone modifications in biomedical text for supporting epigenomic research

BACKGROUND: Posttranslational modifications of histones influence the structure of chromatin and thereby take part in the regulation of gene expression. Certain histone modification patterns, distributed over the genome, are connected to cell as well as tissue differentiation and to the adapt...


Bibliographic Details
Main Authors: Kolářik, Corinna, Klinger, Roman, Hofmann-Apitius, Martin
Format: Text
Language: English
Published: BioMed Central 2009
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648793/
https://www.ncbi.nlm.nih.gov/pubmed/19208128
http://dx.doi.org/10.1186/1471-2105-10-S1-S28
_version_ 1782164989643063296
author Kolářik, Corinna
Klinger, Roman
Hofmann-Apitius, Martin
collection PubMed
description BACKGROUND: Posttranslational modifications of histones influence the structure of chromatin and thereby take part in the regulation of gene expression. Certain histone modification patterns, distributed over the genome, are connected to cell as well as tissue differentiation and to the adaptation of organisms to their environment. Abnormal changes, by contrast, influence the development of disease states like cancer. The regulatory mechanisms for modifying histones and their functions are the subject of epigenomics investigation and are still not completely understood. Text provides a rich source of knowledge on epigenomics and on modifications of histones in particular. It contains information about experimental studies, the conditions used, and results. To our knowledge, no approach has been published so far for identifying histone modifications in text. RESULTS: We have developed an approach for identifying histone modifications in biomedical literature with Conditional Random Fields (CRF) and for resolving the recognized histone modification term variants by term standardization. For term identification, F1 measures of 0.84 by 10-fold cross-validation on the training corpus and 0.81 on an independent test corpus have been obtained. The standardization enabled the correct transformation of 96% of the terms from the training corpus and 98% from the test corpus. Due to the lack of terminologies exhaustively covering specific histone modification types, we developed a histone modification term hierarchy for use in a semantic text retrieval system. CONCLUSION: The developed approach greatly improves the retrieval of articles describing histone modifications. Since text contains context information about performed studies and experiments, the identification of histone modifications is the basis for supporting literature-based knowledge discovery and hypothesis generation to accelerate epigenomic research.
format Text
id pubmed-2648793
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2648793 2009-03-03 Identification of histone modifications in biomedical text for supporting epigenomic research Kolářik, Corinna Klinger, Roman Hofmann-Apitius, Martin BMC Bioinformatics Research BioMed Central 2009-01-30 /pmc/articles/PMC2648793/ /pubmed/19208128 http://dx.doi.org/10.1186/1471-2105-10-S1-S28 Text en Copyright © 2009 Kolářik et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Identification of histone modifications in biomedical text for supporting epigenomic research
topic Research