Cargando…

CSI-OMIM - Clinical Synopsis Search in OMIM

BACKGROUND: The OMIM database is a tool used daily by geneticists. Syndrome pages include a Clinical Synopsis section containing a list of known phenotypes comprising a clinical syndrome. The phenotypes are in free text and different phrases are often used to describe the same phenotype, the differe...

Descripción completa

Detalles Bibliográficos
Autores principales: Cohen, Raphael, Gefen, Avitan, Elhadad, Michael, Birk, Ohad S
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3053257/
https://www.ncbi.nlm.nih.gov/pubmed/21362185
http://dx.doi.org/10.1186/1471-2105-12-65
_version_ 1782199719927218176
author Cohen, Raphael
Gefen, Avitan
Elhadad, Michael
Birk, Ohad S
author_facet Cohen, Raphael
Gefen, Avitan
Elhadad, Michael
Birk, Ohad S
author_sort Cohen, Raphael
collection PubMed
description BACKGROUND: The OMIM database is a tool used daily by geneticists. Syndrome pages include a Clinical Synopsis section containing a list of known phenotypes comprising a clinical syndrome. The phenotypes are in free text and different phrases are often used to describe the same phenotype, the differences originating in spelling variations or typing errors, varying sentence structures and terminological variants. These variations hinder searching for syndromes or using the large amount of phenotypic information for research purposes. In addition, negation forms also create false positives when searching the textual description of phenotypes and induce noise in text mining applications. DESCRIPTION: Our method allows efficient and complete search of OMIM phenotypes as well as improved data-mining of the OMIM phenome. Applying natural language processing, each phrase is tagged with additional semantic information using UMLS and MESH. Using a grammar based method, annotated phrases are clustered into groups denoting similar phenotypes. These groups of synonymous expressions enable precise search, as query terms can be matched with the many variations that appear in OMIM, while avoiding over-matching expressions that include the query term in a negative context. On the basis of these clusters, we computed pair-wise similarity among syndromes in OMIM. Using this new similarity measure, we identified 79,770 new connections between syndromes, an average of 16 new connections per syndrome. Our project is Web-based and available at http://fohs.bgu.ac.il/s2g/csiomim CONCLUSIONS: The resulting enhanced search functionality provides clinicians with an efficient tool for diagnosis. This search application is also used for finding similar syndromes for the candidate gene prioritization tool S2G. The enhanced OMIM database we produced can be further used for bioinformatics purposes such as linking phenotypes and genes based on syndrome similarities and the known genes in Morbidmap.
format Text
id pubmed-3053257
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30532572011-03-11 CSI-OMIM - Clinical Synopsis Search in OMIM Cohen, Raphael Gefen, Avitan Elhadad, Michael Birk, Ohad S BMC Bioinformatics Database BACKGROUND: The OMIM database is a tool used daily by geneticists. Syndrome pages include a Clinical Synopsis section containing a list of known phenotypes comprising a clinical syndrome. The phenotypes are in free text and different phrases are often used to describe the same phenotype, the differences originating in spelling variations or typing errors, varying sentence structures and terminological variants. These variations hinder searching for syndromes or using the large amount of phenotypic information for research purposes. In addition, negation forms also create false positives when searching the textual description of phenotypes and induce noise in text mining applications. DESCRIPTION: Our method allows efficient and complete search of OMIM phenotypes as well as improved data-mining of the OMIM phenome. Applying natural language processing, each phrase is tagged with additional semantic information using UMLS and MESH. Using a grammar based method, annotated phrases are clustered into groups denoting similar phenotypes. These groups of synonymous expressions enable precise search, as query terms can be matched with the many variations that appear in OMIM, while avoiding over-matching expressions that include the query term in a negative context. On the basis of these clusters, we computed pair-wise similarity among syndromes in OMIM. Using this new similarity measure, we identified 79,770 new connections between syndromes, an average of 16 new connections per syndrome. Our project is Web-based and available at http://fohs.bgu.ac.il/s2g/csiomim CONCLUSIONS: The resulting enhanced search functionality provides clinicians with an efficient tool for diagnosis. This search application is also used for finding similar syndromes for the candidate gene prioritization tool S2G. The enhanced OMIM database we produced can be further used for bioinformatics purposes such as linking phenotypes and genes based on syndrome similarities and the known genes in Morbidmap. BioMed Central 2011-03-01 /pmc/articles/PMC3053257/ /pubmed/21362185 http://dx.doi.org/10.1186/1471-2105-12-65 Text en Copyright ©2011 Cohen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Cohen, Raphael
Gefen, Avitan
Elhadad, Michael
Birk, Ohad S
CSI-OMIM - Clinical Synopsis Search in OMIM
title CSI-OMIM - Clinical Synopsis Search in OMIM
title_full CSI-OMIM - Clinical Synopsis Search in OMIM
title_fullStr CSI-OMIM - Clinical Synopsis Search in OMIM
title_full_unstemmed CSI-OMIM - Clinical Synopsis Search in OMIM
title_short CSI-OMIM - Clinical Synopsis Search in OMIM
title_sort csi-omim - clinical synopsis search in omim
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3053257/
https://www.ncbi.nlm.nih.gov/pubmed/21362185
http://dx.doi.org/10.1186/1471-2105-12-65
work_keys_str_mv AT cohenraphael csiomimclinicalsynopsissearchinomim
AT gefenavitan csiomimclinicalsynopsissearchinomim
AT elhadadmichael csiomimclinicalsynopsissearchinomim
AT birkohads csiomimclinicalsynopsissearchinomim