Mapping proteins to disease terminologies: from UniProt to MeSH

BACKGROUND: Although the UniProt KnowledgeBase is not a medical-oriented database, it contains information on more than 2,000 human proteins involved in pathologies. However, these annotations are not standardized, which impairs the interoperability between biological and clinical resources. In orde...

Bibliographic Details
Main Authors: Mottaz, Anaïs, Yip, Yum L, Ruch, Patrick, Veuthey, Anne-Lise
Format: Text
Language: English
Published: BioMed Central 2008
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367626/
https://www.ncbi.nlm.nih.gov/pubmed/18460185
http://dx.doi.org/10.1186/1471-2105-9-S5-S3
author Mottaz, Anaïs
Yip, Yum L
Ruch, Patrick
Veuthey, Anne-Lise
collection PubMed
description BACKGROUND: Although the UniProt KnowledgeBase is not a medical-oriented database, it contains information on more than 2,000 human proteins involved in pathologies. However, these annotations are not standardized, which impairs the interoperability between biological and clinical resources. In order to make these data easily accessible to clinical researchers, we have developed a procedure to link diseases described in the UniProtKB/Swiss-Prot entries to the MeSH disease terminology. RESULTS: We mapped disease names extracted either from the UniProtKB/Swiss-Prot entry comment lines or from the corresponding OMIM entry to MeSH. Different methods were assessed on a benchmark set of 200 disease names manually mapped to MeSH terms. The performance of the retained procedure in terms of precision and recall was 86% and 64%, respectively. Using the same procedure, more than 3,000 disease names in Swiss-Prot were mapped to MeSH with comparable efficiency. CONCLUSIONS: This study is a first attempt to link proteins in UniProtKB to medical resources. The indexing we provided will help clinicians and researchers navigate from diseases to genes and from genes to diseases in an efficient way. The mapping is available at: .
format Text
id pubmed-2367626
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2367626 2008-05-07 Mapping proteins to disease terminologies: from UniProt to MeSH Mottaz, Anaïs Yip, Yum L Ruch, Patrick Veuthey, Anne-Lise BMC Bioinformatics Proceedings BACKGROUND: Although the UniProt KnowledgeBase is not a medical-oriented database, it contains information on more than 2,000 human proteins involved in pathologies. However, these annotations are not standardized, which impairs the interoperability between biological and clinical resources. In order to make these data easily accessible to clinical researchers, we have developed a procedure to link diseases described in the UniProtKB/Swiss-Prot entries to the MeSH disease terminology. RESULTS: We mapped disease names extracted either from the UniProtKB/Swiss-Prot entry comment lines or from the corresponding OMIM entry to MeSH. Different methods were assessed on a benchmark set of 200 disease names manually mapped to MeSH terms. The performance of the retained procedure in terms of precision and recall was 86% and 64%, respectively. Using the same procedure, more than 3,000 disease names in Swiss-Prot were mapped to MeSH with comparable efficiency. CONCLUSIONS: This study is a first attempt to link proteins in UniProtKB to medical resources. The indexing we provided will help clinicians and researchers navigate from diseases to genes and from genes to diseases in an efficient way. The mapping is available at: . BioMed Central 2008-04-29 /pmc/articles/PMC2367626/ /pubmed/18460185 http://dx.doi.org/10.1186/1471-2105-9-S5-S3 Text en Copyright © 2008 Mottaz et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Mapping proteins to disease terminologies: from UniProt to MeSH
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2367626/
https://www.ncbi.nlm.nih.gov/pubmed/18460185
http://dx.doi.org/10.1186/1471-2105-9-S5-S3