Cargando…

An ontology-based search engine for protein-protein interactions

BACKGROUND: Keyword matching or ID matching is the most common searching method in a large database of protein-protein interactions. They are purely syntactic methods, and retrieve the records in the database that contain a keyword or ID specified in a query. Such syntactic search methods often retr...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Byungkyu, Han, Kyungsook
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009494/
https://www.ncbi.nlm.nih.gov/pubmed/20122195
http://dx.doi.org/10.1186/1471-2105-11-S1-S23
_version_ 1782194691375104000
author Park, Byungkyu
Han, Kyungsook
author_facet Park, Byungkyu
Han, Kyungsook
author_sort Park, Byungkyu
collection PubMed
description BACKGROUND: Keyword matching or ID matching is the most common searching method in a large database of protein-protein interactions. They are purely syntactic methods, and retrieve the records in the database that contain a keyword or ID specified in a query. Such syntactic search methods often retrieve too few search results or no results despite many potential matches present in the database. RESULTS: We have developed a new method for representing protein-protein interactions and the Gene Ontology (GO) using modified Gödel numbers. This representation is hidden from users but enables a search engine using the representation to efficiently search protein-protein interactions in a biologically meaningful way. Given a query protein with optional search conditions expressed in one or more GO terms, the search engine finds all the interaction partners of the query protein by unique prime factorization of the modified Gödel numbers representing the query protein and the search conditions. CONCLUSION: Representing the biological relations of proteins and their GO annotations by modified Gödel numbers makes a search engine efficiently find all protein-protein interactions by prime factorization of the numbers. Keyword matching or ID matching search methods often miss the interactions involving a protein that has no explicit annotations matching the search condition, but our search engine retrieves such interactions as well if they satisfy the search condition with a more specific term in the ontology.
format Text
id pubmed-3009494
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30094942010-12-23 An ontology-based search engine for protein-protein interactions Park, Byungkyu Han, Kyungsook BMC Bioinformatics Research BACKGROUND: Keyword matching or ID matching is the most common searching method in a large database of protein-protein interactions. They are purely syntactic methods, and retrieve the records in the database that contain a keyword or ID specified in a query. Such syntactic search methods often retrieve too few search results or no results despite many potential matches present in the database. RESULTS: We have developed a new method for representing protein-protein interactions and the Gene Ontology (GO) using modified Gödel numbers. This representation is hidden from users but enables a search engine using the representation to efficiently search protein-protein interactions in a biologically meaningful way. Given a query protein with optional search conditions expressed in one or more GO terms, the search engine finds all the interaction partners of the query protein by unique prime factorization of the modified Gödel numbers representing the query protein and the search conditions. CONCLUSION: Representing the biological relations of proteins and their GO annotations by modified Gödel numbers makes a search engine efficiently find all protein-protein interactions by prime factorization of the numbers. Keyword matching or ID matching search methods often miss the interactions involving a protein that has no explicit annotations matching the search condition, but our search engine retrieves such interactions as well if they satisfy the search condition with a more specific term in the ontology. BioMed Central 2010-01-18 /pmc/articles/PMC3009494/ /pubmed/20122195 http://dx.doi.org/10.1186/1471-2105-11-S1-S23 Text en Copyright ©2010 Park and Han; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Park, Byungkyu
Han, Kyungsook
An ontology-based search engine for protein-protein interactions
title An ontology-based search engine for protein-protein interactions
title_full An ontology-based search engine for protein-protein interactions
title_fullStr An ontology-based search engine for protein-protein interactions
title_full_unstemmed An ontology-based search engine for protein-protein interactions
title_short An ontology-based search engine for protein-protein interactions
title_sort ontology-based search engine for protein-protein interactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009494/
https://www.ncbi.nlm.nih.gov/pubmed/20122195
http://dx.doi.org/10.1186/1471-2105-11-S1-S23
work_keys_str_mv AT parkbyungkyu anontologybasedsearchengineforproteinproteininteractions
AT hankyungsook anontologybasedsearchengineforproteinproteininteractions
AT parkbyungkyu ontologybasedsearchengineforproteinproteininteractions
AT hankyungsook ontologybasedsearchengineforproteinproteininteractions