Cargando…

Improving disease gene prioritization using the semantic similarity of Gene Ontology terms

Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have t...

Descripción completa

Detalles Bibliográficos
Autores principales: Schlicker, Andreas, Lengauer, Thomas, Albrecht, Mario
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935448/
https://www.ncbi.nlm.nih.gov/pubmed/20823322
http://dx.doi.org/10.1093/bioinformatics/btq384
_version_ 1782186405477220352
author Schlicker, Andreas
Lengauer, Thomas
Albrecht, Mario
author_facet Schlicker, Andreas
Lengauer, Thomas
Albrecht, Mario
author_sort Schlicker, Andreas
collection PubMed
description Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level. Results: Here, we introduce MedSim, a novel approach for ranking candidate genes for a particular disease based on functional comparisons involving the Gene Ontology. MedSim uses functional annotations of known disease genes for assessing the similarity of diseases as well as the disease relevance of candidate genes. We benchmarked our approach with genes known to be involved in 99 diseases taken from the OMIM database. Using artificial quantitative trait loci, MedSim achieved excellent performance with an area under the ROC curve of up to 0.90 and a sensitivity of over 70% at 90% specificity when classifying gene products according to their disease relatedness. This performance is comparable or even superior to related methods in the field, albeit using less and thus more easily accessible information. Availability: MedSim is offered as part of our FunSimMat web service (http://www.funsimmat.de). Contact: mario.albrecht@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2935448
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29354482010-09-08 Improving disease gene prioritization using the semantic similarity of Gene Ontology terms Schlicker, Andreas Lengauer, Thomas Albrecht, Mario Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level. Results: Here, we introduce MedSim, a novel approach for ranking candidate genes for a particular disease based on functional comparisons involving the Gene Ontology. MedSim uses functional annotations of known disease genes for assessing the similarity of diseases as well as the disease relevance of candidate genes. We benchmarked our approach with genes known to be involved in 99 diseases taken from the OMIM database. Using artificial quantitative trait loci, MedSim achieved excellent performance with an area under the ROC curve of up to 0.90 and a sensitivity of over 70% at 90% specificity when classifying gene products according to their disease relatedness. This performance is comparable or even superior to related methods in the field, albeit using less and thus more easily accessible information. Availability: MedSim is offered as part of our FunSimMat web service (http://www.funsimmat.de). Contact: mario.albrecht@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935448/ /pubmed/20823322 http://dx.doi.org/10.1093/bioinformatics/btq384 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
Schlicker, Andreas
Lengauer, Thomas
Albrecht, Mario
Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
title Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
title_full Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
title_fullStr Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
title_full_unstemmed Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
title_short Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
title_sort improving disease gene prioritization using the semantic similarity of gene ontology terms
topic Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935448/
https://www.ncbi.nlm.nih.gov/pubmed/20823322
http://dx.doi.org/10.1093/bioinformatics/btq384
work_keys_str_mv AT schlickerandreas improvingdiseasegeneprioritizationusingthesemanticsimilarityofgeneontologyterms
AT lengauerthomas improvingdiseasegeneprioritizationusingthesemanticsimilarityofgeneontologyterms
AT albrechtmario improvingdiseasegeneprioritizationusingthesemanticsimilarityofgeneontologyterms