Cargando…
Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have t...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935448/ https://www.ncbi.nlm.nih.gov/pubmed/20823322 http://dx.doi.org/10.1093/bioinformatics/btq384 |
_version_ | 1782186405477220352 |
---|---|
author | Schlicker, Andreas Lengauer, Thomas Albrecht, Mario |
author_facet | Schlicker, Andreas Lengauer, Thomas Albrecht, Mario |
author_sort | Schlicker, Andreas |
collection | PubMed |
description | Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level. Results: Here, we introduce MedSim, a novel approach for ranking candidate genes for a particular disease based on functional comparisons involving the Gene Ontology. MedSim uses functional annotations of known disease genes for assessing the similarity of diseases as well as the disease relevance of candidate genes. We benchmarked our approach with genes known to be involved in 99 diseases taken from the OMIM database. Using artificial quantitative trait loci, MedSim achieved excellent performance with an area under the ROC curve of up to 0.90 and a sensitivity of over 70% at 90% specificity when classifying gene products according to their disease relatedness. This performance is comparable or even superior to related methods in the field, albeit using less and thus more easily accessible information. Availability: MedSim is offered as part of our FunSimMat web service (http://www.funsimmat.de). Contact: mario.albrecht@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2935448 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29354482010-09-08 Improving disease gene prioritization using the semantic similarity of Gene Ontology terms Schlicker, Andreas Lengauer, Thomas Albrecht, Mario Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level. Results: Here, we introduce MedSim, a novel approach for ranking candidate genes for a particular disease based on functional comparisons involving the Gene Ontology. MedSim uses functional annotations of known disease genes for assessing the similarity of diseases as well as the disease relevance of candidate genes. We benchmarked our approach with genes known to be involved in 99 diseases taken from the OMIM database. Using artificial quantitative trait loci, MedSim achieved excellent performance with an area under the ROC curve of up to 0.90 and a sensitivity of over 70% at 90% specificity when classifying gene products according to their disease relatedness. This performance is comparable or even superior to related methods in the field, albeit using less and thus more easily accessible information. Availability: MedSim is offered as part of our FunSimMat web service (http://www.funsimmat.de). Contact: mario.albrecht@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935448/ /pubmed/20823322 http://dx.doi.org/10.1093/bioinformatics/btq384 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Schlicker, Andreas Lengauer, Thomas Albrecht, Mario Improving disease gene prioritization using the semantic similarity of Gene Ontology terms |
title | Improving disease gene prioritization using the semantic similarity of Gene Ontology terms |
title_full | Improving disease gene prioritization using the semantic similarity of Gene Ontology terms |
title_fullStr | Improving disease gene prioritization using the semantic similarity of Gene Ontology terms |
title_full_unstemmed | Improving disease gene prioritization using the semantic similarity of Gene Ontology terms |
title_short | Improving disease gene prioritization using the semantic similarity of Gene Ontology terms |
title_sort | improving disease gene prioritization using the semantic similarity of gene ontology terms |
topic | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935448/ https://www.ncbi.nlm.nih.gov/pubmed/20823322 http://dx.doi.org/10.1093/bioinformatics/btq384 |
work_keys_str_mv | AT schlickerandreas improvingdiseasegeneprioritizationusingthesemanticsimilarityofgeneontologyterms AT lengauerthomas improvingdiseasegeneprioritizationusingthesemanticsimilarityofgeneontologyterms AT albrechtmario improvingdiseasegeneprioritizationusingthesemanticsimilarityofgeneontologyterms |