Cargando…

Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles

BACKGROUND: MEDLINE(®)/PubMed(® )currently indexes over 18 million biomedical articles, providing unprecedented opportunities and challenges for text analysis. Using Medical Subject Heading Over-representation Profiles (MeSHOPs), an entity of interest can be robustly summarized, quantitatively ident...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheung, Warren A, Ouellette, BF Francis, Wasserman, Wyeth W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3580445/
https://www.ncbi.nlm.nih.gov/pubmed/23021552
http://dx.doi.org/10.1186/gm376
_version_ 1782260249209602048
author Cheung, Warren A
Ouellette, BF Francis
Wasserman, Wyeth W
author_facet Cheung, Warren A
Ouellette, BF Francis
Wasserman, Wyeth W
author_sort Cheung, Warren A
collection PubMed
description BACKGROUND: MEDLINE(®)/PubMed(® )currently indexes over 18 million biomedical articles, providing unprecedented opportunities and challenges for text analysis. Using Medical Subject Heading Over-representation Profiles (MeSHOPs), an entity of interest can be robustly summarized, quantitatively identifying associated biomedical terms and predicting novel indirect associations. METHODS: A procedure is introduced for quantitative comparison of MeSHOPs derived from a group of MEDLINE(® )articles for a biomedical topic (for example, articles for a specific gene or disease). Similarity scores are computed to compare MeSHOPs of genes and diseases. RESULTS: Similarity scores successfully infer novel associations between diseases and genes. The number of papers addressing a gene or disease has a strong influence on predicted associations, revealing an important bias for gene-disease relationship prediction. Predictions derived from comparisons of MeSHOPs achieves a mean 8% AUC improvement in the identification of gene-disease relationships compared to gene-independent baseline properties. CONCLUSIONS: MeSHOP comparisons are demonstrated to provide predictive capacity for novel relationships between genes and human diseases. We demonstrate the impact of literature bias on the performance of gene-disease prediction methods. MeSHOPs provide a rich source of annotation to facilitate relationship discovery in biomedical informatics.
format Online
Article
Text
id pubmed-3580445
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35804452013-03-04 Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles Cheung, Warren A Ouellette, BF Francis Wasserman, Wyeth W Genome Med Research BACKGROUND: MEDLINE(®)/PubMed(® )currently indexes over 18 million biomedical articles, providing unprecedented opportunities and challenges for text analysis. Using Medical Subject Heading Over-representation Profiles (MeSHOPs), an entity of interest can be robustly summarized, quantitatively identifying associated biomedical terms and predicting novel indirect associations. METHODS: A procedure is introduced for quantitative comparison of MeSHOPs derived from a group of MEDLINE(® )articles for a biomedical topic (for example, articles for a specific gene or disease). Similarity scores are computed to compare MeSHOPs of genes and diseases. RESULTS: Similarity scores successfully infer novel associations between diseases and genes. The number of papers addressing a gene or disease has a strong influence on predicted associations, revealing an important bias for gene-disease relationship prediction. Predictions derived from comparisons of MeSHOPs achieves a mean 8% AUC improvement in the identification of gene-disease relationships compared to gene-independent baseline properties. CONCLUSIONS: MeSHOP comparisons are demonstrated to provide predictive capacity for novel relationships between genes and human diseases. We demonstrate the impact of literature bias on the performance of gene-disease prediction methods. MeSHOPs provide a rich source of annotation to facilitate relationship discovery in biomedical informatics. BioMed Central 2012-09-28 /pmc/articles/PMC3580445/ /pubmed/23021552 http://dx.doi.org/10.1186/gm376 Text en Copyright ©2012 Cheung et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Cheung, Warren A
Ouellette, BF Francis
Wasserman, Wyeth W
Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
title Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
title_full Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
title_fullStr Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
title_full_unstemmed Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
title_short Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
title_sort inferring novel gene-disease associations using medical subject heading over-representation profiles
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3580445/
https://www.ncbi.nlm.nih.gov/pubmed/23021552
http://dx.doi.org/10.1186/gm376
work_keys_str_mv AT cheungwarrena inferringnovelgenediseaseassociationsusingmedicalsubjectheadingoverrepresentationprofiles
AT ouellettebffrancis inferringnovelgenediseaseassociationsusingmedicalsubjectheadingoverrepresentationprofiles
AT wassermanwyethw inferringnovelgenediseaseassociationsusingmedicalsubjectheadingoverrepresentationprofiles