Cargando…
Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles
BACKGROUND: MEDLINE(®)/PubMed(® )currently indexes over 18 million biomedical articles, providing unprecedented opportunities and challenges for text analysis. Using Medical Subject Heading Over-representation Profiles (MeSHOPs), an entity of interest can be robustly summarized, quantitatively ident...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3580445/ https://www.ncbi.nlm.nih.gov/pubmed/23021552 http://dx.doi.org/10.1186/gm376 |
_version_ | 1782260249209602048 |
---|---|
author | Cheung, Warren A Ouellette, BF Francis Wasserman, Wyeth W |
author_facet | Cheung, Warren A Ouellette, BF Francis Wasserman, Wyeth W |
author_sort | Cheung, Warren A |
collection | PubMed |
description | BACKGROUND: MEDLINE(®)/PubMed(® )currently indexes over 18 million biomedical articles, providing unprecedented opportunities and challenges for text analysis. Using Medical Subject Heading Over-representation Profiles (MeSHOPs), an entity of interest can be robustly summarized, quantitatively identifying associated biomedical terms and predicting novel indirect associations. METHODS: A procedure is introduced for quantitative comparison of MeSHOPs derived from a group of MEDLINE(® )articles for a biomedical topic (for example, articles for a specific gene or disease). Similarity scores are computed to compare MeSHOPs of genes and diseases. RESULTS: Similarity scores successfully infer novel associations between diseases and genes. The number of papers addressing a gene or disease has a strong influence on predicted associations, revealing an important bias for gene-disease relationship prediction. Predictions derived from comparisons of MeSHOPs achieves a mean 8% AUC improvement in the identification of gene-disease relationships compared to gene-independent baseline properties. CONCLUSIONS: MeSHOP comparisons are demonstrated to provide predictive capacity for novel relationships between genes and human diseases. We demonstrate the impact of literature bias on the performance of gene-disease prediction methods. MeSHOPs provide a rich source of annotation to facilitate relationship discovery in biomedical informatics. |
format | Online Article Text |
id | pubmed-3580445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35804452013-03-04 Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles Cheung, Warren A Ouellette, BF Francis Wasserman, Wyeth W Genome Med Research BACKGROUND: MEDLINE(®)/PubMed(® )currently indexes over 18 million biomedical articles, providing unprecedented opportunities and challenges for text analysis. Using Medical Subject Heading Over-representation Profiles (MeSHOPs), an entity of interest can be robustly summarized, quantitatively identifying associated biomedical terms and predicting novel indirect associations. METHODS: A procedure is introduced for quantitative comparison of MeSHOPs derived from a group of MEDLINE(® )articles for a biomedical topic (for example, articles for a specific gene or disease). Similarity scores are computed to compare MeSHOPs of genes and diseases. RESULTS: Similarity scores successfully infer novel associations between diseases and genes. The number of papers addressing a gene or disease has a strong influence on predicted associations, revealing an important bias for gene-disease relationship prediction. Predictions derived from comparisons of MeSHOPs achieves a mean 8% AUC improvement in the identification of gene-disease relationships compared to gene-independent baseline properties. CONCLUSIONS: MeSHOP comparisons are demonstrated to provide predictive capacity for novel relationships between genes and human diseases. We demonstrate the impact of literature bias on the performance of gene-disease prediction methods. MeSHOPs provide a rich source of annotation to facilitate relationship discovery in biomedical informatics. BioMed Central 2012-09-28 /pmc/articles/PMC3580445/ /pubmed/23021552 http://dx.doi.org/10.1186/gm376 Text en Copyright ©2012 Cheung et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Cheung, Warren A Ouellette, BF Francis Wasserman, Wyeth W Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles |
title | Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles |
title_full | Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles |
title_fullStr | Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles |
title_full_unstemmed | Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles |
title_short | Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles |
title_sort | inferring novel gene-disease associations using medical subject heading over-representation profiles |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3580445/ https://www.ncbi.nlm.nih.gov/pubmed/23021552 http://dx.doi.org/10.1186/gm376 |
work_keys_str_mv | AT cheungwarrena inferringnovelgenediseaseassociationsusingmedicalsubjectheadingoverrepresentationprofiles AT ouellettebffrancis inferringnovelgenediseaseassociationsusingmedicalsubjectheadingoverrepresentationprofiles AT wassermanwyethw inferringnovelgenediseaseassociationsusingmedicalsubjectheadingoverrepresentationprofiles |