PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom...
Main authors: | Liu, Xinhua; Gao, Ling; Peng, Yonglin; Fang, Zhonghai; Wang, Ju |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2023 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10366691/ https://www.ncbi.nlm.nih.gov/pubmed/37496714 http://dx.doi.org/10.3389/fgene.2023.1185790 |
_version_ | 1785077223059881984 |
---|---|
author | Liu, Xinhua Gao, Ling Peng, Yonglin Fang, Zhonghai Wang, Ju |
author_sort | Liu, Xinhua |
collection | PubMed |
description | Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom counted the number of overlapping MeSH terms between two phenotypes and then took the weight of every MeSH term within each phenotype into account according to the term frequency-inverse document frequency (TF-IDF). Phenotype-related genes were used for the evaluation of our method. Results: A 7,739 × 7,739 similarity score matrix was finally obtained and the number of phenotype pairs was dramatically decreased with the increase of similarity score. Besides, the overlapping rates of phenotype-related genes were remarkably increased with the increase of similarity score between phenotypes, which supports the reliability of our method. Conclusion: We anticipate our method can be applied to identifying novel therapeutic methods for complex diseases. |
format | Online Article Text |
id | pubmed-10366691 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-10366691 2023-07-26 PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary Liu, Xinhua Gao, Ling Peng, Yonglin Fang, Zhonghai Wang, Ju Front Genet Genetics Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom counted the number of overlapping MeSH terms between two phenotypes and then took the weight of every MeSH term within each phenotype into account according to the term frequency-inverse document frequency (TF-IDF). Phenotype-related genes were used for the evaluation of our method. Results: A 7,739 × 7,739 similarity score matrix was finally obtained and the number of phenotype pairs was dramatically decreased with the increase of similarity score. Besides, the overlapping rates of phenotype-related genes were remarkably increased with the increase of similarity score between phenotypes, which supports the reliability of our method. Conclusion: We anticipate our method can be applied to identifying novel therapeutic methods for complex diseases. Frontiers Media S.A. 2023-07-11 /pmc/articles/PMC10366691/ /pubmed/37496714 http://dx.doi.org/10.3389/fgene.2023.1185790 Text en Copyright © 2023 Liu, Gao, Peng, Fang and Wang. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Liu, Xinhua Gao, Ling Peng, Yonglin Fang, Zhonghai Wang, Ju PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary |
title | PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10366691/ https://www.ncbi.nlm.nih.gov/pubmed/37496714 http://dx.doi.org/10.3389/fgene.2023.1185790 |
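
The description field above says that PheSom weights the MeSH terms shared by two phenotypes using term frequency-inverse document frequency. The record does not give the exact scoring formula, so the following is only a minimal sketch of the general idea, assuming cosine similarity over TF-IDF vectors as a stand-in for the authors' score. The phenotype names, the MeSH term lists, and the function names `tfidf_vectors` and `similarity` are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def tfidf_vectors(phenotypes):
    """Map each phenotype name to a {MeSH term: TF-IDF weight} dict.

    phenotypes: dict of phenotype name -> list of MeSH terms (repeats allowed).
    """
    n_docs = len(phenotypes)
    # Document frequency: in how many phenotypes does each term occur?
    doc_freq = Counter()
    for terms in phenotypes.values():
        doc_freq.update(set(terms))

    vectors = {}
    for name, terms in phenotypes.items():
        counts = Counter(terms)
        total = sum(counts.values())
        vectors[name] = {
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return vectors


def similarity(u, v):
    """Cosine similarity of two sparse weight vectors (an assumed stand-in score)."""
    shared = set(u) & set(v)  # only overlapping MeSH terms contribute
    dot = sum(u[t] * v[t] for t in shared)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy input, not taken from OMIM.
toy_phenotypes = {
    "phenotype_A": ["Seizures", "Intellectual Disability", "Seizures"],
    "phenotype_B": ["Seizures", "Ataxia"],
    "phenotype_C": ["Cardiomyopathies", "Arrhythmias, Cardiac"],
}
vectors = tfidf_vectors(toy_phenotypes)
score_ab = similarity(vectors["phenotype_A"], vectors["phenotype_B"])
score_ac = similarity(vectors["phenotype_A"], vectors["phenotype_C"])
print(f"A vs B: {score_ab:.3f}")
print(f"A vs C: {score_ac:.3f}")
```

Applying `similarity` to every pair of phenotypes would give a symmetric square matrix, analogous in shape to the 7,739 × 7,739 matrix reported in the abstract. Phenotypes that share no MeSH terms score zero under this sketch.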