Cargando…

PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary

Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xinhua, Gao, Ling, Peng, Yonglin, Fang, Zhonghai, Wang, Ju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10366691/
https://www.ncbi.nlm.nih.gov/pubmed/37496714
http://dx.doi.org/10.3389/fgene.2023.1185790
_version_ 1785077223059881984
author Liu, Xinhua
Gao, Ling
Peng, Yonglin
Fang, Zhonghai
Wang, Ju
author_facet Liu, Xinhua
Gao, Ling
Peng, Yonglin
Fang, Zhonghai
Wang, Ju
author_sort Liu, Xinhua
collection PubMed
description Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom counted the number of overlapping MeSH terms between two phenotypes and then took the weight of every MeSH term within each phenotype into account according to the term frequency-inverse document frequency (FIDC). Phenotype-related genes were used for the evaluation of our method. Results: A 7,739 × 7,739 similarity score matrix was finally obtained and the number of phenotype pairs was dramatically decreased with the increase of similarity score. Besides, the overlapping rates of phenotype-related genes were remarkably increased with the increase of similarity score between phenotypes, which supports the reliability of our method. Conclusion: We anticipate our method can be applied to identifying novel therapeutic methods for complex diseases.
format Online
Article
Text
id pubmed-10366691
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103666912023-07-26 PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary Liu, Xinhua Gao, Ling Peng, Yonglin Fang, Zhonghai Wang, Ju Front Genet Genetics Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom counted the number of overlapping MeSH terms between two phenotypes and then took the weight of every MeSH term within each phenotype into account according to the term frequency-inverse document frequency (FIDC). Phenotype-related genes were used for the evaluation of our method. Results: A 7,739 × 7,739 similarity score matrix was finally obtained and the number of phenotype pairs was dramatically decreased with the increase of similarity score. Besides, the overlapping rates of phenotype-related genes were remarkably increased with the increase of similarity score between phenotypes, which supports the reliability of our method. Conclusion: We anticipate our method can be applied to identifying novel therapeutic methods for complex diseases. Frontiers Media S.A. 2023-07-11 /pmc/articles/PMC10366691/ /pubmed/37496714 http://dx.doi.org/10.3389/fgene.2023.1185790 Text en Copyright © 2023 Liu, Gao, Peng, Fang and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Xinhua
Gao, Ling
Peng, Yonglin
Fang, Zhonghai
Wang, Ju
PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
title PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
title_full PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
title_fullStr PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
title_full_unstemmed PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
title_short PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary
title_sort phesom: a term frequency-based method for measuring human phenotype similarity on the basis of mesh vocabulary
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10366691/
https://www.ncbi.nlm.nih.gov/pubmed/37496714
http://dx.doi.org/10.3389/fgene.2023.1185790
work_keys_str_mv AT liuxinhua phesomatermfrequencybasedmethodformeasuringhumanphenotypesimilarityonthebasisofmeshvocabulary
AT gaoling phesomatermfrequencybasedmethodformeasuringhumanphenotypesimilarityonthebasisofmeshvocabulary
AT pengyonglin phesomatermfrequencybasedmethodformeasuringhumanphenotypesimilarityonthebasisofmeshvocabulary
AT fangzhonghai phesomatermfrequencybasedmethodformeasuringhumanphenotypesimilarityonthebasisofmeshvocabulary
AT wangju phesomatermfrequencybasedmethodformeasuringhumanphenotypesimilarityonthebasisofmeshvocabulary