Cargando…

Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes

BACKGROUND: Identifying variants associated with diseases is a challenging task in medical genetics research. Current studies that prioritize variants within individual genomes generally rely on known variants, evidence from literature and genomes, and patient symptoms and clinical signs. The functi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kafkas, Șenay, Abdelhakim, Marwa, Uludag, Mahmut, Althagafi, Azza, Alghamdi, Malak, Hoehndorf, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10362560/
https://www.ncbi.nlm.nih.gov/pubmed/37479972
http://dx.doi.org/10.1186/s12859-023-05406-w
_version_ 1785076449595621376
author Kafkas, Șenay
Abdelhakim, Marwa
Uludag, Mahmut
Althagafi, Azza
Alghamdi, Malak
Hoehndorf, Robert
author_facet Kafkas, Șenay
Abdelhakim, Marwa
Uludag, Mahmut
Althagafi, Azza
Alghamdi, Malak
Hoehndorf, Robert
author_sort Kafkas, Șenay
collection PubMed
description BACKGROUND: Identifying variants associated with diseases is a challenging task in medical genetics research. Current studies that prioritize variants within individual genomes generally rely on known variants, evidence from literature and genomes, and patient symptoms and clinical signs. The functionalities of the existing tools, which rank variants based on given patient symptoms and clinical signs, are restricted to the coverage of ontologies such as the Human Phenotype Ontology (HPO). However, most clinicians do not limit themselves to HPO while describing patient symptoms/signs and their associated variants/genes. There is thus a need for an automated tool that can prioritize variants based on freely expressed patient symptoms and clinical signs. RESULTS: STARVar is a Symptom-based Tool for Automatic Ranking of Variants using evidence from literature and genomes. STARVar uses patient symptoms and clinical signs, either linked to HPO or expressed in free text format. It returns a ranked list of variants based on a combined score from two classifiers utilizing evidence from genomics and literature. STARVar improves over related tools on a set of synthetic patients. In addition, we demonstrated its distinct contribution to the domain on another synthetic dataset covering publicly available clinical genotype–phenotype associations by using symptoms and clinical signs expressed in free text format. CONCLUSIONS: STARVar stands as a unique and efficient tool that has the advantage of ranking variants with flexibly expressed patient symptoms in free-form text. Therefore, STARVar can be easily integrated into bioinformatics workflows designed to analyze disease-associated genomes. AVAILABILITY: STARVar is freely available from https://github.com/bio-ontology-research-group/STARVar. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05406-w.
format Online
Article
Text
id pubmed-10362560
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103625602023-07-23 Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes Kafkas, Șenay Abdelhakim, Marwa Uludag, Mahmut Althagafi, Azza Alghamdi, Malak Hoehndorf, Robert BMC Bioinformatics Software BACKGROUND: Identifying variants associated with diseases is a challenging task in medical genetics research. Current studies that prioritize variants within individual genomes generally rely on known variants, evidence from literature and genomes, and patient symptoms and clinical signs. The functionalities of the existing tools, which rank variants based on given patient symptoms and clinical signs, are restricted to the coverage of ontologies such as the Human Phenotype Ontology (HPO). However, most clinicians do not limit themselves to HPO while describing patient symptoms/signs and their associated variants/genes. There is thus a need for an automated tool that can prioritize variants based on freely expressed patient symptoms and clinical signs. RESULTS: STARVar is a Symptom-based Tool for Automatic Ranking of Variants using evidence from literature and genomes. STARVar uses patient symptoms and clinical signs, either linked to HPO or expressed in free text format. It returns a ranked list of variants based on a combined score from two classifiers utilizing evidence from genomics and literature. STARVar improves over related tools on a set of synthetic patients. In addition, we demonstrated its distinct contribution to the domain on another synthetic dataset covering publicly available clinical genotype–phenotype associations by using symptoms and clinical signs expressed in free text format. CONCLUSIONS: STARVar stands as a unique and efficient tool that has the advantage of ranking variants with flexibly expressed patient symptoms in free-form text. Therefore, STARVar can be easily integrated into bioinformatics workflows designed to analyze disease-associated genomes. AVAILABILITY: STARVar is freely available from https://github.com/bio-ontology-research-group/STARVar. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05406-w. BioMed Central 2023-07-21 /pmc/articles/PMC10362560/ /pubmed/37479972 http://dx.doi.org/10.1186/s12859-023-05406-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Kafkas, Șenay
Abdelhakim, Marwa
Uludag, Mahmut
Althagafi, Azza
Alghamdi, Malak
Hoehndorf, Robert
Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
title Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
title_full Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
title_fullStr Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
title_full_unstemmed Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
title_short Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
title_sort starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10362560/
https://www.ncbi.nlm.nih.gov/pubmed/37479972
http://dx.doi.org/10.1186/s12859-023-05406-w
work_keys_str_mv AT kafkassenay starvarsymptombasedtoolforautomaticrankingofvariantsusingevidencefromliteratureandgenomes
AT abdelhakimmarwa starvarsymptombasedtoolforautomaticrankingofvariantsusingevidencefromliteratureandgenomes
AT uludagmahmut starvarsymptombasedtoolforautomaticrankingofvariantsusingevidencefromliteratureandgenomes
AT althagafiazza starvarsymptombasedtoolforautomaticrankingofvariantsusingevidencefromliteratureandgenomes
AT alghamdimalak starvarsymptombasedtoolforautomaticrankingofvariantsusingevidencefromliteratureandgenomes
AT hoehndorfrobert starvarsymptombasedtoolforautomaticrankingofvariantsusingevidencefromliteratureandgenomes