Cargando…

PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants

One of the primary challenges in human genetics is determining the functional impact of single nucleotide variants (SNVs) and insertion and deletions (InDels), whether coding or noncoding. In the past, methods have been created to detect disease-related single amino acid changes, but only some can a...

Descripción completa

Detalles Bibliográficos
Autores principales: Capriotti, Emidio, Fariselli, Piero
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320148/
https://www.ncbi.nlm.nih.gov/pubmed/37246737
http://dx.doi.org/10.1093/nar/gkad455
_version_ 1785068389039865856
author Capriotti, Emidio
Fariselli, Piero
author_facet Capriotti, Emidio
Fariselli, Piero
author_sort Capriotti, Emidio
collection PubMed
description One of the primary challenges in human genetics is determining the functional impact of single nucleotide variants (SNVs) and insertion and deletions (InDels), whether coding or noncoding. In the past, methods have been created to detect disease-related single amino acid changes, but only some can assess the influence of noncoding variations. CADD is the most commonly used and advanced algorithm for predicting the diverse effects of genome variations. It employs a combination of sequence conservation and functional features derived from the ENCODE project data. To use CADD, a large set of pre-calculated information must be downloaded during the installation process. To streamline the variant annotation process, we developed PhD-SNP(g), a machine-learning tool that is easy to install and lightweight, relying solely on sequence-based features. Here we present an updated version, trained on a larger dataset, that can also predict the impact of the InDel variations. Despite its simplicity, PhD-SNP(g) performs similarly to CADD, making it ideal for rapid genome interpretation and as a benchmark for tool development.
format Online
Article
Text
id pubmed-10320148
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103201482023-07-06 PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants Capriotti, Emidio Fariselli, Piero Nucleic Acids Res Web Server Issue One of the primary challenges in human genetics is determining the functional impact of single nucleotide variants (SNVs) and insertion and deletions (InDels), whether coding or noncoding. In the past, methods have been created to detect disease-related single amino acid changes, but only some can assess the influence of noncoding variations. CADD is the most commonly used and advanced algorithm for predicting the diverse effects of genome variations. It employs a combination of sequence conservation and functional features derived from the ENCODE project data. To use CADD, a large set of pre-calculated information must be downloaded during the installation process. To streamline the variant annotation process, we developed PhD-SNP(g), a machine-learning tool that is easy to install and lightweight, relying solely on sequence-based features. Here we present an updated version, trained on a larger dataset, that can also predict the impact of the InDel variations. Despite its simplicity, PhD-SNP(g) performs similarly to CADD, making it ideal for rapid genome interpretation and as a benchmark for tool development. Oxford University Press 2023-05-29 /pmc/articles/PMC10320148/ /pubmed/37246737 http://dx.doi.org/10.1093/nar/gkad455 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Web Server Issue
Capriotti, Emidio
Fariselli, Piero
PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants
title PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants
title_full PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants
title_fullStr PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants
title_full_unstemmed PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants
title_short PhD-SNPg: updating a webserver and lightweight tool for scoring nucleotide variants
title_sort phd-snpg: updating a webserver and lightweight tool for scoring nucleotide variants
topic Web Server Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320148/
https://www.ncbi.nlm.nih.gov/pubmed/37246737
http://dx.doi.org/10.1093/nar/gkad455
work_keys_str_mv AT capriottiemidio phdsnpgupdatingawebserverandlightweighttoolforscoringnucleotidevariants
AT farisellipiero phdsnpgupdatingawebserverandlightweighttoolforscoringnucleotidevariants