Cargando…

PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms

Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be u...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yang, Shao, Aibin, Vihinen, Mauno
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245922/
https://www.ncbi.nlm.nih.gov/pubmed/35782867
http://dx.doi.org/10.3389/fmolb.2022.867572
_version_ 1784738856366505984
author Yang, Yang
Shao, Aibin
Vihinen, Mauno
author_facet Yang, Yang
Shao, Aibin
Vihinen, Mauno
author_sort Yang, Yang
collection PubMed
description Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be used for other organisms, the performance and species range have been limited. We developed a novel variant pathogenicity/tolerance predictor for amino acid substitutions in any organism. The method, PON-All, is a machine learning tool trained on human, animal, and plant variants. Two versions are provided, one with Gene Ontology (GO) annotations and another without these details. GO annotations are not available or are partial for many organisms of interest. The methods provide predictions for three classes: pathogenic, benign, and variants of unknown significance. On the blind test, when using GO annotations, accuracy was 0.913 and MCC 0.827. When GO features were not used, accuracy was 0.856 and MCC 0.712. The performance is the best for human and plant variants and somewhat lower for animal variants because the number of known disease-causing variants in animals is rather small. The method was compared to several other tools and was found to have superior performance. PON-All is freely available at http://structure.bmc.lu.se/PON-All and http://8.133.174.28:8999/.
format Online
Article
Text
id pubmed-9245922
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92459222022-07-01 PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms Yang, Yang Shao, Aibin Vihinen, Mauno Front Mol Biosci Molecular Biosciences Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be used for other organisms, the performance and species range have been limited. We developed a novel variant pathogenicity/tolerance predictor for amino acid substitutions in any organism. The method, PON-All, is a machine learning tool trained on human, animal, and plant variants. Two versions are provided, one with Gene Ontology (GO) annotations and another without these details. GO annotations are not available or are partial for many organisms of interest. The methods provide predictions for three classes: pathogenic, benign, and variants of unknown significance. On the blind test, when using GO annotations, accuracy was 0.913 and MCC 0.827. When GO features were not used, accuracy was 0.856 and MCC 0.712. The performance is the best for human and plant variants and somewhat lower for animal variants because the number of known disease-causing variants in animals is rather small. The method was compared to several other tools and was found to have superior performance. PON-All is freely available at http://structure.bmc.lu.se/PON-All and http://8.133.174.28:8999/. Frontiers Media S.A. 2022-06-16 /pmc/articles/PMC9245922/ /pubmed/35782867 http://dx.doi.org/10.3389/fmolb.2022.867572 Text en Copyright © 2022 Yang, Shao and Vihinen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Yang, Yang
Shao, Aibin
Vihinen, Mauno
PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
title PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
title_full PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
title_fullStr PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
title_full_unstemmed PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
title_short PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
title_sort pon-all: amino acid substitution tolerance predictor for all organisms
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245922/
https://www.ncbi.nlm.nih.gov/pubmed/35782867
http://dx.doi.org/10.3389/fmolb.2022.867572
work_keys_str_mv AT yangyang ponallaminoacidsubstitutiontolerancepredictorforallorganisms
AT shaoaibin ponallaminoacidsubstitutiontolerancepredictorforallorganisms
AT vihinenmauno ponallaminoacidsubstitutiontolerancepredictorforallorganisms