Cargando…
PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms
Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be u...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245922/ https://www.ncbi.nlm.nih.gov/pubmed/35782867 http://dx.doi.org/10.3389/fmolb.2022.867572 |
_version_ | 1784738856366505984 |
---|---|
author | Yang, Yang Shao, Aibin Vihinen, Mauno |
author_facet | Yang, Yang Shao, Aibin Vihinen, Mauno |
author_sort | Yang, Yang |
collection | PubMed |
description | Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be used for other organisms, the performance and species range have been limited. We developed a novel variant pathogenicity/tolerance predictor for amino acid substitutions in any organism. The method, PON-All, is a machine learning tool trained on human, animal, and plant variants. Two versions are provided, one with Gene Ontology (GO) annotations and another without these details. GO annotations are not available or are partial for many organisms of interest. The methods provide predictions for three classes: pathogenic, benign, and variants of unknown significance. On the blind test, when using GO annotations, accuracy was 0.913 and MCC 0.827. When GO features were not used, accuracy was 0.856 and MCC 0.712. The performance is the best for human and plant variants and somewhat lower for animal variants because the number of known disease-causing variants in animals is rather small. The method was compared to several other tools and was found to have superior performance. PON-All is freely available at http://structure.bmc.lu.se/PON-All and http://8.133.174.28:8999/. |
format | Online Article Text |
id | pubmed-9245922 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92459222022-07-01 PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms Yang, Yang Shao, Aibin Vihinen, Mauno Front Mol Biosci Molecular Biosciences Genetic variations are investigated in human and many other organisms for many purposes (e.g., to aid in clinical diagnosis). Interpretation of the identified variations can be challenging. Although some dedicated prediction methods have been developed and some tools for human variants can also be used for other organisms, the performance and species range have been limited. We developed a novel variant pathogenicity/tolerance predictor for amino acid substitutions in any organism. The method, PON-All, is a machine learning tool trained on human, animal, and plant variants. Two versions are provided, one with Gene Ontology (GO) annotations and another without these details. GO annotations are not available or are partial for many organisms of interest. The methods provide predictions for three classes: pathogenic, benign, and variants of unknown significance. On the blind test, when using GO annotations, accuracy was 0.913 and MCC 0.827. When GO features were not used, accuracy was 0.856 and MCC 0.712. The performance is the best for human and plant variants and somewhat lower for animal variants because the number of known disease-causing variants in animals is rather small. The method was compared to several other tools and was found to have superior performance. PON-All is freely available at http://structure.bmc.lu.se/PON-All and http://8.133.174.28:8999/. Frontiers Media S.A. 2022-06-16 /pmc/articles/PMC9245922/ /pubmed/35782867 http://dx.doi.org/10.3389/fmolb.2022.867572 Text en Copyright © 2022 Yang, Shao and Vihinen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Yang, Yang Shao, Aibin Vihinen, Mauno PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms |
title | PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms |
title_full | PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms |
title_fullStr | PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms |
title_full_unstemmed | PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms |
title_short | PON-All: Amino Acid Substitution Tolerance Predictor for All Organisms |
title_sort | pon-all: amino acid substitution tolerance predictor for all organisms |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245922/ https://www.ncbi.nlm.nih.gov/pubmed/35782867 http://dx.doi.org/10.3389/fmolb.2022.867572 |
work_keys_str_mv | AT yangyang ponallaminoacidsubstitutiontolerancepredictorforallorganisms AT shaoaibin ponallaminoacidsubstitutiontolerancepredictorforallorganisms AT vihinenmauno ponallaminoacidsubstitutiontolerancepredictorforallorganisms |