Cargando…
InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671370/ https://www.ncbi.nlm.nih.gov/pubmed/33543123 http://dx.doi.org/10.1093/nargab/lqaa038 |
_version_ | 1783610916491755520 |
---|---|
author | Takeda, Jun-ichi Nanatsue, Kentaro Yamagishi, Ryosuke Ito, Mikako Haga, Nobuhiko Hirata, Hiromi Ogi, Tomoo Ohno, Kinji |
author_facet | Takeda, Jun-ichi Nanatsue, Kentaro Yamagishi, Ryosuke Ito, Mikako Haga, Nobuhiko Hirata, Hiromi Ogi, Tomoo Ohno, Kinji |
author_sort | Takeda, Jun-ichi |
collection | PubMed |
description | In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each amino acid substitution to differentiate pathogenic nsSNVs in the Human Gene Mutation Database and common nsSNVs in dbSNP. We named a set of our models ‘Individual Meta RF’ (InMeRF). Ten-fold cross-validation of InMeRF showed that the areas under the curves (AUCs) of receiver operating characteristic (ROC) and precision–recall curves were on average 0.941 and 0.957, respectively. To compare InMeRF with seven other tools, the eight tools were generated using the same training dataset, and were compared using the same three testing datasets. ROC-AUCs of InMeRF were ranked first in the eight tools. We applied InMeRF to 155 pathogenic and 125 common nsSNVs in seven major genes causing congenital myasthenic syndromes, as well as in VANGL1 causing spina bifida, and found that the sensitivity and specificity of InMeRF were 0.942 and 0.848, respectively. We made the InMeRF web service, and also made genome-wide InMeRF scores available online (https://www.med.nagoya-u.ac.jp/neurogenetics/InMeRF/). |
format | Online Article Text |
id | pubmed-7671370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-76713702021-02-03 InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution Takeda, Jun-ichi Nanatsue, Kentaro Yamagishi, Ryosuke Ito, Mikako Haga, Nobuhiko Hirata, Hiromi Ogi, Tomoo Ohno, Kinji NAR Genom Bioinform Standard Article In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each amino acid substitution to differentiate pathogenic nsSNVs in the Human Gene Mutation Database and common nsSNVs in dbSNP. We named a set of our models ‘Individual Meta RF’ (InMeRF). Ten-fold cross-validation of InMeRF showed that the areas under the curves (AUCs) of receiver operating characteristic (ROC) and precision–recall curves were on average 0.941 and 0.957, respectively. To compare InMeRF with seven other tools, the eight tools were generated using the same training dataset, and were compared using the same three testing datasets. ROC-AUCs of InMeRF were ranked first in the eight tools. We applied InMeRF to 155 pathogenic and 125 common nsSNVs in seven major genes causing congenital myasthenic syndromes, as well as in VANGL1 causing spina bifida, and found that the sensitivity and specificity of InMeRF were 0.942 and 0.848, respectively. We made the InMeRF web service, and also made genome-wide InMeRF scores available online (https://www.med.nagoya-u.ac.jp/neurogenetics/InMeRF/). Oxford University Press 2020-05-26 /pmc/articles/PMC7671370/ /pubmed/33543123 http://dx.doi.org/10.1093/nargab/lqaa038 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Takeda, Jun-ichi Nanatsue, Kentaro Yamagishi, Ryosuke Ito, Mikako Haga, Nobuhiko Hirata, Hiromi Ogi, Tomoo Ohno, Kinji InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
title | InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
title_full | InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
title_fullStr | InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
title_full_unstemmed | InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
title_short | InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
title_sort | inmerf: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671370/ https://www.ncbi.nlm.nih.gov/pubmed/33543123 http://dx.doi.org/10.1093/nargab/lqaa038 |
work_keys_str_mv | AT takedajunichi inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT nanatsuekentaro inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT yamagishiryosuke inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT itomikako inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT haganobuhiko inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT hiratahiromi inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT ogitomoo inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution AT ohnokinji inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution |