Cargando…

InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution

In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each...

Descripción completa

Detalles Bibliográficos
Autores principales: Takeda, Jun-ichi, Nanatsue, Kentaro, Yamagishi, Ryosuke, Ito, Mikako, Haga, Nobuhiko, Hirata, Hiromi, Ogi, Tomoo, Ohno, Kinji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671370/
https://www.ncbi.nlm.nih.gov/pubmed/33543123
http://dx.doi.org/10.1093/nargab/lqaa038
_version_ 1783610916491755520
author Takeda, Jun-ichi
Nanatsue, Kentaro
Yamagishi, Ryosuke
Ito, Mikako
Haga, Nobuhiko
Hirata, Hiromi
Ogi, Tomoo
Ohno, Kinji
author_facet Takeda, Jun-ichi
Nanatsue, Kentaro
Yamagishi, Ryosuke
Ito, Mikako
Haga, Nobuhiko
Hirata, Hiromi
Ogi, Tomoo
Ohno, Kinji
author_sort Takeda, Jun-ichi
collection PubMed
description In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each amino acid substitution to differentiate pathogenic nsSNVs in the Human Gene Mutation Database and common nsSNVs in dbSNP. We named a set of our models ‘Individual Meta RF’ (InMeRF). Ten-fold cross-validation of InMeRF showed that the areas under the curves (AUCs) of receiver operating characteristic (ROC) and precision–recall curves were on average 0.941 and 0.957, respectively. To compare InMeRF with seven other tools, the eight tools were generated using the same training dataset, and were compared using the same three testing datasets. ROC-AUCs of InMeRF were ranked first in the eight tools. We applied InMeRF to 155 pathogenic and 125 common nsSNVs in seven major genes causing congenital myasthenic syndromes, as well as in VANGL1 causing spina bifida, and found that the sensitivity and specificity of InMeRF were 0.942 and 0.848, respectively. We made the InMeRF web service, and also made genome-wide InMeRF scores available online (https://www.med.nagoya-u.ac.jp/neurogenetics/InMeRF/).
format Online
Article
Text
id pubmed-7671370
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-76713702021-02-03 InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution Takeda, Jun-ichi Nanatsue, Kentaro Yamagishi, Ryosuke Ito, Mikako Haga, Nobuhiko Hirata, Hiromi Ogi, Tomoo Ohno, Kinji NAR Genom Bioinform Standard Article In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each amino acid substitution to differentiate pathogenic nsSNVs in the Human Gene Mutation Database and common nsSNVs in dbSNP. We named a set of our models ‘Individual Meta RF’ (InMeRF). Ten-fold cross-validation of InMeRF showed that the areas under the curves (AUCs) of receiver operating characteristic (ROC) and precision–recall curves were on average 0.941 and 0.957, respectively. To compare InMeRF with seven other tools, the eight tools were generated using the same training dataset, and were compared using the same three testing datasets. ROC-AUCs of InMeRF were ranked first in the eight tools. We applied InMeRF to 155 pathogenic and 125 common nsSNVs in seven major genes causing congenital myasthenic syndromes, as well as in VANGL1 causing spina bifida, and found that the sensitivity and specificity of InMeRF were 0.942 and 0.848, respectively. We made the InMeRF web service, and also made genome-wide InMeRF scores available online (https://www.med.nagoya-u.ac.jp/neurogenetics/InMeRF/). Oxford University Press 2020-05-26 /pmc/articles/PMC7671370/ /pubmed/33543123 http://dx.doi.org/10.1093/nargab/lqaa038 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Takeda, Jun-ichi
Nanatsue, Kentaro
Yamagishi, Ryosuke
Ito, Mikako
Haga, Nobuhiko
Hirata, Hiromi
Ogi, Tomoo
Ohno, Kinji
InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
title InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
title_full InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
title_fullStr InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
title_full_unstemmed InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
title_short InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
title_sort inmerf: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671370/
https://www.ncbi.nlm.nih.gov/pubmed/33543123
http://dx.doi.org/10.1093/nargab/lqaa038
work_keys_str_mv AT takedajunichi inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT nanatsuekentaro inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT yamagishiryosuke inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT itomikako inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT haganobuhiko inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT hiratahiromi inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT ogitomoo inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution
AT ohnokinji inmerfpredictionofpathogenicityofmissensevariantsbyindividualmodelingforeachaminoacidsubstitution