Cargando…

MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine

Microsatellite instability (MSI) is characterized by high degree of polymorphism in microsatellite lengths due to deficiency in mismatch repair (MMR) system. MSI is associated with several tumor types and its status can be considered as an important indicator for patient prognosis. Conventional clin...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chen, Liang, Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6277498/
https://www.ncbi.nlm.nih.gov/pubmed/30510242
http://dx.doi.org/10.1038/s41598-018-35682-z
_version_ 1783378165005025280
author Wang, Chen
Liang, Chun
author_facet Wang, Chen
Liang, Chun
author_sort Wang, Chen
collection PubMed
description Microsatellite instability (MSI) is characterized by high degree of polymorphism in microsatellite lengths due to deficiency in mismatch repair (MMR) system. MSI is associated with several tumor types and its status can be considered as an important indicator for patient prognosis. Conventional clinical diagnosis of MSI examines PCR products of a panel of microsatellite markers using electrophoresis (MSI-PCR), which is laborious, costly, and time consuming. We developed MSIpred, a python package for automatic MSI classification using a machine learning technology – support vector machine (SVM). MSIpred computes 22 features characterizing tumor somatic mutational load from mutation data in mutation annotation format (MAF) generated from paired tumor-normal exome sequencing data, subsequently using these features to predict tumor MSI status with a SVM classifier trained by MAF data of 1074 tumors belonging to four types. Evaluation of MSIpred on an independent testing set, MAF data of another 358 tumors, achieved overall accuracy of ≥98% and area under receiver operating characteristic (ROC) curve of 0.967. Further analysis on discrepant cases revealed that discrepancies were partially due to misclassification of MSI-PCR. Additional testing of MSIpred on non-TCGA data also validated its good classification performance. These results indicated that MSIpred is a robust pan-tumor MSI classification tool and can serve as a complementary diagnostic to MSI-PCR in MSI diagnosis.
format Online
Article
Text
id pubmed-6277498
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-62774982018-12-06 MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine Wang, Chen Liang, Chun Sci Rep Article Microsatellite instability (MSI) is characterized by high degree of polymorphism in microsatellite lengths due to deficiency in mismatch repair (MMR) system. MSI is associated with several tumor types and its status can be considered as an important indicator for patient prognosis. Conventional clinical diagnosis of MSI examines PCR products of a panel of microsatellite markers using electrophoresis (MSI-PCR), which is laborious, costly, and time consuming. We developed MSIpred, a python package for automatic MSI classification using a machine learning technology – support vector machine (SVM). MSIpred computes 22 features characterizing tumor somatic mutational load from mutation data in mutation annotation format (MAF) generated from paired tumor-normal exome sequencing data, subsequently using these features to predict tumor MSI status with a SVM classifier trained by MAF data of 1074 tumors belonging to four types. Evaluation of MSIpred on an independent testing set, MAF data of another 358 tumors, achieved overall accuracy of ≥98% and area under receiver operating characteristic (ROC) curve of 0.967. Further analysis on discrepant cases revealed that discrepancies were partially due to misclassification of MSI-PCR. Additional testing of MSIpred on non-TCGA data also validated its good classification performance. These results indicated that MSIpred is a robust pan-tumor MSI classification tool and can serve as a complementary diagnostic to MSI-PCR in MSI diagnosis. Nature Publishing Group UK 2018-12-03 /pmc/articles/PMC6277498/ /pubmed/30510242 http://dx.doi.org/10.1038/s41598-018-35682-z Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Chen
Liang, Chun
MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
title MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
title_full MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
title_fullStr MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
title_full_unstemmed MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
title_short MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
title_sort msipred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6277498/
https://www.ncbi.nlm.nih.gov/pubmed/30510242
http://dx.doi.org/10.1038/s41598-018-35682-z
work_keys_str_mv AT wangchen msipredapythonpackagefortumormicrosatelliteinstabilityclassificationfromtumormutationannotationdatausingasupportvectormachine
AT liangchun msipredapythonpackagefortumormicrosatelliteinstabilityclassificationfromtumormutationannotationdatausingasupportvectormachine