Cargando…
MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine
Microsatellite instability (MSI) is characterized by high degree of polymorphism in microsatellite lengths due to deficiency in mismatch repair (MMR) system. MSI is associated with several tumor types and its status can be considered as an important indicator for patient prognosis. Conventional clin...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6277498/ https://www.ncbi.nlm.nih.gov/pubmed/30510242 http://dx.doi.org/10.1038/s41598-018-35682-z |
_version_ | 1783378165005025280 |
---|---|
author | Wang, Chen Liang, Chun |
author_facet | Wang, Chen Liang, Chun |
author_sort | Wang, Chen |
collection | PubMed |
description | Microsatellite instability (MSI) is characterized by high degree of polymorphism in microsatellite lengths due to deficiency in mismatch repair (MMR) system. MSI is associated with several tumor types and its status can be considered as an important indicator for patient prognosis. Conventional clinical diagnosis of MSI examines PCR products of a panel of microsatellite markers using electrophoresis (MSI-PCR), which is laborious, costly, and time consuming. We developed MSIpred, a python package for automatic MSI classification using a machine learning technology – support vector machine (SVM). MSIpred computes 22 features characterizing tumor somatic mutational load from mutation data in mutation annotation format (MAF) generated from paired tumor-normal exome sequencing data, subsequently using these features to predict tumor MSI status with a SVM classifier trained by MAF data of 1074 tumors belonging to four types. Evaluation of MSIpred on an independent testing set, MAF data of another 358 tumors, achieved overall accuracy of ≥98% and area under receiver operating characteristic (ROC) curve of 0.967. Further analysis on discrepant cases revealed that discrepancies were partially due to misclassification of MSI-PCR. Additional testing of MSIpred on non-TCGA data also validated its good classification performance. These results indicated that MSIpred is a robust pan-tumor MSI classification tool and can serve as a complementary diagnostic to MSI-PCR in MSI diagnosis. |
format | Online Article Text |
id | pubmed-6277498 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-62774982018-12-06 MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine Wang, Chen Liang, Chun Sci Rep Article Microsatellite instability (MSI) is characterized by high degree of polymorphism in microsatellite lengths due to deficiency in mismatch repair (MMR) system. MSI is associated with several tumor types and its status can be considered as an important indicator for patient prognosis. Conventional clinical diagnosis of MSI examines PCR products of a panel of microsatellite markers using electrophoresis (MSI-PCR), which is laborious, costly, and time consuming. We developed MSIpred, a python package for automatic MSI classification using a machine learning technology – support vector machine (SVM). MSIpred computes 22 features characterizing tumor somatic mutational load from mutation data in mutation annotation format (MAF) generated from paired tumor-normal exome sequencing data, subsequently using these features to predict tumor MSI status with a SVM classifier trained by MAF data of 1074 tumors belonging to four types. Evaluation of MSIpred on an independent testing set, MAF data of another 358 tumors, achieved overall accuracy of ≥98% and area under receiver operating characteristic (ROC) curve of 0.967. Further analysis on discrepant cases revealed that discrepancies were partially due to misclassification of MSI-PCR. Additional testing of MSIpred on non-TCGA data also validated its good classification performance. These results indicated that MSIpred is a robust pan-tumor MSI classification tool and can serve as a complementary diagnostic to MSI-PCR in MSI diagnosis. Nature Publishing Group UK 2018-12-03 /pmc/articles/PMC6277498/ /pubmed/30510242 http://dx.doi.org/10.1038/s41598-018-35682-z Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Wang, Chen Liang, Chun MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
title | MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
title_full | MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
title_fullStr | MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
title_full_unstemmed | MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
title_short | MSIpred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
title_sort | msipred: a python package for tumor microsatellite instability classification from tumor mutation annotation data using a support vector machine |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6277498/ https://www.ncbi.nlm.nih.gov/pubmed/30510242 http://dx.doi.org/10.1038/s41598-018-35682-z |
work_keys_str_mv | AT wangchen msipredapythonpackagefortumormicrosatelliteinstabilityclassificationfromtumormutationannotationdatausingasupportvectormachine AT liangchun msipredapythonpackagefortumormicrosatelliteinstabilityclassificationfromtumormutationannotationdatausingasupportvectormachine |