Cargando…

Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms

BACKGROUND: Next-generation sequencing (NGS) panels for mature B-cell neoplasms (MBNs) are widely applied clinically but have yet to be routinely used in a manner that is suitable for subtype differential diagnosis. This study retrospectively investigated newly diagnosed cases of MBNs from our labor...

Descripción completa

Detalles Bibliográficos
Autores principales: Mu, Yafei, Chen, Yuxin, Meng, Yuhuan, Chen, Tao, Fan, Xijie, Yuan, Jiecheng, Lin, Junwei, Pan, Jianhua, Li, Guibin, Feng, Jinghua, Diao, Kaiyuan, Li, Yinghua, Yu, Shihui, Liu, Lingling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10436202/
https://www.ncbi.nlm.nih.gov/pubmed/37601650
http://dx.doi.org/10.3389/fonc.2023.1160383
_version_ 1785092269614825472
author Mu, Yafei
Chen, Yuxin
Meng, Yuhuan
Chen, Tao
Fan, Xijie
Yuan, Jiecheng
Lin, Junwei
Pan, Jianhua
Li, Guibin
Feng, Jinghua
Diao, Kaiyuan
Li, Yinghua
Yu, Shihui
Liu, Lingling
author_facet Mu, Yafei
Chen, Yuxin
Meng, Yuhuan
Chen, Tao
Fan, Xijie
Yuan, Jiecheng
Lin, Junwei
Pan, Jianhua
Li, Guibin
Feng, Jinghua
Diao, Kaiyuan
Li, Yinghua
Yu, Shihui
Liu, Lingling
author_sort Mu, Yafei
collection PubMed
description BACKGROUND: Next-generation sequencing (NGS) panels for mature B-cell neoplasms (MBNs) are widely applied clinically but have yet to be routinely used in a manner that is suitable for subtype differential diagnosis. This study retrospectively investigated newly diagnosed cases of MBNs from our laboratory to investigate mutation landscapes in Chinese patients with MBNs and to combine mutational information and machine learning (ML) into clinical applications for MBNs, especially for subtype classification. METHODS: Samples from the Catalogue Of Somatic Mutations In Cancer (COSMIC) database were collected for ML model construction and cases from our laboratory were used for ML model validation. Five repeats of 10-fold cross-validation Random Forest algorithm was used for ML model construction. Mutation detection was performed by NGS and tumor cell size was confirmed by cell morphology and/or flow cytometry in our laboratory. RESULTS: Totally 849 newly diagnosed MBN cases from our laboratory were retrospectively identified and included in mutational landscape analyses. Patterns of gene mutations in a variety of MBN subtypes were found, important to investigate tumorigenesis in MBNs. A long list of novel mutations was revealed, valuable to both functional studies and clinical applications. By combining gene mutation information revealed by NGS and ML, we established ML models that provide valuable information for MBN subtype classification. In total, 8895 cases of 8 subtypes of MBNs in the COSMIC database were collected and utilized for ML model construction, and the models were validated on the 849 MBN cases from our laboratory. A series of ML models was constructed in this study, and the most efficient model, with an accuracy of 0.87, was based on integration of NGS testing and tumor cell sizes. CONCLUSIONS: The ML models were of great significance in the differential diagnosis of all cases and different MBN subtypes. Additionally, using NGS results to assist in subtype classification of MBNs by method of ML has positive clinical potential.
format Online
Article
Text
id pubmed-10436202
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104362022023-08-19 Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms Mu, Yafei Chen, Yuxin Meng, Yuhuan Chen, Tao Fan, Xijie Yuan, Jiecheng Lin, Junwei Pan, Jianhua Li, Guibin Feng, Jinghua Diao, Kaiyuan Li, Yinghua Yu, Shihui Liu, Lingling Front Oncol Oncology BACKGROUND: Next-generation sequencing (NGS) panels for mature B-cell neoplasms (MBNs) are widely applied clinically but have yet to be routinely used in a manner that is suitable for subtype differential diagnosis. This study retrospectively investigated newly diagnosed cases of MBNs from our laboratory to investigate mutation landscapes in Chinese patients with MBNs and to combine mutational information and machine learning (ML) into clinical applications for MBNs, especially for subtype classification. METHODS: Samples from the Catalogue Of Somatic Mutations In Cancer (COSMIC) database were collected for ML model construction and cases from our laboratory were used for ML model validation. Five repeats of 10-fold cross-validation Random Forest algorithm was used for ML model construction. Mutation detection was performed by NGS and tumor cell size was confirmed by cell morphology and/or flow cytometry in our laboratory. RESULTS: Totally 849 newly diagnosed MBN cases from our laboratory were retrospectively identified and included in mutational landscape analyses. Patterns of gene mutations in a variety of MBN subtypes were found, important to investigate tumorigenesis in MBNs. A long list of novel mutations was revealed, valuable to both functional studies and clinical applications. By combining gene mutation information revealed by NGS and ML, we established ML models that provide valuable information for MBN subtype classification. In total, 8895 cases of 8 subtypes of MBNs in the COSMIC database were collected and utilized for ML model construction, and the models were validated on the 849 MBN cases from our laboratory. A series of ML models was constructed in this study, and the most efficient model, with an accuracy of 0.87, was based on integration of NGS testing and tumor cell sizes. CONCLUSIONS: The ML models were of great significance in the differential diagnosis of all cases and different MBN subtypes. Additionally, using NGS results to assist in subtype classification of MBNs by method of ML has positive clinical potential. Frontiers Media S.A. 2023-08-03 /pmc/articles/PMC10436202/ /pubmed/37601650 http://dx.doi.org/10.3389/fonc.2023.1160383 Text en Copyright © 2023 Mu, Chen, Meng, Chen, Fan, Yuan, Lin, Pan, Li, Feng, Diao, Li, Yu and Liu https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Mu, Yafei
Chen, Yuxin
Meng, Yuhuan
Chen, Tao
Fan, Xijie
Yuan, Jiecheng
Lin, Junwei
Pan, Jianhua
Li, Guibin
Feng, Jinghua
Diao, Kaiyuan
Li, Yinghua
Yu, Shihui
Liu, Lingling
Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms
title Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms
title_full Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms
title_fullStr Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms
title_full_unstemmed Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms
title_short Machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature B-cell neoplasms
title_sort machine learning models-based on integration of next-generation sequencing testing and tumor cell sizes improve subtype classification of mature b-cell neoplasms
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10436202/
https://www.ncbi.nlm.nih.gov/pubmed/37601650
http://dx.doi.org/10.3389/fonc.2023.1160383
work_keys_str_mv AT muyafei machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT chenyuxin machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT mengyuhuan machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT chentao machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT fanxijie machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT yuanjiecheng machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT linjunwei machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT panjianhua machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT liguibin machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT fengjinghua machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT diaokaiyuan machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT liyinghua machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT yushihui machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms
AT liulingling machinelearningmodelsbasedonintegrationofnextgenerationsequencingtestingandtumorcellsizesimprovesubtypeclassificationofmaturebcellneoplasms