Cargando…

Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data

BACKGROUND: The classification of diffuse large B-cell lymphomas into Germinal Center (GCB) and non-GC subtypes defines disease subgroups which are different both in terms of gene expression and prognosis. Given their clinical significance, several classification algorithms have been designed, some...

Descripción completa

Detalles Bibliográficos
Autor principal: Costa, Carlos Bruno Tavares Da
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Medknow Publications & Media Pvt Ltd 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6029013/
https://www.ncbi.nlm.nih.gov/pubmed/30034919
http://dx.doi.org/10.4103/jpi.jpi_14_18
_version_ 1783336878752137216
author Costa, Carlos Bruno Tavares Da
author_facet Costa, Carlos Bruno Tavares Da
author_sort Costa, Carlos Bruno Tavares Da
collection PubMed
description BACKGROUND: The classification of diffuse large B-cell lymphomas into Germinal Center (GCB) and non-GC subtypes defines disease subgroups which are different both in terms of gene expression and prognosis. Given their clinical significance, several classification algorithms have been designed, some by making use of widely availability immunohistochemical techniques. Despite their high concordance with gene expression profiles (GEP) and prognostic value, these algorithms were based on technical and biological assumptions that could be improved in terms of performance for classification. METHODS: In order to overcome this limitation, a new algorithm was obtained by analyzing a previously published dataset of 475 patients by using an automatic classification tree method. RESULTS: The resulting algorithm classifies correctly 91.6% of the cases when compared to GEP, displaying a Receiver-Operator Characteristic (ROC) area under the curve of 0.934. Noteworthy features of this algorithm include the capability to classify GEP-unclassifiable cases and a significant prognostic value, both in terms of overall survival (60 months for non-GC vs not reached for GCB, P = 0.007) and progression-free survival (61.9 months vs not reached, P = 0.017). CONCLUSION: By using a machine learning classification method that avoids most pre-assumptions, the novel algorithm obtained is accurate and maintains relevant features for clinical implementation.
format Online
Article
Text
id pubmed-6029013
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-60290132018-07-20 Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data Costa, Carlos Bruno Tavares Da J Pathol Inform Brief Report BACKGROUND: The classification of diffuse large B-cell lymphomas into Germinal Center (GCB) and non-GC subtypes defines disease subgroups which are different both in terms of gene expression and prognosis. Given their clinical significance, several classification algorithms have been designed, some by making use of widely availability immunohistochemical techniques. Despite their high concordance with gene expression profiles (GEP) and prognostic value, these algorithms were based on technical and biological assumptions that could be improved in terms of performance for classification. METHODS: In order to overcome this limitation, a new algorithm was obtained by analyzing a previously published dataset of 475 patients by using an automatic classification tree method. RESULTS: The resulting algorithm classifies correctly 91.6% of the cases when compared to GEP, displaying a Receiver-Operator Characteristic (ROC) area under the curve of 0.934. Noteworthy features of this algorithm include the capability to classify GEP-unclassifiable cases and a significant prognostic value, both in terms of overall survival (60 months for non-GC vs not reached for GCB, P = 0.007) and progression-free survival (61.9 months vs not reached, P = 0.017). CONCLUSION: By using a machine learning classification method that avoids most pre-assumptions, the novel algorithm obtained is accurate and maintains relevant features for clinical implementation. Medknow Publications & Media Pvt Ltd 2018-06-13 /pmc/articles/PMC6029013/ /pubmed/30034919 http://dx.doi.org/10.4103/jpi.jpi_14_18 Text en Copyright: © 2018 Journal of Pathology Informatics http://creativecommons.org/licenses/by-nc-sa/4.0 This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle Brief Report
Costa, Carlos Bruno Tavares Da
Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data
title Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data
title_full Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data
title_fullStr Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data
title_full_unstemmed Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data
title_short Machine Learning Provides an Accurate Classification of Diffuse Large B-Cell Lymphoma from Immunohistochemical Data
title_sort machine learning provides an accurate classification of diffuse large b-cell lymphoma from immunohistochemical data
topic Brief Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6029013/
https://www.ncbi.nlm.nih.gov/pubmed/30034919
http://dx.doi.org/10.4103/jpi.jpi_14_18
work_keys_str_mv AT costacarlosbrunotavaresda machinelearningprovidesanaccurateclassificationofdiffuselargebcelllymphomafromimmunohistochemicaldata