Cargando…

Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods

One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of...

Descripción completa

Detalles Bibliográficos
Autores principales: Mokoatle, Mpho, Mapiye, Darlington, Marivate, Vukosi, Hayes, Vanessa M., Bornman, Riana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9182297/
https://www.ncbi.nlm.nih.gov/pubmed/35679280
http://dx.doi.org/10.1371/journal.pone.0267714
_version_ 1784724000797097984
author Mokoatle, Mpho
Mapiye, Darlington
Marivate, Vukosi
Hayes, Vanessa M.
Bornman, Riana
author_facet Mokoatle, Mpho
Mapiye, Darlington
Marivate, Vukosi
Hayes, Vanessa M.
Bornman, Riana
author_sort Mokoatle, Mpho
collection PubMed
description One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of the pathologist and tends to suffer from the lack of reproducibility of biopsy outcomes across pathologists. As a result, computational approaches are being sought and machine learning has been gaining momentum in the prediction of the Gleason grade group. To date, machine learning literature has addressed this problem by using features from magnetic resonance imaging images, whole slide images, tissue microarrays, gene expression data, and clinical features. However, there is a gap with regards to predicting the Gleason grade group using DNA sequences as the only input source to the machine learning models. In this work, using whole genome sequence data from South African prostate cancer patients, an application of machine learning and biological experiments were combined to understand the challenges that are associated with the prediction of the Gleason grade group. A series of machine learning binary classifiers (XGBoost, LSTM, GRU, LR, RF) were created only relying on DNA sequences input features. All the models were not able to adequately discriminate between the DNA sequences of the studied Gleason grade groups (Gleason grade group 1 and 5). However, the models were further evaluated in the prediction of tumor DNA sequences from matched-normal DNA sequences, given DNA sequences as the only input source. In this new problem, the models performed acceptably better than before with the XGBoost model achieving the highest accuracy of 74 ± 01, F1 score of 79 ± 01, recall of 99 ± 0.0, and precision of 66 ± 0.1.
format Online
Article
Text
id pubmed-9182297
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-91822972022-06-10 Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods Mokoatle, Mpho Mapiye, Darlington Marivate, Vukosi Hayes, Vanessa M. Bornman, Riana PLoS One Research Article One of the most precise methods to detect prostate cancer is by evaluation of a stained biopsy by a pathologist under a microscope. Regions of the tissue are assessed and graded according to the observed histological pattern. However, this is not only laborious, but also relies on the experience of the pathologist and tends to suffer from the lack of reproducibility of biopsy outcomes across pathologists. As a result, computational approaches are being sought and machine learning has been gaining momentum in the prediction of the Gleason grade group. To date, machine learning literature has addressed this problem by using features from magnetic resonance imaging images, whole slide images, tissue microarrays, gene expression data, and clinical features. However, there is a gap with regards to predicting the Gleason grade group using DNA sequences as the only input source to the machine learning models. In this work, using whole genome sequence data from South African prostate cancer patients, an application of machine learning and biological experiments were combined to understand the challenges that are associated with the prediction of the Gleason grade group. A series of machine learning binary classifiers (XGBoost, LSTM, GRU, LR, RF) were created only relying on DNA sequences input features. All the models were not able to adequately discriminate between the DNA sequences of the studied Gleason grade groups (Gleason grade group 1 and 5). However, the models were further evaluated in the prediction of tumor DNA sequences from matched-normal DNA sequences, given DNA sequences as the only input source. In this new problem, the models performed acceptably better than before with the XGBoost model achieving the highest accuracy of 74 ± 01, F1 score of 79 ± 01, recall of 99 ± 0.0, and precision of 66 ± 0.1. Public Library of Science 2022-06-09 /pmc/articles/PMC9182297/ /pubmed/35679280 http://dx.doi.org/10.1371/journal.pone.0267714 Text en © 2022 Mokoatle et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mokoatle, Mpho
Mapiye, Darlington
Marivate, Vukosi
Hayes, Vanessa M.
Bornman, Riana
Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
title Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
title_full Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
title_fullStr Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
title_full_unstemmed Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
title_short Discriminatory Gleason grade group signatures of prostate cancer: An application of machine learning methods
title_sort discriminatory gleason grade group signatures of prostate cancer: an application of machine learning methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9182297/
https://www.ncbi.nlm.nih.gov/pubmed/35679280
http://dx.doi.org/10.1371/journal.pone.0267714
work_keys_str_mv AT mokoatlempho discriminatorygleasongradegroupsignaturesofprostatecanceranapplicationofmachinelearningmethods
AT mapiyedarlington discriminatorygleasongradegroupsignaturesofprostatecanceranapplicationofmachinelearningmethods
AT marivatevukosi discriminatorygleasongradegroupsignaturesofprostatecanceranapplicationofmachinelearningmethods
AT hayesvanessam discriminatorygleasongradegroupsignaturesofprostatecanceranapplicationofmachinelearningmethods
AT bornmanriana discriminatorygleasongradegroupsignaturesofprostatecanceranapplicationofmachinelearningmethods