Cargando…

Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses

BACKGROUND: Diabetic retinopathy (DR) is one of the leading causes of blindness in the United States and world-wide. DR is a silent disease that may go unnoticed until it is too late for effective treatment. Therefore, early detection could improve the chances of therapeutic interventions that would...

Descripción completa

Detalles Bibliográficos
Autores principales: Casanova, Ramon, Saldana, Santiago, Chew, Emily Y., Danis, Ronald P., Greven, Craig M., Ambrosius, Walter T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062420/
https://www.ncbi.nlm.nih.gov/pubmed/24940623
http://dx.doi.org/10.1371/journal.pone.0098587
_version_ 1782321645217644544
author Casanova, Ramon
Saldana, Santiago
Chew, Emily Y.
Danis, Ronald P.
Greven, Craig M.
Ambrosius, Walter T.
author_facet Casanova, Ramon
Saldana, Santiago
Chew, Emily Y.
Danis, Ronald P.
Greven, Craig M.
Ambrosius, Walter T.
author_sort Casanova, Ramon
collection PubMed
description BACKGROUND: Diabetic retinopathy (DR) is one of the leading causes of blindness in the United States and world-wide. DR is a silent disease that may go unnoticed until it is too late for effective treatment. Therefore, early detection could improve the chances of therapeutic interventions that would alleviate its effects. METHODOLOGY: Graded fundus photography and systemic data from 3443 ACCORD-Eye Study participants were used to estimate Random Forest (RF) and logistic regression classifiers. We studied the impact of sample size on classifier performance and the possibility of using RF generated class conditional probabilities as metrics describing DR risk. RF measures of variable importance are used to detect factors that affect classification performance. PRINCIPAL FINDINGS: Both types of data were informative when discriminating participants with or without DR. RF based models produced much higher classification accuracy than those based on logistic regression. Combining both types of data did not increase accuracy but did increase statistical discrimination of healthy participants who subsequently did or did not have DR events during four years of follow-up. RF variable importance criteria revealed that microaneurysms counts in both eyes seemed to play the most important role in discrimination among the graded fundus variables, while the number of medicines and diabetes duration were the most relevant among the systemic variables. CONCLUSIONS AND SIGNIFICANCE: We have introduced RF methods to DR classification analyses based on fundus photography data. In addition, we propose an approach to DR risk assessment based on metrics derived from graded fundus photography and systemic data. Our results suggest that RF methods could be a valuable tool to diagnose DR diagnosis and evaluate its progression.
format Online
Article
Text
id pubmed-4062420
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40624202014-06-24 Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses Casanova, Ramon Saldana, Santiago Chew, Emily Y. Danis, Ronald P. Greven, Craig M. Ambrosius, Walter T. PLoS One Research Article BACKGROUND: Diabetic retinopathy (DR) is one of the leading causes of blindness in the United States and world-wide. DR is a silent disease that may go unnoticed until it is too late for effective treatment. Therefore, early detection could improve the chances of therapeutic interventions that would alleviate its effects. METHODOLOGY: Graded fundus photography and systemic data from 3443 ACCORD-Eye Study participants were used to estimate Random Forest (RF) and logistic regression classifiers. We studied the impact of sample size on classifier performance and the possibility of using RF generated class conditional probabilities as metrics describing DR risk. RF measures of variable importance are used to detect factors that affect classification performance. PRINCIPAL FINDINGS: Both types of data were informative when discriminating participants with or without DR. RF based models produced much higher classification accuracy than those based on logistic regression. Combining both types of data did not increase accuracy but did increase statistical discrimination of healthy participants who subsequently did or did not have DR events during four years of follow-up. RF variable importance criteria revealed that microaneurysms counts in both eyes seemed to play the most important role in discrimination among the graded fundus variables, while the number of medicines and diabetes duration were the most relevant among the systemic variables. CONCLUSIONS AND SIGNIFICANCE: We have introduced RF methods to DR classification analyses based on fundus photography data. In addition, we propose an approach to DR risk assessment based on metrics derived from graded fundus photography and systemic data. Our results suggest that RF methods could be a valuable tool to diagnose DR diagnosis and evaluate its progression. Public Library of Science 2014-06-18 /pmc/articles/PMC4062420/ /pubmed/24940623 http://dx.doi.org/10.1371/journal.pone.0098587 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Casanova, Ramon
Saldana, Santiago
Chew, Emily Y.
Danis, Ronald P.
Greven, Craig M.
Ambrosius, Walter T.
Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses
title Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses
title_full Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses
title_fullStr Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses
title_full_unstemmed Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses
title_short Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses
title_sort application of random forests methods to diabetic retinopathy classification analyses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062420/
https://www.ncbi.nlm.nih.gov/pubmed/24940623
http://dx.doi.org/10.1371/journal.pone.0098587
work_keys_str_mv AT casanovaramon applicationofrandomforestsmethodstodiabeticretinopathyclassificationanalyses
AT saldanasantiago applicationofrandomforestsmethodstodiabeticretinopathyclassificationanalyses
AT chewemilyy applicationofrandomforestsmethodstodiabeticretinopathyclassificationanalyses
AT danisronaldp applicationofrandomforestsmethodstodiabeticretinopathyclassificationanalyses
AT grevencraigm applicationofrandomforestsmethodstodiabeticretinopathyclassificationanalyses
AT ambrosiuswaltert applicationofrandomforestsmethodstodiabeticretinopathyclassificationanalyses