Cargando…

Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data

BACKGROUND: Ineffective risk stratification can delay diagnosis of serious disease in patients with hematuria. We applied a systems biology approach to analyze clinical, demographic and biomarker measurements (n = 29) collected from 157 hematuric patients: 80 urothelial cancer (UC) and 77 controls w...

Descripción completa

Detalles Bibliográficos
Autores principales: Emmert-Streib, Frank, Abogunrin, Funso, de Matos Simoes, Ricardo, Duggan, Brian, Ruddock, Mark W, Reid, Cherith N, Roddy, Owen, White, Lisa, O'Kane, Hugh F, O'Rourke, Declan, Anderson, Neil H, Nambirajan, Thiagarajan, Williamson, Kate E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3570289/
https://www.ncbi.nlm.nih.gov/pubmed/23327460
http://dx.doi.org/10.1186/1741-7015-11-12
_version_ 1782259039269289984
author Emmert-Streib, Frank
Abogunrin, Funso
de Matos Simoes, Ricardo
Duggan, Brian
Ruddock, Mark W
Reid, Cherith N
Roddy, Owen
White, Lisa
O'Kane, Hugh F
O'Rourke, Declan
Anderson, Neil H
Nambirajan, Thiagarajan
Williamson, Kate E
author_facet Emmert-Streib, Frank
Abogunrin, Funso
de Matos Simoes, Ricardo
Duggan, Brian
Ruddock, Mark W
Reid, Cherith N
Roddy, Owen
White, Lisa
O'Kane, Hugh F
O'Rourke, Declan
Anderson, Neil H
Nambirajan, Thiagarajan
Williamson, Kate E
author_sort Emmert-Streib, Frank
collection PubMed
description BACKGROUND: Ineffective risk stratification can delay diagnosis of serious disease in patients with hematuria. We applied a systems biology approach to analyze clinical, demographic and biomarker measurements (n = 29) collected from 157 hematuric patients: 80 urothelial cancer (UC) and 77 controls with confounding pathologies. METHODS: On the basis of biomarkers, we conducted agglomerative hierarchical clustering to identify patient and biomarker clusters. We then explored the relationship between the patient clusters and clinical characteristics using Chi-square analyses. We determined classification errors and areas under the receiver operating curve of Random Forest Classifiers (RFC) for patient subpopulations using the biomarker clusters to reduce the dimensionality of the data. RESULTS: Agglomerative clustering identified five patient clusters and seven biomarker clusters. Final diagnoses categories were non-randomly distributed across the five patient clusters. In addition, two of the patient clusters were enriched with patients with 'low cancer-risk' characteristics. The biomarkers which contributed to the diagnostic classifiers for these two patient clusters were similar. In contrast, three of the patient clusters were significantly enriched with patients harboring 'high cancer-risk" characteristics including proteinuria, aggressive pathological stage and grade, and malignant cytology. Patients in these three clusters included controls, that is, patients with other serious disease and patients with cancers other than UC. Biomarkers which contributed to the diagnostic classifiers for the largest 'high cancer- risk' cluster were different than those contributing to the classifiers for the 'low cancer-risk' clusters. Biomarkers which contributed to subpopulations that were split according to smoking status, gender and medication were different. CONCLUSIONS: The systems biology approach applied in this study allowed the hematuric patients to cluster naturally on the basis of the heterogeneity within their biomarker data, into five distinct risk subpopulations. Our findings highlight an approach with the promise to unlock the potential of biomarkers. This will be especially valuable in the field of diagnostic bladder cancer where biomarkers are urgently required. Clinicians could interpret risk classification scores in the context of clinical parameters at the time of triage. This could reduce cystoscopies and enable priority diagnosis of aggressive diseases, leading to improved patient outcomes at reduced costs.
format Online
Article
Text
id pubmed-3570289
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35702892013-02-15 Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data Emmert-Streib, Frank Abogunrin, Funso de Matos Simoes, Ricardo Duggan, Brian Ruddock, Mark W Reid, Cherith N Roddy, Owen White, Lisa O'Kane, Hugh F O'Rourke, Declan Anderson, Neil H Nambirajan, Thiagarajan Williamson, Kate E BMC Med Research Article BACKGROUND: Ineffective risk stratification can delay diagnosis of serious disease in patients with hematuria. We applied a systems biology approach to analyze clinical, demographic and biomarker measurements (n = 29) collected from 157 hematuric patients: 80 urothelial cancer (UC) and 77 controls with confounding pathologies. METHODS: On the basis of biomarkers, we conducted agglomerative hierarchical clustering to identify patient and biomarker clusters. We then explored the relationship between the patient clusters and clinical characteristics using Chi-square analyses. We determined classification errors and areas under the receiver operating curve of Random Forest Classifiers (RFC) for patient subpopulations using the biomarker clusters to reduce the dimensionality of the data. RESULTS: Agglomerative clustering identified five patient clusters and seven biomarker clusters. Final diagnoses categories were non-randomly distributed across the five patient clusters. In addition, two of the patient clusters were enriched with patients with 'low cancer-risk' characteristics. The biomarkers which contributed to the diagnostic classifiers for these two patient clusters were similar. In contrast, three of the patient clusters were significantly enriched with patients harboring 'high cancer-risk" characteristics including proteinuria, aggressive pathological stage and grade, and malignant cytology. Patients in these three clusters included controls, that is, patients with other serious disease and patients with cancers other than UC. Biomarkers which contributed to the diagnostic classifiers for the largest 'high cancer- risk' cluster were different than those contributing to the classifiers for the 'low cancer-risk' clusters. Biomarkers which contributed to subpopulations that were split according to smoking status, gender and medication were different. CONCLUSIONS: The systems biology approach applied in this study allowed the hematuric patients to cluster naturally on the basis of the heterogeneity within their biomarker data, into five distinct risk subpopulations. Our findings highlight an approach with the promise to unlock the potential of biomarkers. This will be especially valuable in the field of diagnostic bladder cancer where biomarkers are urgently required. Clinicians could interpret risk classification scores in the context of clinical parameters at the time of triage. This could reduce cystoscopies and enable priority diagnosis of aggressive diseases, leading to improved patient outcomes at reduced costs. BioMed Central 2013-01-17 /pmc/articles/PMC3570289/ /pubmed/23327460 http://dx.doi.org/10.1186/1741-7015-11-12 Text en Copyright ©2013 Emmert-Streib et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Emmert-Streib, Frank
Abogunrin, Funso
de Matos Simoes, Ricardo
Duggan, Brian
Ruddock, Mark W
Reid, Cherith N
Roddy, Owen
White, Lisa
O'Kane, Hugh F
O'Rourke, Declan
Anderson, Neil H
Nambirajan, Thiagarajan
Williamson, Kate E
Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
title Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
title_full Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
title_fullStr Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
title_full_unstemmed Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
title_short Collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
title_sort collectives of diagnostic biomarkers identify high-risk subpopulations of hematuria patients: exploiting heterogeneity in large-scale biomarker data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3570289/
https://www.ncbi.nlm.nih.gov/pubmed/23327460
http://dx.doi.org/10.1186/1741-7015-11-12
work_keys_str_mv AT emmertstreibfrank collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT abogunrinfunso collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT dematossimoesricardo collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT dugganbrian collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT ruddockmarkw collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT reidcherithn collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT roddyowen collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT whitelisa collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT okanehughf collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT orourkedeclan collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT andersonneilh collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT nambirajanthiagarajan collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata
AT williamsonkatee collectivesofdiagnosticbiomarkersidentifyhighrisksubpopulationsofhematuriapatientsexploitingheterogeneityinlargescalebiomarkerdata