Cargando…

Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis

BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contra...

Descripción completa

Detalles Bibliográficos
Autores principales: Razavi, Amir R, Gill, Hans, Stål, Olle, Sundquist, Marie, Thorstenson, Sten, Åhlfeldt, Hans, Shahsavar, Nosrat
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1208892/
https://www.ncbi.nlm.nih.gov/pubmed/16111503
http://dx.doi.org/10.1186/1472-6947-5-29
_version_ 1782124926229020672
author Razavi, Amir R
Gill, Hans
Stål, Olle
Sundquist, Marie
Thorstenson, Sten
Åhlfeldt, Hans
Shahsavar, Nosrat
author_facet Razavi, Amir R
Gill, Hans
Stål, Olle
Sundquist, Marie
Thorstenson, Sten
Åhlfeldt, Hans
Shahsavar, Nosrat
author_sort Razavi, Amir R
collection PubMed
description BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time. One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model. METHODS: Data for 637 malignant breast cancer patients admitted in the south-east region of Sweden were analyzed. By using CCA and looking at the structure coefficients (loadings), relationships between tumor specifications and the two outcomes during different time intervals were analyzed and a correlation model was built. RESULTS: The analysis successfully detected known predictors for breast cancer recurrence during the first two years and distant metastasis 2–4 years after diagnosis. Nottingham Histologic Grading (NHG) was the most important predictor, while age of the patient at the time of diagnosis was not an important predictor. CONCLUSION: In cancer registers with high dimensionality, CCA can be used for identifying the importance of risk factors for breast cancer recurrence. This technique can result in a model ready for further processing by data mining methods through reducing the number of variables to important ones.
format Text
id pubmed-1208892
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12088922005-09-15 Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis Razavi, Amir R Gill, Hans Stål, Olle Sundquist, Marie Thorstenson, Sten Åhlfeldt, Hans Shahsavar, Nosrat BMC Med Inform Decis Mak Research Article BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time. One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model. METHODS: Data for 637 malignant breast cancer patients admitted in the south-east region of Sweden were analyzed. By using CCA and looking at the structure coefficients (loadings), relationships between tumor specifications and the two outcomes during different time intervals were analyzed and a correlation model was built. RESULTS: The analysis successfully detected known predictors for breast cancer recurrence during the first two years and distant metastasis 2–4 years after diagnosis. Nottingham Histologic Grading (NHG) was the most important predictor, while age of the patient at the time of diagnosis was not an important predictor. CONCLUSION: In cancer registers with high dimensionality, CCA can be used for identifying the importance of risk factors for breast cancer recurrence. This technique can result in a model ready for further processing by data mining methods through reducing the number of variables to important ones. BioMed Central 2005-08-22 /pmc/articles/PMC1208892/ /pubmed/16111503 http://dx.doi.org/10.1186/1472-6947-5-29 Text en Copyright © 2005 Razavi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Razavi, Amir R
Gill, Hans
Stål, Olle
Sundquist, Marie
Thorstenson, Sten
Åhlfeldt, Hans
Shahsavar, Nosrat
Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
title Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
title_full Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
title_fullStr Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
title_full_unstemmed Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
title_short Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
title_sort exploring cancer register data to find risk factors for recurrence of breast cancer – application of canonical correlation analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1208892/
https://www.ncbi.nlm.nih.gov/pubmed/16111503
http://dx.doi.org/10.1186/1472-6947-5-29
work_keys_str_mv AT razaviamirr exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT gillhans exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT stalolle exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT sundquistmarie exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT thorstensonsten exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT ahlfeldthans exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT shahsavarnosrat exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis
AT exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis