Cargando…
Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis
BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contra...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1208892/ https://www.ncbi.nlm.nih.gov/pubmed/16111503 http://dx.doi.org/10.1186/1472-6947-5-29 |
_version_ | 1782124926229020672 |
---|---|
author | Razavi, Amir R Gill, Hans Stål, Olle Sundquist, Marie Thorstenson, Sten Åhlfeldt, Hans Shahsavar, Nosrat |
author_facet | Razavi, Amir R Gill, Hans Stål, Olle Sundquist, Marie Thorstenson, Sten Åhlfeldt, Hans Shahsavar, Nosrat |
author_sort | Razavi, Amir R |
collection | PubMed |
description | BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time. One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model. METHODS: Data for 637 malignant breast cancer patients admitted in the south-east region of Sweden were analyzed. By using CCA and looking at the structure coefficients (loadings), relationships between tumor specifications and the two outcomes during different time intervals were analyzed and a correlation model was built. RESULTS: The analysis successfully detected known predictors for breast cancer recurrence during the first two years and distant metastasis 2–4 years after diagnosis. Nottingham Histologic Grading (NHG) was the most important predictor, while age of the patient at the time of diagnosis was not an important predictor. CONCLUSION: In cancer registers with high dimensionality, CCA can be used for identifying the importance of risk factors for breast cancer recurrence. This technique can result in a model ready for further processing by data mining methods through reducing the number of variables to important ones. |
format | Text |
id | pubmed-1208892 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-12088922005-09-15 Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis Razavi, Amir R Gill, Hans Stål, Olle Sundquist, Marie Thorstenson, Sten Åhlfeldt, Hans Shahsavar, Nosrat BMC Med Inform Decis Mak Research Article BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time. One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model. METHODS: Data for 637 malignant breast cancer patients admitted in the south-east region of Sweden were analyzed. By using CCA and looking at the structure coefficients (loadings), relationships between tumor specifications and the two outcomes during different time intervals were analyzed and a correlation model was built. RESULTS: The analysis successfully detected known predictors for breast cancer recurrence during the first two years and distant metastasis 2–4 years after diagnosis. Nottingham Histologic Grading (NHG) was the most important predictor, while age of the patient at the time of diagnosis was not an important predictor. CONCLUSION: In cancer registers with high dimensionality, CCA can be used for identifying the importance of risk factors for breast cancer recurrence. This technique can result in a model ready for further processing by data mining methods through reducing the number of variables to important ones. BioMed Central 2005-08-22 /pmc/articles/PMC1208892/ /pubmed/16111503 http://dx.doi.org/10.1186/1472-6947-5-29 Text en Copyright © 2005 Razavi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Razavi, Amir R Gill, Hans Stål, Olle Sundquist, Marie Thorstenson, Sten Åhlfeldt, Hans Shahsavar, Nosrat Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis |
title | Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis |
title_full | Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis |
title_fullStr | Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis |
title_full_unstemmed | Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis |
title_short | Exploring cancer register data to find risk factors for recurrence of breast cancer – application of Canonical Correlation Analysis |
title_sort | exploring cancer register data to find risk factors for recurrence of breast cancer – application of canonical correlation analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1208892/ https://www.ncbi.nlm.nih.gov/pubmed/16111503 http://dx.doi.org/10.1186/1472-6947-5-29 |
work_keys_str_mv | AT razaviamirr exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT gillhans exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT stalolle exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT sundquistmarie exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT thorstensonsten exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT ahlfeldthans exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT shahsavarnosrat exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis AT exploringcancerregisterdatatofindriskfactorsforrecurrenceofbreastcancerapplicationofcanonicalcorrelationanalysis |