Cargando…
Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
In modern cancer epidemiology, diseases are classified based on pathologic and molecular traits, and different combinations of these traits give rise to many disease subtypes. The effect of predictor variables can be measured by fitting a polytomous logistic model to such data. The differences (hete...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4270282/ https://www.ncbi.nlm.nih.gov/pubmed/25530913 http://dx.doi.org/10.4172/2155-6180.1000197 |
_version_ | 1782349467129741312 |
---|---|
author | Miao, Jingang Sinha, Samiran Wang, Suojin Diver, W Ryan Gapstur, Susan M |
author_facet | Miao, Jingang Sinha, Samiran Wang, Suojin Diver, W Ryan Gapstur, Susan M |
author_sort | Miao, Jingang |
collection | PubMed |
description | In modern cancer epidemiology, diseases are classified based on pathologic and molecular traits, and different combinations of these traits give rise to many disease subtypes. The effect of predictor variables can be measured by fitting a polytomous logistic model to such data. The differences (heterogeneity) among the relative risk parameters associated with subtypes are of great interest to better understand disease etiology. Due to the heterogeneity of the relative risk parameters, when a risk factor is changed, the prevalence of one subtype may change more than that of another subtype does. Estimation of the heterogeneity parameters is difficult when disease trait information is only partially observed and the number of disease subtypes is large. We consider a robust semiparametric approach based on the pseudo-conditional likelihood for estimating these heterogeneity parameters. Through simulation studies, we compare the robustness and efficiency of our approach with that of the maximum likelihood approach. The method is then applied to analyze the associations of weight gain with risk of breast cancer subtypes using data from the American Cancer Society Cancer Prevention Study II Nutrition Cohort. |
format | Online Article Text |
id | pubmed-4270282 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
record_format | MEDLINE/PubMed |
spelling | pubmed-42702822014-12-18 Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits Miao, Jingang Sinha, Samiran Wang, Suojin Diver, W Ryan Gapstur, Susan M J Biom Biostat Article In modern cancer epidemiology, diseases are classified based on pathologic and molecular traits, and different combinations of these traits give rise to many disease subtypes. The effect of predictor variables can be measured by fitting a polytomous logistic model to such data. The differences (heterogeneity) among the relative risk parameters associated with subtypes are of great interest to better understand disease etiology. Due to the heterogeneity of the relative risk parameters, when a risk factor is changed, the prevalence of one subtype may change more than that of another subtype does. Estimation of the heterogeneity parameters is difficult when disease trait information is only partially observed and the number of disease subtypes is large. We consider a robust semiparametric approach based on the pseudo-conditional likelihood for estimating these heterogeneity parameters. Through simulation studies, we compare the robustness and efficiency of our approach with that of the maximum likelihood approach. The method is then applied to analyze the associations of weight gain with risk of breast cancer subtypes using data from the American Cancer Society Cancer Prevention Study II Nutrition Cohort. 2014-05-13 2014 /pmc/articles/PMC4270282/ /pubmed/25530913 http://dx.doi.org/10.4172/2155-6180.1000197 Text en Copyright: © 2014 Miao J, et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Article Miao, Jingang Sinha, Samiran Wang, Suojin Diver, W Ryan Gapstur, Susan M Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits |
title | Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits |
title_full | Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits |
title_fullStr | Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits |
title_full_unstemmed | Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits |
title_short | Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits |
title_sort | analysis of multivariate disease classification data in the presence of partially missing disease traits |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4270282/ https://www.ncbi.nlm.nih.gov/pubmed/25530913 http://dx.doi.org/10.4172/2155-6180.1000197 |
work_keys_str_mv | AT miaojingang analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits AT sinhasamiran analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits AT wangsuojin analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits AT diverwryan analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits AT gapstursusanm analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits |