Cargando…

Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits

In modern cancer epidemiology, diseases are classified based on pathologic and molecular traits, and different combinations of these traits give rise to many disease subtypes. The effect of predictor variables can be measured by fitting a polytomous logistic model to such data. The differences (hete...

Descripción completa

Detalles Bibliográficos
Autores principales: Miao, Jingang, Sinha, Samiran, Wang, Suojin, Diver, W Ryan, Gapstur, Susan M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4270282/
https://www.ncbi.nlm.nih.gov/pubmed/25530913
http://dx.doi.org/10.4172/2155-6180.1000197
_version_ 1782349467129741312
author Miao, Jingang
Sinha, Samiran
Wang, Suojin
Diver, W Ryan
Gapstur, Susan M
author_facet Miao, Jingang
Sinha, Samiran
Wang, Suojin
Diver, W Ryan
Gapstur, Susan M
author_sort Miao, Jingang
collection PubMed
description In modern cancer epidemiology, diseases are classified based on pathologic and molecular traits, and different combinations of these traits give rise to many disease subtypes. The effect of predictor variables can be measured by fitting a polytomous logistic model to such data. The differences (heterogeneity) among the relative risk parameters associated with subtypes are of great interest to better understand disease etiology. Due to the heterogeneity of the relative risk parameters, when a risk factor is changed, the prevalence of one subtype may change more than that of another subtype does. Estimation of the heterogeneity parameters is difficult when disease trait information is only partially observed and the number of disease subtypes is large. We consider a robust semiparametric approach based on the pseudo-conditional likelihood for estimating these heterogeneity parameters. Through simulation studies, we compare the robustness and efficiency of our approach with that of the maximum likelihood approach. The method is then applied to analyze the associations of weight gain with risk of breast cancer subtypes using data from the American Cancer Society Cancer Prevention Study II Nutrition Cohort.
format Online
Article
Text
id pubmed-4270282
institution National Center for Biotechnology Information
language English
publishDate 2014
record_format MEDLINE/PubMed
spelling pubmed-42702822014-12-18 Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits Miao, Jingang Sinha, Samiran Wang, Suojin Diver, W Ryan Gapstur, Susan M J Biom Biostat Article In modern cancer epidemiology, diseases are classified based on pathologic and molecular traits, and different combinations of these traits give rise to many disease subtypes. The effect of predictor variables can be measured by fitting a polytomous logistic model to such data. The differences (heterogeneity) among the relative risk parameters associated with subtypes are of great interest to better understand disease etiology. Due to the heterogeneity of the relative risk parameters, when a risk factor is changed, the prevalence of one subtype may change more than that of another subtype does. Estimation of the heterogeneity parameters is difficult when disease trait information is only partially observed and the number of disease subtypes is large. We consider a robust semiparametric approach based on the pseudo-conditional likelihood for estimating these heterogeneity parameters. Through simulation studies, we compare the robustness and efficiency of our approach with that of the maximum likelihood approach. The method is then applied to analyze the associations of weight gain with risk of breast cancer subtypes using data from the American Cancer Society Cancer Prevention Study II Nutrition Cohort. 2014-05-13 2014 /pmc/articles/PMC4270282/ /pubmed/25530913 http://dx.doi.org/10.4172/2155-6180.1000197 Text en Copyright: © 2014 Miao J, et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Article
Miao, Jingang
Sinha, Samiran
Wang, Suojin
Diver, W Ryan
Gapstur, Susan M
Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
title Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
title_full Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
title_fullStr Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
title_full_unstemmed Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
title_short Analysis of Multivariate Disease Classification Data in the Presence of Partially Missing Disease Traits
title_sort analysis of multivariate disease classification data in the presence of partially missing disease traits
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4270282/
https://www.ncbi.nlm.nih.gov/pubmed/25530913
http://dx.doi.org/10.4172/2155-6180.1000197
work_keys_str_mv AT miaojingang analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits
AT sinhasamiran analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits
AT wangsuojin analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits
AT diverwryan analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits
AT gapstursusanm analysisofmultivariatediseaseclassificationdatainthepresenceofpartiallymissingdiseasetraits