Cargando…

Improve the model of disease subtype heterogeneity by leveraging external summary data

Researchers are often interested in understanding the disease subtype heterogeneity by testing whether a risk exposure has the same level of effect on different disease subtypes. The polytomous logistic regression (PLR) model provides a flexible tool for such an evaluation. Disease subtype heterogen...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Sheng, Purdue, Mark P., Zhang, Han, Qin, Jing, Song, Lei, Berndt, Sonja I., Yu, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337985/
https://www.ncbi.nlm.nih.gov/pubmed/37437002
http://dx.doi.org/10.1371/journal.pcbi.1011236
_version_ 1785071535754575872
author Fu, Sheng
Purdue, Mark P.
Zhang, Han
Qin, Jing
Song, Lei
Berndt, Sonja I.
Yu, Kai
author_facet Fu, Sheng
Purdue, Mark P.
Zhang, Han
Qin, Jing
Song, Lei
Berndt, Sonja I.
Yu, Kai
author_sort Fu, Sheng
collection PubMed
description Researchers are often interested in understanding the disease subtype heterogeneity by testing whether a risk exposure has the same level of effect on different disease subtypes. The polytomous logistic regression (PLR) model provides a flexible tool for such an evaluation. Disease subtype heterogeneity can also be investigated with a case-only study that uses a case-case comparison procedure to directly assess the difference between risk effects on two disease subtypes. Motivated by a large consortium project on the genetic basis of non-Hodgkin lymphoma (NHL) subtypes, we develop PolyGIM, a procedure to fit the PLR model by integrating individual-level data with summary data extracted from multiple studies under different designs. The summary data consist of coefficient estimates from working logistic regression models established by external studies. Examples of the working model include the case-case comparison model and the case-control comparison model, which compares the control group with a subtype group or a broad disease group formed by merging several subtypes. PolyGIM efficiently evaluates risk effects and provides a powerful test for disease subtype heterogeneity in situations when only summary data, instead of individual-level data, is available from external studies due to various informatics and privacy constraints. We investigate the theoretic properties of PolyGIM and use simulation studies to demonstrate its advantages. Using data from eight genome-wide association studies within the NHL consortium, we apply it to study the effect of the polygenic risk score defined by a lymphoid malignancy on the risks of four NHL subtypes. These results show that PolyGIM can be a valuable tool for pooling data from multiple sources for a more coherent evaluation of disease subtype heterogeneity.
format Online
Article
Text
id pubmed-10337985
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103379852023-07-13 Improve the model of disease subtype heterogeneity by leveraging external summary data Fu, Sheng Purdue, Mark P. Zhang, Han Qin, Jing Song, Lei Berndt, Sonja I. Yu, Kai PLoS Comput Biol Research Article Researchers are often interested in understanding the disease subtype heterogeneity by testing whether a risk exposure has the same level of effect on different disease subtypes. The polytomous logistic regression (PLR) model provides a flexible tool for such an evaluation. Disease subtype heterogeneity can also be investigated with a case-only study that uses a case-case comparison procedure to directly assess the difference between risk effects on two disease subtypes. Motivated by a large consortium project on the genetic basis of non-Hodgkin lymphoma (NHL) subtypes, we develop PolyGIM, a procedure to fit the PLR model by integrating individual-level data with summary data extracted from multiple studies under different designs. The summary data consist of coefficient estimates from working logistic regression models established by external studies. Examples of the working model include the case-case comparison model and the case-control comparison model, which compares the control group with a subtype group or a broad disease group formed by merging several subtypes. PolyGIM efficiently evaluates risk effects and provides a powerful test for disease subtype heterogeneity in situations when only summary data, instead of individual-level data, is available from external studies due to various informatics and privacy constraints. We investigate the theoretic properties of PolyGIM and use simulation studies to demonstrate its advantages. Using data from eight genome-wide association studies within the NHL consortium, we apply it to study the effect of the polygenic risk score defined by a lymphoid malignancy on the risks of four NHL subtypes. These results show that PolyGIM can be a valuable tool for pooling data from multiple sources for a more coherent evaluation of disease subtype heterogeneity. Public Library of Science 2023-07-12 /pmc/articles/PMC10337985/ /pubmed/37437002 http://dx.doi.org/10.1371/journal.pcbi.1011236 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Fu, Sheng
Purdue, Mark P.
Zhang, Han
Qin, Jing
Song, Lei
Berndt, Sonja I.
Yu, Kai
Improve the model of disease subtype heterogeneity by leveraging external summary data
title Improve the model of disease subtype heterogeneity by leveraging external summary data
title_full Improve the model of disease subtype heterogeneity by leveraging external summary data
title_fullStr Improve the model of disease subtype heterogeneity by leveraging external summary data
title_full_unstemmed Improve the model of disease subtype heterogeneity by leveraging external summary data
title_short Improve the model of disease subtype heterogeneity by leveraging external summary data
title_sort improve the model of disease subtype heterogeneity by leveraging external summary data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337985/
https://www.ncbi.nlm.nih.gov/pubmed/37437002
http://dx.doi.org/10.1371/journal.pcbi.1011236
work_keys_str_mv AT fusheng improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata
AT purduemarkp improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata
AT zhanghan improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata
AT qinjing improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata
AT songlei improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata
AT berndtsonjai improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata
AT yukai improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata