Cargando…
Improve the model of disease subtype heterogeneity by leveraging external summary data
Researchers are often interested in understanding the disease subtype heterogeneity by testing whether a risk exposure has the same level of effect on different disease subtypes. The polytomous logistic regression (PLR) model provides a flexible tool for such an evaluation. Disease subtype heterogen...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337985/ https://www.ncbi.nlm.nih.gov/pubmed/37437002 http://dx.doi.org/10.1371/journal.pcbi.1011236 |
_version_ | 1785071535754575872 |
---|---|
author | Fu, Sheng Purdue, Mark P. Zhang, Han Qin, Jing Song, Lei Berndt, Sonja I. Yu, Kai |
author_facet | Fu, Sheng Purdue, Mark P. Zhang, Han Qin, Jing Song, Lei Berndt, Sonja I. Yu, Kai |
author_sort | Fu, Sheng |
collection | PubMed |
description | Researchers are often interested in understanding the disease subtype heterogeneity by testing whether a risk exposure has the same level of effect on different disease subtypes. The polytomous logistic regression (PLR) model provides a flexible tool for such an evaluation. Disease subtype heterogeneity can also be investigated with a case-only study that uses a case-case comparison procedure to directly assess the difference between risk effects on two disease subtypes. Motivated by a large consortium project on the genetic basis of non-Hodgkin lymphoma (NHL) subtypes, we develop PolyGIM, a procedure to fit the PLR model by integrating individual-level data with summary data extracted from multiple studies under different designs. The summary data consist of coefficient estimates from working logistic regression models established by external studies. Examples of the working model include the case-case comparison model and the case-control comparison model, which compares the control group with a subtype group or a broad disease group formed by merging several subtypes. PolyGIM efficiently evaluates risk effects and provides a powerful test for disease subtype heterogeneity in situations when only summary data, instead of individual-level data, is available from external studies due to various informatics and privacy constraints. We investigate the theoretic properties of PolyGIM and use simulation studies to demonstrate its advantages. Using data from eight genome-wide association studies within the NHL consortium, we apply it to study the effect of the polygenic risk score defined by a lymphoid malignancy on the risks of four NHL subtypes. These results show that PolyGIM can be a valuable tool for pooling data from multiple sources for a more coherent evaluation of disease subtype heterogeneity. |
format | Online Article Text |
id | pubmed-10337985 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-103379852023-07-13 Improve the model of disease subtype heterogeneity by leveraging external summary data Fu, Sheng Purdue, Mark P. Zhang, Han Qin, Jing Song, Lei Berndt, Sonja I. Yu, Kai PLoS Comput Biol Research Article Researchers are often interested in understanding the disease subtype heterogeneity by testing whether a risk exposure has the same level of effect on different disease subtypes. The polytomous logistic regression (PLR) model provides a flexible tool for such an evaluation. Disease subtype heterogeneity can also be investigated with a case-only study that uses a case-case comparison procedure to directly assess the difference between risk effects on two disease subtypes. Motivated by a large consortium project on the genetic basis of non-Hodgkin lymphoma (NHL) subtypes, we develop PolyGIM, a procedure to fit the PLR model by integrating individual-level data with summary data extracted from multiple studies under different designs. The summary data consist of coefficient estimates from working logistic regression models established by external studies. Examples of the working model include the case-case comparison model and the case-control comparison model, which compares the control group with a subtype group or a broad disease group formed by merging several subtypes. PolyGIM efficiently evaluates risk effects and provides a powerful test for disease subtype heterogeneity in situations when only summary data, instead of individual-level data, is available from external studies due to various informatics and privacy constraints. We investigate the theoretic properties of PolyGIM and use simulation studies to demonstrate its advantages. Using data from eight genome-wide association studies within the NHL consortium, we apply it to study the effect of the polygenic risk score defined by a lymphoid malignancy on the risks of four NHL subtypes. These results show that PolyGIM can be a valuable tool for pooling data from multiple sources for a more coherent evaluation of disease subtype heterogeneity. Public Library of Science 2023-07-12 /pmc/articles/PMC10337985/ /pubmed/37437002 http://dx.doi.org/10.1371/journal.pcbi.1011236 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Fu, Sheng Purdue, Mark P. Zhang, Han Qin, Jing Song, Lei Berndt, Sonja I. Yu, Kai Improve the model of disease subtype heterogeneity by leveraging external summary data |
title | Improve the model of disease subtype heterogeneity by leveraging external summary data |
title_full | Improve the model of disease subtype heterogeneity by leveraging external summary data |
title_fullStr | Improve the model of disease subtype heterogeneity by leveraging external summary data |
title_full_unstemmed | Improve the model of disease subtype heterogeneity by leveraging external summary data |
title_short | Improve the model of disease subtype heterogeneity by leveraging external summary data |
title_sort | improve the model of disease subtype heterogeneity by leveraging external summary data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337985/ https://www.ncbi.nlm.nih.gov/pubmed/37437002 http://dx.doi.org/10.1371/journal.pcbi.1011236 |
work_keys_str_mv | AT fusheng improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata AT purduemarkp improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata AT zhanghan improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata AT qinjing improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata AT songlei improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata AT berndtsonjai improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata AT yukai improvethemodelofdiseasesubtypeheterogeneitybyleveragingexternalsummarydata |