Cargando…

The effect of sample size on polygenic hazard models for prostate cancer

We determined the effect of sample size on performance of polygenic hazard score (PHS) models in prostate cancer. Age and genotypes were obtained for 40,861 men from the PRACTICAL consortium. The dataset included 201,590 SNPs per subject, and was split into training and testing sets. Established-SNP...

Descripción completa

Detalles Bibliográficos
Autores principales: Karunamuni, Roshan A., Huynh-Le, Minh-Phuong, Fan, Chun C., Eeles, Rosalind A., Easton, Douglas F., Kote-Jarai, ZSofia, Amin Al Olama, Ali, Benlloch Garcia, Sara, Muir, Kenneth, Gronberg, Henrik, Wiklund, Fredrik, Aly, Markus, Schleutker, Johanna, Sipeky, Csilla, Tammela, Teuvo L. J., Nordestgaard, Børge G., Key, Tim J., Travis, Ruth C., Neal, David E., Donovan, Jenny L., Hamdy, Freddie C., Pharoah, Paul, Pashayan, Nora, Khaw, Kay-Tee, Thibodeau, Stephen N., McDonnell, Shannon K., Schaid, Daniel J., Maier, Christiane, Vogel, Walther, Luedeke, Manuel, Herkommer, Kathleen, Kibel, Adam S., Cybulski, Cezary, Wokolorczyk, Dominika, Kluzniak, Wojciech, Cannon-Albright, Lisa, Brenner, Hermann, Schöttker, Ben, Holleczek, Bernd, Park, Jong Y., Sellers, Thomas A., Lin, Hui-Yi, Slavov, Chavdar, Kaneva, Radka, Mitev, Vanio, Batra, Jyotsna, Clements, Judith A., Spurdle, Amanda, Teixeira, Manuel R., Paulo, Paula, Maia, Sofia, Pandha, Hardev, Michael, Agnieszka, Mills, Ian G., Andreassen, Ole A., Dale, Anders M., Seibert, Tyler M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608255/
https://www.ncbi.nlm.nih.gov/pubmed/32514134
http://dx.doi.org/10.1038/s41431-020-0664-2
_version_ 1783604798021435392
author Karunamuni, Roshan A.
Huynh-Le, Minh-Phuong
Fan, Chun C.
Eeles, Rosalind A.
Easton, Douglas F.
Kote-Jarai, ZSofia
Amin Al Olama, Ali
Benlloch Garcia, Sara
Muir, Kenneth
Gronberg, Henrik
Wiklund, Fredrik
Aly, Markus
Schleutker, Johanna
Sipeky, Csilla
Tammela, Teuvo L. J.
Nordestgaard, Børge G.
Key, Tim J.
Travis, Ruth C.
Neal, David E.
Donovan, Jenny L.
Hamdy, Freddie C.
Pharoah, Paul
Pashayan, Nora
Khaw, Kay-Tee
Thibodeau, Stephen N.
McDonnell, Shannon K.
Schaid, Daniel J.
Maier, Christiane
Vogel, Walther
Luedeke, Manuel
Herkommer, Kathleen
Kibel, Adam S.
Cybulski, Cezary
Wokolorczyk, Dominika
Kluzniak, Wojciech
Cannon-Albright, Lisa
Brenner, Hermann
Schöttker, Ben
Holleczek, Bernd
Park, Jong Y.
Sellers, Thomas A.
Lin, Hui-Yi
Slavov, Chavdar
Kaneva, Radka
Mitev, Vanio
Batra, Jyotsna
Clements, Judith A.
Spurdle, Amanda
Teixeira, Manuel R.
Paulo, Paula
Maia, Sofia
Pandha, Hardev
Michael, Agnieszka
Mills, Ian G.
Andreassen, Ole A.
Dale, Anders M.
Seibert, Tyler M.
author_facet Karunamuni, Roshan A.
Huynh-Le, Minh-Phuong
Fan, Chun C.
Eeles, Rosalind A.
Easton, Douglas F.
Kote-Jarai, ZSofia
Amin Al Olama, Ali
Benlloch Garcia, Sara
Muir, Kenneth
Gronberg, Henrik
Wiklund, Fredrik
Aly, Markus
Schleutker, Johanna
Sipeky, Csilla
Tammela, Teuvo L. J.
Nordestgaard, Børge G.
Key, Tim J.
Travis, Ruth C.
Neal, David E.
Donovan, Jenny L.
Hamdy, Freddie C.
Pharoah, Paul
Pashayan, Nora
Khaw, Kay-Tee
Thibodeau, Stephen N.
McDonnell, Shannon K.
Schaid, Daniel J.
Maier, Christiane
Vogel, Walther
Luedeke, Manuel
Herkommer, Kathleen
Kibel, Adam S.
Cybulski, Cezary
Wokolorczyk, Dominika
Kluzniak, Wojciech
Cannon-Albright, Lisa
Brenner, Hermann
Schöttker, Ben
Holleczek, Bernd
Park, Jong Y.
Sellers, Thomas A.
Lin, Hui-Yi
Slavov, Chavdar
Kaneva, Radka
Mitev, Vanio
Batra, Jyotsna
Clements, Judith A.
Spurdle, Amanda
Teixeira, Manuel R.
Paulo, Paula
Maia, Sofia
Pandha, Hardev
Michael, Agnieszka
Mills, Ian G.
Andreassen, Ole A.
Dale, Anders M.
Seibert, Tyler M.
author_sort Karunamuni, Roshan A.
collection PubMed
description We determined the effect of sample size on performance of polygenic hazard score (PHS) models in prostate cancer. Age and genotypes were obtained for 40,861 men from the PRACTICAL consortium. The dataset included 201,590 SNPs per subject, and was split into training and testing sets. Established-SNP models considered 65 SNPs that had been previously associated with prostate cancer. Discovery-SNP models used stepwise selection to identify new SNPs. The performance of each PHS model was calculated for random sizes of the training set. The performance of a representative Established-SNP model was estimated for random sizes of the testing set. Mean HR(98/50) (hazard ratio of top 2% to average in test set) of the Established-SNP model increased from 1.73 [95% CI: 1.69–1.77] to 2.41 [2.40–2.43] when the number of training samples was increased from 1 thousand to 30 thousand. Corresponding HR(98/50) of the Discovery-SNP model increased from 1.05 [0.93–1.18] to 2.19 [2.16–2.23]. HR(98/50) of a representative Established-SNP model using testing set sample sizes of 0.6 thousand and 6 thousand observations were 1.78 [1.70–1.85] and 1.73 [1.71–1.76], respectively. We estimate that a study population of 20 thousand men is required to develop Discovery-SNP PHS models while 10 thousand men should be sufficient for Established-SNP models.
format Online
Article
Text
id pubmed-7608255
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-76082552020-11-05 The effect of sample size on polygenic hazard models for prostate cancer Karunamuni, Roshan A. Huynh-Le, Minh-Phuong Fan, Chun C. Eeles, Rosalind A. Easton, Douglas F. Kote-Jarai, ZSofia Amin Al Olama, Ali Benlloch Garcia, Sara Muir, Kenneth Gronberg, Henrik Wiklund, Fredrik Aly, Markus Schleutker, Johanna Sipeky, Csilla Tammela, Teuvo L. J. Nordestgaard, Børge G. Key, Tim J. Travis, Ruth C. Neal, David E. Donovan, Jenny L. Hamdy, Freddie C. Pharoah, Paul Pashayan, Nora Khaw, Kay-Tee Thibodeau, Stephen N. McDonnell, Shannon K. Schaid, Daniel J. Maier, Christiane Vogel, Walther Luedeke, Manuel Herkommer, Kathleen Kibel, Adam S. Cybulski, Cezary Wokolorczyk, Dominika Kluzniak, Wojciech Cannon-Albright, Lisa Brenner, Hermann Schöttker, Ben Holleczek, Bernd Park, Jong Y. Sellers, Thomas A. Lin, Hui-Yi Slavov, Chavdar Kaneva, Radka Mitev, Vanio Batra, Jyotsna Clements, Judith A. Spurdle, Amanda Teixeira, Manuel R. Paulo, Paula Maia, Sofia Pandha, Hardev Michael, Agnieszka Mills, Ian G. Andreassen, Ole A. Dale, Anders M. Seibert, Tyler M. Eur J Hum Genet Article We determined the effect of sample size on performance of polygenic hazard score (PHS) models in prostate cancer. Age and genotypes were obtained for 40,861 men from the PRACTICAL consortium. The dataset included 201,590 SNPs per subject, and was split into training and testing sets. Established-SNP models considered 65 SNPs that had been previously associated with prostate cancer. Discovery-SNP models used stepwise selection to identify new SNPs. The performance of each PHS model was calculated for random sizes of the training set. The performance of a representative Established-SNP model was estimated for random sizes of the testing set. Mean HR(98/50) (hazard ratio of top 2% to average in test set) of the Established-SNP model increased from 1.73 [95% CI: 1.69–1.77] to 2.41 [2.40–2.43] when the number of training samples was increased from 1 thousand to 30 thousand. Corresponding HR(98/50) of the Discovery-SNP model increased from 1.05 [0.93–1.18] to 2.19 [2.16–2.23]. HR(98/50) of a representative Established-SNP model using testing set sample sizes of 0.6 thousand and 6 thousand observations were 1.78 [1.70–1.85] and 1.73 [1.71–1.76], respectively. We estimate that a study population of 20 thousand men is required to develop Discovery-SNP PHS models while 10 thousand men should be sufficient for Established-SNP models. Springer International Publishing 2020-06-08 2020-10 /pmc/articles/PMC7608255/ /pubmed/32514134 http://dx.doi.org/10.1038/s41431-020-0664-2 Text en © The Author(s), under exclusive licence to European Society of Human Genetics 2020
spellingShingle Article
Karunamuni, Roshan A.
Huynh-Le, Minh-Phuong
Fan, Chun C.
Eeles, Rosalind A.
Easton, Douglas F.
Kote-Jarai, ZSofia
Amin Al Olama, Ali
Benlloch Garcia, Sara
Muir, Kenneth
Gronberg, Henrik
Wiklund, Fredrik
Aly, Markus
Schleutker, Johanna
Sipeky, Csilla
Tammela, Teuvo L. J.
Nordestgaard, Børge G.
Key, Tim J.
Travis, Ruth C.
Neal, David E.
Donovan, Jenny L.
Hamdy, Freddie C.
Pharoah, Paul
Pashayan, Nora
Khaw, Kay-Tee
Thibodeau, Stephen N.
McDonnell, Shannon K.
Schaid, Daniel J.
Maier, Christiane
Vogel, Walther
Luedeke, Manuel
Herkommer, Kathleen
Kibel, Adam S.
Cybulski, Cezary
Wokolorczyk, Dominika
Kluzniak, Wojciech
Cannon-Albright, Lisa
Brenner, Hermann
Schöttker, Ben
Holleczek, Bernd
Park, Jong Y.
Sellers, Thomas A.
Lin, Hui-Yi
Slavov, Chavdar
Kaneva, Radka
Mitev, Vanio
Batra, Jyotsna
Clements, Judith A.
Spurdle, Amanda
Teixeira, Manuel R.
Paulo, Paula
Maia, Sofia
Pandha, Hardev
Michael, Agnieszka
Mills, Ian G.
Andreassen, Ole A.
Dale, Anders M.
Seibert, Tyler M.
The effect of sample size on polygenic hazard models for prostate cancer
title The effect of sample size on polygenic hazard models for prostate cancer
title_full The effect of sample size on polygenic hazard models for prostate cancer
title_fullStr The effect of sample size on polygenic hazard models for prostate cancer
title_full_unstemmed The effect of sample size on polygenic hazard models for prostate cancer
title_short The effect of sample size on polygenic hazard models for prostate cancer
title_sort effect of sample size on polygenic hazard models for prostate cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608255/
https://www.ncbi.nlm.nih.gov/pubmed/32514134
http://dx.doi.org/10.1038/s41431-020-0664-2
work_keys_str_mv AT karunamuniroshana theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT huynhleminhphuong theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT fanchunc theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT eelesrosalinda theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT eastondouglasf theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kotejaraizsofia theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT aminalolamaali theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT benllochgarciasara theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT muirkenneth theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT gronberghenrik theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT wiklundfredrik theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT alymarkus theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT schleutkerjohanna theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT sipekycsilla theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT tammelateuvolj theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT nordestgaardbørgeg theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT keytimj theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT travisruthc theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT nealdavide theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT donovanjennyl theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT hamdyfreddiec theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT pharoahpaul theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT pashayannora theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT khawkaytee theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT thibodeaustephenn theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT mcdonnellshannonk theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT schaiddanielj theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT maierchristiane theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT vogelwalther theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT luedekemanuel theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT herkommerkathleen theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kibeladams theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT cybulskicezary theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT wokolorczykdominika theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kluzniakwojciech theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT cannonalbrightlisa theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT brennerhermann theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT schottkerben theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT holleczekbernd theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT parkjongy theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT sellersthomasa theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT linhuiyi theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT slavovchavdar theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kanevaradka theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT mitevvanio theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT batrajyotsna theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT clementsjuditha theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT spurdleamanda theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT teixeiramanuelr theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT paulopaula theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT maiasofia theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT pandhahardev theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT michaelagnieszka theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT millsiang theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT andreassenolea theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT daleandersm theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT seiberttylerm theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT theeffectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT karunamuniroshana effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT huynhleminhphuong effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT fanchunc effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT eelesrosalinda effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT eastondouglasf effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kotejaraizsofia effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT aminalolamaali effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT benllochgarciasara effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT muirkenneth effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT gronberghenrik effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT wiklundfredrik effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT alymarkus effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT schleutkerjohanna effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT sipekycsilla effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT tammelateuvolj effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT nordestgaardbørgeg effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT keytimj effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT travisruthc effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT nealdavide effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT donovanjennyl effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT hamdyfreddiec effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT pharoahpaul effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT pashayannora effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT khawkaytee effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT thibodeaustephenn effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT mcdonnellshannonk effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT schaiddanielj effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT maierchristiane effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT vogelwalther effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT luedekemanuel effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT herkommerkathleen effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kibeladams effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT cybulskicezary effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT wokolorczykdominika effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kluzniakwojciech effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT cannonalbrightlisa effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT brennerhermann effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT schottkerben effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT holleczekbernd effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT parkjongy effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT sellersthomasa effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT linhuiyi effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT slavovchavdar effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT kanevaradka effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT mitevvanio effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT batrajyotsna effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT clementsjuditha effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT spurdleamanda effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT teixeiramanuelr effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT paulopaula effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT maiasofia effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT pandhahardev effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT michaelagnieszka effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT millsiang effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT andreassenolea effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT daleandersm effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT seiberttylerm effectofsamplesizeonpolygenichazardmodelsforprostatecancer
AT effectofsamplesizeonpolygenichazardmodelsforprostatecancer