
A new statistical method for curve group analysis of longitudinal gene expression data illustrated for breast cancer in the NOWAC postgenome cohort as a proof of principle

BACKGROUND: The understanding of changes in temporal processes related to human carcinogenesis is limited. One approach for prospective functional genomic studies is to compile trajectories of differential expression of genes, based on measurements from many case-control pairs. We propose a new stat...


Bibliographic Details
Main authors: Lund, Eiliv, Holden, Lars, Bøvelstad, Hege, Plancade, Sandra, Mode, Nicolle, Günther, Clara-Cecilie, Nuel, Gregory, Thalabard, Jean-Christophe, Holden, Marit
Format: Online Article Text
Language: English
Published: BioMed Central 2016
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4779232/
https://www.ncbi.nlm.nih.gov/pubmed/26944545
http://dx.doi.org/10.1186/s12874-016-0129-z
author Lund, Eiliv
Holden, Lars
Bøvelstad, Hege
Plancade, Sandra
Mode, Nicolle
Günther, Clara-Cecilie
Nuel, Gregory
Thalabard, Jean-Christophe
Holden, Marit
collection PubMed
description BACKGROUND: The understanding of changes in temporal processes related to human carcinogenesis is limited. One approach for prospective functional genomic studies is to compile trajectories of differential expression of genes, based on measurements from many case-control pairs. We propose a new statistical method that does not assume any parametric shape for the gene trajectories. METHODS: The trajectory of a gene is defined as the curve representing the changes in gene expression levels in the blood as a function of time to cancer diagnosis. In a nested case-control design it consists of differences in gene expression levels between cases and controls. Genes can be grouped into curve groups, each curve group corresponding to genes with a similar development over time. The proposed new statistical approach is based on a set of hypothesis tests that can determine whether or not there is development in gene expression levels over time, and whether this development varies among different strata. Curve group analysis may reveal significant differences in gene expression levels over time among the different strata considered. This new method was applied as a “proof of concept” to breast cancer in the Norwegian Women and Cancer (NOWAC) postgenome cohort, using blood samples collected prospectively that were specifically preserved for transcriptomic analyses (PAX tube). Cohort members diagnosed with invasive breast cancer through 2009 were identified through linkage to the Cancer Registry of Norway, and for each case a random control from the postgenome cohort was also selected, matched by birth year and time of blood sampling, to create a case-control pair. After exclusions, 441 case-control pairs were available for analyses, in which we considered strata of lymph node status at time of diagnosis and time of diagnosis with respect to breast cancer screening visits.
RESULTS: The development of gene expression levels in the NOWAC postgenome cohort varied in the last years before breast cancer diagnosis, and this development differed by lymph node status and participation in the Norwegian Breast Cancer Screening Program. The differences among the investigated strata appeared larger in the year before breast cancer diagnosis than in earlier years. CONCLUSIONS: This approach shows good properties in terms of statistical power and type 1 error under minimal assumptions. When applied to a real data set it was able to discriminate between groups of genes with similar non-linear patterns before diagnosis.
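The idea of testing whether case-control differences develop over time can be illustrated with a minimal sketch. This is not the authors' procedure: it assumes a single gene, splits the pairs into "last year before diagnosis" versus "earlier", and swaps in a plain label-permutation test, which likewise assumes no parametric shape. All function names and the toy values are hypothetical.

```python
import random
from statistics import mean


def paired_differences(case_expr, control_expr):
    """Case minus control expression level for each matched pair."""
    return [case - control for case, control in zip(case_expr, control_expr)]


def period_gap(diffs, in_last_year):
    """Absolute gap between the mean difference in the last year and earlier."""
    late = [d for d, flag in zip(diffs, in_last_year) if flag]
    early = [d for d, flag in zip(diffs, in_last_year) if not flag]
    return abs(mean(late) - mean(early))


def permutation_p_value(diffs, in_last_year, n_perm=2000, seed=0):
    """Permutation p-value for 'no change in the difference between periods'.

    Assumption: period labels are exchangeable across pairs under the null.
    """
    rng = random.Random(seed)
    observed = period_gap(diffs, in_last_year)
    labels = list(in_last_year)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # reassign pairs to periods at random
        if period_gap(diffs, labels) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy data (hypothetical): 6 pairs sampled in the last year, 6 earlier.
    cases = [3.0] * 6 + [2.0] * 6
    controls = [2.0] * 12
    flags = [True] * 6 + [False] * 6
    diffs = paired_differences(cases, controls)
    print(permutation_p_value(diffs, flags))
```

A small p-value here only says that the mean case-control difference for this one gene differs between the two periods; grouping many genes into curve groups and comparing strata, as the abstract describes, would need additional steps not shown.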
format Online
Article
Text
id pubmed-4779232
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4779232 2016-03-06 BMC Med Res Methodol Research Article BioMed Central 2016-03-05 /pmc/articles/PMC4779232/ /pubmed/26944545 http://dx.doi.org/10.1186/s12874-016-0129-z Text en © Lund et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title A new statistical method for curve group analysis of longitudinal gene expression data illustrated for breast cancer in the NOWAC postgenome cohort as a proof of principle
topic Research Article