Cargando…

Power in pairs: assessing the statistical value of paired samples in tests for differential expression

BACKGROUND: When genomics researchers design a high-throughput study to test for differential expression, some biological systems and research questions provide opportunities to use paired samples from subjects, and researchers can plan for a certain proportion of subjects to have paired samples. We...

Descripción completa

Detalles Bibliográficos
Autores principales: Stevens, John R., Herrick, Jennifer S., Wolff, Roger K., Slattery, Martha L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302489/
https://www.ncbi.nlm.nih.gov/pubmed/30572829
http://dx.doi.org/10.1186/s12864-018-5236-2
_version_ 1783381991518896128
author Stevens, John R.
Herrick, Jennifer S.
Wolff, Roger K.
Slattery, Martha L.
author_facet Stevens, John R.
Herrick, Jennifer S.
Wolff, Roger K.
Slattery, Martha L.
author_sort Stevens, John R.
collection PubMed
description BACKGROUND: When genomics researchers design a high-throughput study to test for differential expression, some biological systems and research questions provide opportunities to use paired samples from subjects, and researchers can plan for a certain proportion of subjects to have paired samples. We consider the effect of this paired samples proportion on the statistical power of the study, using characteristics of both count (RNA-Seq) and continuous (microarray) expression data from a colorectal cancer study. RESULTS: We demonstrate that a higher proportion of subjects with paired samples yields higher statistical power, for various total numbers of samples, and for various strengths of subject-level confounding factors. In the design scenarios considered, the statistical power in a fully-paired design is substantially (and in many cases several times) greater than in an unpaired design. CONCLUSIONS: For the many biological systems and research questions where paired samples are feasible and relevant, substantial statistical power gains can be achieved at the study design stage when genomics researchers plan on using paired samples from the largest possible proportion of subjects. Any cost savings in a study design with unpaired samples are likely accompanied by underpowered and possibly biased results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5236-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6302489
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63024892018-12-31 Power in pairs: assessing the statistical value of paired samples in tests for differential expression Stevens, John R. Herrick, Jennifer S. Wolff, Roger K. Slattery, Martha L. BMC Genomics Methodology Article BACKGROUND: When genomics researchers design a high-throughput study to test for differential expression, some biological systems and research questions provide opportunities to use paired samples from subjects, and researchers can plan for a certain proportion of subjects to have paired samples. We consider the effect of this paired samples proportion on the statistical power of the study, using characteristics of both count (RNA-Seq) and continuous (microarray) expression data from a colorectal cancer study. RESULTS: We demonstrate that a higher proportion of subjects with paired samples yields higher statistical power, for various total numbers of samples, and for various strengths of subject-level confounding factors. In the design scenarios considered, the statistical power in a fully-paired design is substantially (and in many cases several times) greater than in an unpaired design. CONCLUSIONS: For the many biological systems and research questions where paired samples are feasible and relevant, substantial statistical power gains can be achieved at the study design stage when genomics researchers plan on using paired samples from the largest possible proportion of subjects. Any cost savings in a study design with unpaired samples are likely accompanied by underpowered and possibly biased results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5236-2) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-20 /pmc/articles/PMC6302489/ /pubmed/30572829 http://dx.doi.org/10.1186/s12864-018-5236-2 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Stevens, John R.
Herrick, Jennifer S.
Wolff, Roger K.
Slattery, Martha L.
Power in pairs: assessing the statistical value of paired samples in tests for differential expression
title Power in pairs: assessing the statistical value of paired samples in tests for differential expression
title_full Power in pairs: assessing the statistical value of paired samples in tests for differential expression
title_fullStr Power in pairs: assessing the statistical value of paired samples in tests for differential expression
title_full_unstemmed Power in pairs: assessing the statistical value of paired samples in tests for differential expression
title_short Power in pairs: assessing the statistical value of paired samples in tests for differential expression
title_sort power in pairs: assessing the statistical value of paired samples in tests for differential expression
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302489/
https://www.ncbi.nlm.nih.gov/pubmed/30572829
http://dx.doi.org/10.1186/s12864-018-5236-2
work_keys_str_mv AT stevensjohnr powerinpairsassessingthestatisticalvalueofpairedsamplesintestsfordifferentialexpression
AT herrickjennifers powerinpairsassessingthestatisticalvalueofpairedsamplesintestsfordifferentialexpression
AT wolffrogerk powerinpairsassessingthestatisticalvalueofpairedsamplesintestsfordifferentialexpression
AT slatterymarthal powerinpairsassessingthestatisticalvalueofpairedsamplesintestsfordifferentialexpression