Cargando…

FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes

High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate samples sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired...

Descripción completa

Detalles Bibliográficos
Autores principales: Ortell, Katherine K, Switonski, Pawel M, Delaney, Joe Ryan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Journal of Biological Methods 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761370/
https://www.ncbi.nlm.nih.gov/pubmed/31583263
http://dx.doi.org/10.14440/jbm.2019.299
_version_ 1783454016705921024
author Ortell, Katherine K
Switonski, Pawel M
Delaney, Joe Ryan
author_facet Ortell, Katherine K
Switonski, Pawel M
Delaney, Joe Ryan
author_sort Ortell, Katherine K
collection PubMed
description High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate samples sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired data that best matches the entirety of the data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large N samples. We showed examples of the tool’s use with fluorescence data and DNA-damage related Comet tail quantitation. This FairSubset tool and the method to retain distribution information at the single-datum level may be considered for standardized use in fair publishing practices.
format Online
Article
Text
id pubmed-6761370
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Journal of Biological Methods
record_format MEDLINE/PubMed
spelling pubmed-67613702019-10-03 FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes Ortell, Katherine K Switonski, Pawel M Delaney, Joe Ryan J Biol Methods Resource High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate samples sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing which subset of originally acquired data that best matches the entirety of the data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large N samples. We showed examples of the tool’s use with fluorescence data and DNA-damage related Comet tail quantitation. This FairSubset tool and the method to retain distribution information at the single-datum level may be considered for standardized use in fair publishing practices. Journal of Biological Methods 2019-09-03 /pmc/articles/PMC6761370/ /pubmed/31583263 http://dx.doi.org/10.14440/jbm.2019.299 Text en © 2013-2019 The Journal of Biological Methods, All rights reserved. http://creativecommons.org/licenses/by-nc-sa/4.0 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License: http://creativecommons.org/licenses/by-nc-sa/4.0
spellingShingle Resource
Ortell, Katherine K
Switonski, Pawel M
Delaney, Joe Ryan
FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
title FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
title_full FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
title_fullStr FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
title_full_unstemmed FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
title_short FairSubset: A tool to choose representative subsets of data for use with replicates or groups of different sample sizes
title_sort fairsubset: a tool to choose representative subsets of data for use with replicates or groups of different sample sizes
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6761370/
https://www.ncbi.nlm.nih.gov/pubmed/31583263
http://dx.doi.org/10.14440/jbm.2019.299
work_keys_str_mv AT ortellkatherinek fairsubsetatooltochooserepresentativesubsetsofdataforusewithreplicatesorgroupsofdifferentsamplesizes
AT switonskipawelm fairsubsetatooltochooserepresentativesubsetsofdataforusewithreplicatesorgroupsofdifferentsamplesizes
AT delaneyjoeryan fairsubsetatooltochooserepresentativesubsetsofdataforusewithreplicatesorgroupsofdifferentsamplesizes