Cargando…
matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
MOTIVATION: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-unif...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168584/ https://www.ncbi.nlm.nih.gov/pubmed/37084270 http://dx.doi.org/10.1093/bioinformatics/btad197 |
_version_ | 1785038884448501760 |
---|---|
author | Davis, Eric S Mu, Wancen Lee, Stuart Dozmorov, Mikhail G Love, Michael I Phanstiel, Douglas H |
author_facet | Davis, Eric S Mu, Wancen Lee, Stuart Dozmorov, Mikhail G Love, Michael I Phanstiel, Douglas H |
author_sort | Davis, Eric S |
collection | PubMed |
description | MOTIVATION: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-uniform distribution of genomic features including genes, enhancers, and transcription factor binding sites. Propensity score-based covariate matching methods allow the selection of null sets from a pool of possible items while controlling for multiple covariates; however, existing packages do not operate on genomic data classes and can be slow for large data sets making them difficult to integrate into genomic workflows. RESULTS: To address this, we developed matchRanges, a propensity score-based covariate matching method for the efficient and convenient generation of matched null ranges from a set of background ranges within the Bioconductor framework. AVAILABILITY AND IMPLEMENTATION: Package: https://bioconductor.org/packages/nullranges, Code: https://github.com/nullranges, Documentation: https://nullranges.github.io/nullranges. |
format | Online Article Text |
id | pubmed-10168584 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101685842023-05-10 matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling Davis, Eric S Mu, Wancen Lee, Stuart Dozmorov, Mikhail G Love, Michael I Phanstiel, Douglas H Bioinformatics Applications Note MOTIVATION: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-uniform distribution of genomic features including genes, enhancers, and transcription factor binding sites. Propensity score-based covariate matching methods allow the selection of null sets from a pool of possible items while controlling for multiple covariates; however, existing packages do not operate on genomic data classes and can be slow for large data sets making them difficult to integrate into genomic workflows. RESULTS: To address this, we developed matchRanges, a propensity score-based covariate matching method for the efficient and convenient generation of matched null ranges from a set of background ranges within the Bioconductor framework. AVAILABILITY AND IMPLEMENTATION: Package: https://bioconductor.org/packages/nullranges, Code: https://github.com/nullranges, Documentation: https://nullranges.github.io/nullranges. Oxford University Press 2023-04-21 /pmc/articles/PMC10168584/ /pubmed/37084270 http://dx.doi.org/10.1093/bioinformatics/btad197 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Davis, Eric S Mu, Wancen Lee, Stuart Dozmorov, Mikhail G Love, Michael I Phanstiel, Douglas H matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling |
title | matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling |
title_full | matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling |
title_fullStr | matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling |
title_full_unstemmed | matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling |
title_short | matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling |
title_sort | matchranges: generating null hypothesis genomic ranges via covariate-matched sampling |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168584/ https://www.ncbi.nlm.nih.gov/pubmed/37084270 http://dx.doi.org/10.1093/bioinformatics/btad197 |
work_keys_str_mv | AT daviserics matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling AT muwancen matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling AT leestuart matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling AT dozmorovmikhailg matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling AT lovemichaeli matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling AT phanstieldouglash matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling |