Cargando…

matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling

MOTIVATION: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-unif...

Descripción completa

Detalles Bibliográficos
Autores principales: Davis, Eric S, Mu, Wancen, Lee, Stuart, Dozmorov, Mikhail G, Love, Michael I, Phanstiel, Douglas H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168584/
https://www.ncbi.nlm.nih.gov/pubmed/37084270
http://dx.doi.org/10.1093/bioinformatics/btad197
_version_ 1785038884448501760
author Davis, Eric S
Mu, Wancen
Lee, Stuart
Dozmorov, Mikhail G
Love, Michael I
Phanstiel, Douglas H
author_facet Davis, Eric S
Mu, Wancen
Lee, Stuart
Dozmorov, Mikhail G
Love, Michael I
Phanstiel, Douglas H
author_sort Davis, Eric S
collection PubMed
description MOTIVATION: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-uniform distribution of genomic features including genes, enhancers, and transcription factor binding sites. Propensity score-based covariate matching methods allow the selection of null sets from a pool of possible items while controlling for multiple covariates; however, existing packages do not operate on genomic data classes and can be slow for large data sets making them difficult to integrate into genomic workflows. RESULTS: To address this, we developed matchRanges, a propensity score-based covariate matching method for the efficient and convenient generation of matched null ranges from a set of background ranges within the Bioconductor framework. AVAILABILITY AND IMPLEMENTATION: Package: https://bioconductor.org/packages/nullranges, Code: https://github.com/nullranges, Documentation: https://nullranges.github.io/nullranges.
format Online
Article
Text
id pubmed-10168584
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101685842023-05-10 matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling Davis, Eric S Mu, Wancen Lee, Stuart Dozmorov, Mikhail G Love, Michael I Phanstiel, Douglas H Bioinformatics Applications Note MOTIVATION: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-uniform distribution of genomic features including genes, enhancers, and transcription factor binding sites. Propensity score-based covariate matching methods allow the selection of null sets from a pool of possible items while controlling for multiple covariates; however, existing packages do not operate on genomic data classes and can be slow for large data sets making them difficult to integrate into genomic workflows. RESULTS: To address this, we developed matchRanges, a propensity score-based covariate matching method for the efficient and convenient generation of matched null ranges from a set of background ranges within the Bioconductor framework. AVAILABILITY AND IMPLEMENTATION: Package: https://bioconductor.org/packages/nullranges, Code: https://github.com/nullranges, Documentation: https://nullranges.github.io/nullranges. Oxford University Press 2023-04-21 /pmc/articles/PMC10168584/ /pubmed/37084270 http://dx.doi.org/10.1093/bioinformatics/btad197 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Davis, Eric S
Mu, Wancen
Lee, Stuart
Dozmorov, Mikhail G
Love, Michael I
Phanstiel, Douglas H
matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
title matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
title_full matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
title_fullStr matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
title_full_unstemmed matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
title_short matchRanges: generating null hypothesis genomic ranges via covariate-matched sampling
title_sort matchranges: generating null hypothesis genomic ranges via covariate-matched sampling
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168584/
https://www.ncbi.nlm.nih.gov/pubmed/37084270
http://dx.doi.org/10.1093/bioinformatics/btad197
work_keys_str_mv AT daviserics matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling
AT muwancen matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling
AT leestuart matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling
AT dozmorovmikhailg matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling
AT lovemichaeli matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling
AT phanstieldouglash matchrangesgeneratingnullhypothesisgenomicrangesviacovariatematchedsampling