Cargando…

RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data

BACKGROUND: Massive amounts of data are produced by combining next-generation sequencing with complex biochemistry techniques to characterize regulatory genomics profiles, such as protein–DNA interaction and chromatin accessibility. Interpretation of such high-throughput data typically requires diff...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhijian, Kuo, Chao-Chung, Ticconi, Fabio, Shaigan, Mina, Gehrmann, Julia, Gusmao, Eduardo Gade, Allhoff, Manuel, Manolov, Martin, Zenke, Martin, Costa, Ivan G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9990262/
https://www.ncbi.nlm.nih.gov/pubmed/36879236
http://dx.doi.org/10.1186/s12859-023-05184-5
_version_ 1784901902676262912
author Li, Zhijian
Kuo, Chao-Chung
Ticconi, Fabio
Shaigan, Mina
Gehrmann, Julia
Gusmao, Eduardo Gade
Allhoff, Manuel
Manolov, Martin
Zenke, Martin
Costa, Ivan G.
author_facet Li, Zhijian
Kuo, Chao-Chung
Ticconi, Fabio
Shaigan, Mina
Gehrmann, Julia
Gusmao, Eduardo Gade
Allhoff, Manuel
Manolov, Martin
Zenke, Martin
Costa, Ivan G.
author_sort Li, Zhijian
collection PubMed
description BACKGROUND: Massive amounts of data are produced by combining next-generation sequencing with complex biochemistry techniques to characterize regulatory genomics profiles, such as protein–DNA interaction and chromatin accessibility. Interpretation of such high-throughput data typically requires different computation methods. However, existing tools are usually developed for a specific task, which makes it challenging to analyze the data in an integrative manner. RESULTS: We here describe the Regulatory Genomics Toolbox (RGT), a computational library for the integrative analysis of regulatory genomics data. RGT provides different functionalities to handle genomic signals and regions. Based on that, we developed several tools to perform distinct downstream analyses, including the prediction of transcription factor binding sites using ATAC-seq data, identification of differential peaks from ChIP-seq data, and detection of triple helix mediated RNA and DNA interactions, visualization, and finding an association between distinct regulatory factors. CONCLUSION: We present here RGT; a framework to facilitate the customization of computational methods to analyze genomic data for specific regulatory genomics problems. RGT is a comprehensive and flexible Python package for analyzing high throughput regulatory genomics data and is available at: https://github.com/CostaLab/reg-gen. The documentation is available at: https://reg-gen.readthedocs.io
format Online
Article
Text
id pubmed-9990262
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-99902622023-03-08 RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data Li, Zhijian Kuo, Chao-Chung Ticconi, Fabio Shaigan, Mina Gehrmann, Julia Gusmao, Eduardo Gade Allhoff, Manuel Manolov, Martin Zenke, Martin Costa, Ivan G. BMC Bioinformatics Software BACKGROUND: Massive amounts of data are produced by combining next-generation sequencing with complex biochemistry techniques to characterize regulatory genomics profiles, such as protein–DNA interaction and chromatin accessibility. Interpretation of such high-throughput data typically requires different computation methods. However, existing tools are usually developed for a specific task, which makes it challenging to analyze the data in an integrative manner. RESULTS: We here describe the Regulatory Genomics Toolbox (RGT), a computational library for the integrative analysis of regulatory genomics data. RGT provides different functionalities to handle genomic signals and regions. Based on that, we developed several tools to perform distinct downstream analyses, including the prediction of transcription factor binding sites using ATAC-seq data, identification of differential peaks from ChIP-seq data, and detection of triple helix mediated RNA and DNA interactions, visualization, and finding an association between distinct regulatory factors. CONCLUSION: We present here RGT; a framework to facilitate the customization of computational methods to analyze genomic data for specific regulatory genomics problems. RGT is a comprehensive and flexible Python package for analyzing high throughput regulatory genomics data and is available at: https://github.com/CostaLab/reg-gen. The documentation is available at: https://reg-gen.readthedocs.io BioMed Central 2023-03-06 /pmc/articles/PMC9990262/ /pubmed/36879236 http://dx.doi.org/10.1186/s12859-023-05184-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Li, Zhijian
Kuo, Chao-Chung
Ticconi, Fabio
Shaigan, Mina
Gehrmann, Julia
Gusmao, Eduardo Gade
Allhoff, Manuel
Manolov, Martin
Zenke, Martin
Costa, Ivan G.
RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data
title RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data
title_full RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data
title_fullStr RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data
title_full_unstemmed RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data
title_short RGT: a toolbox for the integrative analysis of high throughput regulatory genomics data
title_sort rgt: a toolbox for the integrative analysis of high throughput regulatory genomics data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9990262/
https://www.ncbi.nlm.nih.gov/pubmed/36879236
http://dx.doi.org/10.1186/s12859-023-05184-5
work_keys_str_mv AT lizhijian rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT kuochaochung rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT ticconifabio rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT shaiganmina rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT gehrmannjulia rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT gusmaoeduardogade rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT allhoffmanuel rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT manolovmartin rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT zenkemartin rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata
AT costaivang rgtatoolboxfortheintegrativeanalysisofhighthroughputregulatorygenomicsdata