Cargando…

ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions

Genome-scale studies of the bacterial regulatory network have been leveraged by declining sequencing cost and advances in ChIP (chromatin immunoprecipitation) methods. Of which, ChIP-exo has proven competent with its near-single base-pair resolution. While several algorithms and programs have been d...

Descripción completa

Detalles Bibliográficos
Autores principales: Bang, Ina, Khanh Nong, Linh, Young Park, Joon, Thi Le, Hoa, Mok Lee, Sang-, Kim, Donghyuk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9735260/
https://www.ncbi.nlm.nih.gov/pubmed/36544470
http://dx.doi.org/10.1016/j.csbj.2022.11.053
_version_ 1784846719500943360
author Bang, Ina
Khanh Nong, Linh
Young Park, Joon
Thi Le, Hoa
Mok Lee, Sang-
Kim, Donghyuk
author_facet Bang, Ina
Khanh Nong, Linh
Young Park, Joon
Thi Le, Hoa
Mok Lee, Sang-
Kim, Donghyuk
author_sort Bang, Ina
collection PubMed
description Genome-scale studies of the bacterial regulatory network have been leveraged by declining sequencing cost and advances in ChIP (chromatin immunoprecipitation) methods. Of which, ChIP-exo has proven competent with its near-single base-pair resolution. While several algorithms and programs have been developed for different analytical steps in ChIP-exo data processing, there is a lack of effort in incorporating them into a convenient bioinformatics pipeline that is intuitive and publicly available. In this paper, we developed ChIP-exo Analysis Pipeline (ChEAP) that executes the one-step process, starting from trimming and aligning raw sequencing reads to visualization of ChIP-exo results. The pipeline was implemented on the interactive web-based Python development environment – Jupyter Notebook, which is compatible with the Google Colab cloud platform to facilitate the sharing of codes and collaboration among researchers. Additionally, users could exploit the free GPU and CPU resources allocated by Colab to carry out computing tasks regardless of the performance of their local machines. The utility of ChEAP was demonstrated with the ChIP-exo datasets of RpoN sigma factor in E. coli K-12 MG1655. To analyze two raw data files, ChEAP runtime was 2 min and 25 s. Subsequent analyses identified 113 RpoN binding sites showing a conserved RpoN binding pattern in the motif search. ChEAP application in ChIP-exo data analysis is extensive and flexible for the parallel processing of data from various organisms.
format Online
Article
Text
id pubmed-9735260
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-97352602022-12-20 ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions Bang, Ina Khanh Nong, Linh Young Park, Joon Thi Le, Hoa Mok Lee, Sang- Kim, Donghyuk Comput Struct Biotechnol J Research Article Genome-scale studies of the bacterial regulatory network have been leveraged by declining sequencing cost and advances in ChIP (chromatin immunoprecipitation) methods. Of which, ChIP-exo has proven competent with its near-single base-pair resolution. While several algorithms and programs have been developed for different analytical steps in ChIP-exo data processing, there is a lack of effort in incorporating them into a convenient bioinformatics pipeline that is intuitive and publicly available. In this paper, we developed ChIP-exo Analysis Pipeline (ChEAP) that executes the one-step process, starting from trimming and aligning raw sequencing reads to visualization of ChIP-exo results. The pipeline was implemented on the interactive web-based Python development environment – Jupyter Notebook, which is compatible with the Google Colab cloud platform to facilitate the sharing of codes and collaboration among researchers. Additionally, users could exploit the free GPU and CPU resources allocated by Colab to carry out computing tasks regardless of the performance of their local machines. The utility of ChEAP was demonstrated with the ChIP-exo datasets of RpoN sigma factor in E. coli K-12 MG1655. To analyze two raw data files, ChEAP runtime was 2 min and 25 s. Subsequent analyses identified 113 RpoN binding sites showing a conserved RpoN binding pattern in the motif search. ChEAP application in ChIP-exo data analysis is extensive and flexible for the parallel processing of data from various organisms. Research Network of Computational and Structural Biotechnology 2022-12-02 /pmc/articles/PMC9735260/ /pubmed/36544470 http://dx.doi.org/10.1016/j.csbj.2022.11.053 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Bang, Ina
Khanh Nong, Linh
Young Park, Joon
Thi Le, Hoa
Mok Lee, Sang-
Kim, Donghyuk
ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
title ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
title_full ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
title_fullStr ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
title_full_unstemmed ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
title_short ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
title_sort cheap: chip-exo analysis pipeline and the investigation of escherichia coli rpon protein-dna interactions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9735260/
https://www.ncbi.nlm.nih.gov/pubmed/36544470
http://dx.doi.org/10.1016/j.csbj.2022.11.053
work_keys_str_mv AT bangina cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions
AT khanhnonglinh cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions
AT youngparkjoon cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions
AT thilehoa cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions
AT mokleesang cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions
AT kimdonghyuk cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions