Cargando…
ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions
Genome-scale studies of the bacterial regulatory network have been leveraged by declining sequencing cost and advances in ChIP (chromatin immunoprecipitation) methods. Of which, ChIP-exo has proven competent with its near-single base-pair resolution. While several algorithms and programs have been d...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9735260/ https://www.ncbi.nlm.nih.gov/pubmed/36544470 http://dx.doi.org/10.1016/j.csbj.2022.11.053 |
_version_ | 1784846719500943360 |
---|---|
author | Bang, Ina Khanh Nong, Linh Young Park, Joon Thi Le, Hoa Mok Lee, Sang- Kim, Donghyuk |
author_facet | Bang, Ina Khanh Nong, Linh Young Park, Joon Thi Le, Hoa Mok Lee, Sang- Kim, Donghyuk |
author_sort | Bang, Ina |
collection | PubMed |
description | Genome-scale studies of the bacterial regulatory network have been leveraged by declining sequencing cost and advances in ChIP (chromatin immunoprecipitation) methods. Of which, ChIP-exo has proven competent with its near-single base-pair resolution. While several algorithms and programs have been developed for different analytical steps in ChIP-exo data processing, there is a lack of effort in incorporating them into a convenient bioinformatics pipeline that is intuitive and publicly available. In this paper, we developed ChIP-exo Analysis Pipeline (ChEAP) that executes the one-step process, starting from trimming and aligning raw sequencing reads to visualization of ChIP-exo results. The pipeline was implemented on the interactive web-based Python development environment – Jupyter Notebook, which is compatible with the Google Colab cloud platform to facilitate the sharing of codes and collaboration among researchers. Additionally, users could exploit the free GPU and CPU resources allocated by Colab to carry out computing tasks regardless of the performance of their local machines. The utility of ChEAP was demonstrated with the ChIP-exo datasets of RpoN sigma factor in E. coli K-12 MG1655. To analyze two raw data files, ChEAP runtime was 2 min and 25 s. Subsequent analyses identified 113 RpoN binding sites showing a conserved RpoN binding pattern in the motif search. ChEAP application in ChIP-exo data analysis is extensive and flexible for the parallel processing of data from various organisms. |
format | Online Article Text |
id | pubmed-9735260 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-97352602022-12-20 ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions Bang, Ina Khanh Nong, Linh Young Park, Joon Thi Le, Hoa Mok Lee, Sang- Kim, Donghyuk Comput Struct Biotechnol J Research Article Genome-scale studies of the bacterial regulatory network have been leveraged by declining sequencing cost and advances in ChIP (chromatin immunoprecipitation) methods. Of which, ChIP-exo has proven competent with its near-single base-pair resolution. While several algorithms and programs have been developed for different analytical steps in ChIP-exo data processing, there is a lack of effort in incorporating them into a convenient bioinformatics pipeline that is intuitive and publicly available. In this paper, we developed ChIP-exo Analysis Pipeline (ChEAP) that executes the one-step process, starting from trimming and aligning raw sequencing reads to visualization of ChIP-exo results. The pipeline was implemented on the interactive web-based Python development environment – Jupyter Notebook, which is compatible with the Google Colab cloud platform to facilitate the sharing of codes and collaboration among researchers. Additionally, users could exploit the free GPU and CPU resources allocated by Colab to carry out computing tasks regardless of the performance of their local machines. The utility of ChEAP was demonstrated with the ChIP-exo datasets of RpoN sigma factor in E. coli K-12 MG1655. To analyze two raw data files, ChEAP runtime was 2 min and 25 s. Subsequent analyses identified 113 RpoN binding sites showing a conserved RpoN binding pattern in the motif search. ChEAP application in ChIP-exo data analysis is extensive and flexible for the parallel processing of data from various organisms. Research Network of Computational and Structural Biotechnology 2022-12-02 /pmc/articles/PMC9735260/ /pubmed/36544470 http://dx.doi.org/10.1016/j.csbj.2022.11.053 Text en © 2022 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Bang, Ina Khanh Nong, Linh Young Park, Joon Thi Le, Hoa Mok Lee, Sang- Kim, Donghyuk ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions |
title | ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions |
title_full | ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions |
title_fullStr | ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions |
title_full_unstemmed | ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions |
title_short | ChEAP: ChIP-exo analysis pipeline and the investigation of Escherichia coli RpoN protein-DNA interactions |
title_sort | cheap: chip-exo analysis pipeline and the investigation of escherichia coli rpon protein-dna interactions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9735260/ https://www.ncbi.nlm.nih.gov/pubmed/36544470 http://dx.doi.org/10.1016/j.csbj.2022.11.053 |
work_keys_str_mv | AT bangina cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions AT khanhnonglinh cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions AT youngparkjoon cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions AT thilehoa cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions AT mokleesang cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions AT kimdonghyuk cheapchipexoanalysispipelineandtheinvestigationofescherichiacolirponproteindnainteractions |