Cargando…

Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)

Establishing the architecture of gene regulatory networks (GRNs) relies on chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) methods that provide genome-wide transcription factor binding sites (TFBSs). ChIP-Seq furnishes millions of short reads that, after alignment,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ouma, Wilberforce Zachary, Mejia-Guerra, Maria Katherine, Yilmaz, Alper, Pareja-Tobes, Pablo, Li, Wei, Doseff, Andrea I., Grotewold, Erich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4345404/
https://www.ncbi.nlm.nih.gov/pubmed/25727450
http://dx.doi.org/10.1038/srep08635
_version_ 1782359576288428032
author Ouma, Wilberforce Zachary
Mejia-Guerra, Maria Katherine
Yilmaz, Alper
Pareja-Tobes, Pablo
Li, Wei
Doseff, Andrea I.
Grotewold, Erich
author_facet Ouma, Wilberforce Zachary
Mejia-Guerra, Maria Katherine
Yilmaz, Alper
Pareja-Tobes, Pablo
Li, Wei
Doseff, Andrea I.
Grotewold, Erich
author_sort Ouma, Wilberforce Zachary
collection PubMed
description Establishing the architecture of gene regulatory networks (GRNs) relies on chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) methods that provide genome-wide transcription factor binding sites (TFBSs). ChIP-Seq furnishes millions of short reads that, after alignment, describe the genome-wide binding sites of a particular TF. However, in all organisms investigated an average of 40% of reads fail to align to the corresponding genome, with some datasets having as much as 80% of reads failing to align. We describe here the provenance of previously unaligned reads in ChIP-Seq experiments from animals and plants. We show that a substantial portion corresponds to sequences of bacterial and metazoan origin, irrespective of the ChIP-Seq chromatin source. Unforeseen was the finding that 30%–40% of unaligned reads were actually alignable. To validate these observations, we investigated the characteristics of the previously unaligned reads corresponding to TAL1, a human TF involved in lineage specification of hemopoietic cells. We show that, while unmapped ChIP-Seq read datasets contain foreign DNA sequences, additional TFBSs can be identified from the previously unaligned ChIP-Seq reads. Our results indicate that the re-evaluation of previously unaligned reads from ChIP-Seq experiments will significantly contribute to TF target identification and determination of emerging properties of GRNs.
format Online
Article
Text
id pubmed-4345404
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-43454042015-03-10 Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq) Ouma, Wilberforce Zachary Mejia-Guerra, Maria Katherine Yilmaz, Alper Pareja-Tobes, Pablo Li, Wei Doseff, Andrea I. Grotewold, Erich Sci Rep Article Establishing the architecture of gene regulatory networks (GRNs) relies on chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) methods that provide genome-wide transcription factor binding sites (TFBSs). ChIP-Seq furnishes millions of short reads that, after alignment, describe the genome-wide binding sites of a particular TF. However, in all organisms investigated an average of 40% of reads fail to align to the corresponding genome, with some datasets having as much as 80% of reads failing to align. We describe here the provenance of previously unaligned reads in ChIP-Seq experiments from animals and plants. We show that a substantial portion corresponds to sequences of bacterial and metazoan origin, irrespective of the ChIP-Seq chromatin source. Unforeseen was the finding that 30%–40% of unaligned reads were actually alignable. To validate these observations, we investigated the characteristics of the previously unaligned reads corresponding to TAL1, a human TF involved in lineage specification of hemopoietic cells. We show that, while unmapped ChIP-Seq read datasets contain foreign DNA sequences, additional TFBSs can be identified from the previously unaligned ChIP-Seq reads. Our results indicate that the re-evaluation of previously unaligned reads from ChIP-Seq experiments will significantly contribute to TF target identification and determination of emerging properties of GRNs. Nature Publishing Group 2015-03-02 /pmc/articles/PMC4345404/ /pubmed/25727450 http://dx.doi.org/10.1038/srep08635 Text en Copyright © 2015, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Ouma, Wilberforce Zachary
Mejia-Guerra, Maria Katherine
Yilmaz, Alper
Pareja-Tobes, Pablo
Li, Wei
Doseff, Andrea I.
Grotewold, Erich
Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)
title Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)
title_full Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)
title_fullStr Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)
title_full_unstemmed Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)
title_short Important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (ChIP-Seq)
title_sort important biological information uncovered in previously unaligned reads from chromatin immunoprecipitation experiments (chip-seq)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4345404/
https://www.ncbi.nlm.nih.gov/pubmed/25727450
http://dx.doi.org/10.1038/srep08635
work_keys_str_mv AT oumawilberforcezachary importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq
AT mejiaguerramariakatherine importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq
AT yilmazalper importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq
AT parejatobespablo importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq
AT liwei importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq
AT doseffandreai importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq
AT grotewolderich importantbiologicalinformationuncoveredinpreviouslyunalignedreadsfromchromatinimmunoprecipitationexperimentschipseq