Cargando…

Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems

The availability of omic data produced from international consortia, as well as from worldwide laboratories, is offering the possibility both to answer long-standing questions in biomedicine/molecular biology and to formulate novel hypotheses to test. However, the impact of such data is not fully ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Angelini, Claudia, Costa, Valerio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4207007/
https://www.ncbi.nlm.nih.gov/pubmed/25364758
http://dx.doi.org/10.3389/fcell.2014.00051
_version_ 1782340907613290496
author Angelini, Claudia
Costa, Valerio
author_facet Angelini, Claudia
Costa, Valerio
author_sort Angelini, Claudia
collection PubMed
description The availability of omic data produced from international consortia, as well as from worldwide laboratories, is offering the possibility both to answer long-standing questions in biomedicine/molecular biology and to formulate novel hypotheses to test. However, the impact of such data is not fully exploited due to a limited availability of multi-omic data integration tools and methods. In this paper, we discuss the interplay between gene expression and epigenetic markers/transcription factors. We show how integrating ChIP-seq and RNA-seq data can help to elucidate gene regulatory mechanisms. In particular, we discuss the two following questions: (i) Can transcription factor occupancies or histone modification data predict gene expression? (ii) Can ChIP-seq and RNA-seq data be used to infer gene regulatory networks? We propose potential directions for statistical data integration. We discuss the importance of incorporating underestimated aspects (such as alternative splicing and long-range chromatin interactions). We also highlight the lack of data benchmarks and the need to develop tools for data integration from a statistical viewpoint, designed in the spirit of reproducible research.
format Online
Article
Text
id pubmed-4207007
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-42070072014-10-31 Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems Angelini, Claudia Costa, Valerio Front Cell Dev Biol Physiology The availability of omic data produced from international consortia, as well as from worldwide laboratories, is offering the possibility both to answer long-standing questions in biomedicine/molecular biology and to formulate novel hypotheses to test. However, the impact of such data is not fully exploited due to a limited availability of multi-omic data integration tools and methods. In this paper, we discuss the interplay between gene expression and epigenetic markers/transcription factors. We show how integrating ChIP-seq and RNA-seq data can help to elucidate gene regulatory mechanisms. In particular, we discuss the two following questions: (i) Can transcription factor occupancies or histone modification data predict gene expression? (ii) Can ChIP-seq and RNA-seq data be used to infer gene regulatory networks? We propose potential directions for statistical data integration. We discuss the importance of incorporating underestimated aspects (such as alternative splicing and long-range chromatin interactions). We also highlight the lack of data benchmarks and the need to develop tools for data integration from a statistical viewpoint, designed in the spirit of reproducible research. Frontiers Media S.A. 2014-09-17 /pmc/articles/PMC4207007/ /pubmed/25364758 http://dx.doi.org/10.3389/fcell.2014.00051 Text en Copyright © 2014 Angelini and Costa. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Angelini, Claudia
Costa, Valerio
Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems
title Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems
title_full Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems
title_fullStr Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems
title_full_unstemmed Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems
title_short Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems
title_sort understanding gene regulatory mechanisms by integrating chip-seq and rna-seq data: statistical solutions to biological problems
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4207007/
https://www.ncbi.nlm.nih.gov/pubmed/25364758
http://dx.doi.org/10.3389/fcell.2014.00051
work_keys_str_mv AT angeliniclaudia understandinggeneregulatorymechanismsbyintegratingchipseqandrnaseqdatastatisticalsolutionstobiologicalproblems
AT costavalerio understandinggeneregulatorymechanismsbyintegratingchipseqandrnaseqdatastatisticalsolutionstobiologicalproblems