Cargando…

Uncovering tissue-specific binding features from differential deep learning

Transcription factors (TFs) can bind DNA in a cooperative manner, enabling a mutual increase in occupancy. Through this type of interaction, alternative binding sites can be preferentially bound in different tissues to regulate tissue-specific expression programmes. Recently, deep learning models ha...

Descripción completa

Detalles Bibliográficos
Autores principales: Phuycharoen, Mike, Zarrineh, Peyman, Bridoux, Laure, Amin, Shilu, Losa, Marta, Chen, Ke, Bobola, Nicoletta, Rattray, Magnus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049686/
https://www.ncbi.nlm.nih.gov/pubmed/31974574
http://dx.doi.org/10.1093/nar/gkaa009
_version_ 1783502490847674368
author Phuycharoen, Mike
Zarrineh, Peyman
Bridoux, Laure
Amin, Shilu
Losa, Marta
Chen, Ke
Bobola, Nicoletta
Rattray, Magnus
author_facet Phuycharoen, Mike
Zarrineh, Peyman
Bridoux, Laure
Amin, Shilu
Losa, Marta
Chen, Ke
Bobola, Nicoletta
Rattray, Magnus
author_sort Phuycharoen, Mike
collection PubMed
description Transcription factors (TFs) can bind DNA in a cooperative manner, enabling a mutual increase in occupancy. Through this type of interaction, alternative binding sites can be preferentially bound in different tissues to regulate tissue-specific expression programmes. Recently, deep learning models have become state-of-the-art in various pattern analysis tasks, including applications in the field of genomics. We therefore investigate the application of convolutional neural network (CNN) models to the discovery of sequence features determining cooperative and differential TF binding across tissues. We analyse ChIP-seq data from MEIS, TFs which are broadly expressed across mouse branchial arches, and HOXA2, which is expressed in the second and more posterior branchial arches. By developing models predictive of MEIS differential binding in all three tissues, we are able to accurately predict HOXA2 co-binding sites. We evaluate transfer-like and multitask approaches to regularizing the high-dimensional classification task with a larger regression dataset, allowing for the creation of deeper and more accurate models. We test the performance of perturbation and gradient-based attribution methods in identifying the HOXA2 sites from differential MEIS data. Our results show that deep regularized models significantly outperform shallow CNNs as well as k-mer methods in the discovery of tissue-specific sites bound in vivo.
format Online
Article
Text
id pubmed-7049686
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-70496862020-03-10 Uncovering tissue-specific binding features from differential deep learning Phuycharoen, Mike Zarrineh, Peyman Bridoux, Laure Amin, Shilu Losa, Marta Chen, Ke Bobola, Nicoletta Rattray, Magnus Nucleic Acids Res Methods Online Transcription factors (TFs) can bind DNA in a cooperative manner, enabling a mutual increase in occupancy. Through this type of interaction, alternative binding sites can be preferentially bound in different tissues to regulate tissue-specific expression programmes. Recently, deep learning models have become state-of-the-art in various pattern analysis tasks, including applications in the field of genomics. We therefore investigate the application of convolutional neural network (CNN) models to the discovery of sequence features determining cooperative and differential TF binding across tissues. We analyse ChIP-seq data from MEIS, TFs which are broadly expressed across mouse branchial arches, and HOXA2, which is expressed in the second and more posterior branchial arches. By developing models predictive of MEIS differential binding in all three tissues, we are able to accurately predict HOXA2 co-binding sites. We evaluate transfer-like and multitask approaches to regularizing the high-dimensional classification task with a larger regression dataset, allowing for the creation of deeper and more accurate models. We test the performance of perturbation and gradient-based attribution methods in identifying the HOXA2 sites from differential MEIS data. Our results show that deep regularized models significantly outperform shallow CNNs as well as k-mer methods in the discovery of tissue-specific sites bound in vivo. Oxford University Press 2020-03-18 2020-01-24 /pmc/articles/PMC7049686/ /pubmed/31974574 http://dx.doi.org/10.1093/nar/gkaa009 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Phuycharoen, Mike
Zarrineh, Peyman
Bridoux, Laure
Amin, Shilu
Losa, Marta
Chen, Ke
Bobola, Nicoletta
Rattray, Magnus
Uncovering tissue-specific binding features from differential deep learning
title Uncovering tissue-specific binding features from differential deep learning
title_full Uncovering tissue-specific binding features from differential deep learning
title_fullStr Uncovering tissue-specific binding features from differential deep learning
title_full_unstemmed Uncovering tissue-specific binding features from differential deep learning
title_short Uncovering tissue-specific binding features from differential deep learning
title_sort uncovering tissue-specific binding features from differential deep learning
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7049686/
https://www.ncbi.nlm.nih.gov/pubmed/31974574
http://dx.doi.org/10.1093/nar/gkaa009
work_keys_str_mv AT phuycharoenmike uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT zarrinehpeyman uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT bridouxlaure uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT aminshilu uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT losamarta uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT chenke uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT bobolanicoletta uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning
AT rattraymagnus uncoveringtissuespecificbindingfeaturesfromdifferentialdeeplearning