Cargando…

Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach

The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network...

Descripción completa

Detalles Bibliográficos
Autores principales: Waardenberg, Ashley J., Homan, Bernou, Mohamed, Stephanie, Harvey, Richard P., Bouveret, Romaric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5043580/
https://www.ncbi.nlm.nih.gov/pubmed/27683156
http://dx.doi.org/10.1098/rsob.160183
_version_ 1782456779916967936
author Waardenberg, Ashley J.
Homan, Bernou
Mohamed, Stephanie
Harvey, Richard P.
Bouveret, Romaric
author_facet Waardenberg, Ashley J.
Homan, Bernou
Mohamed, Stephanie
Harvey, Richard P.
Bouveret, Romaric
author_sort Waardenberg, Ashley J.
collection PubMed
description The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network and is essential for normal heart development. We previously identified genomic targets for NKX2-5 in mouse HL-1 atrial cardiomyocytes using DNA-adenine methyltransferase identification (DamID). Here, we apply machine learning algorithms and propose a knowledge-based feature selection method for predicting NKX2-5 protein : protein interactions based on motif grammar in genome-wide DNA-binding data. We assessed model performance using leave-one-out cross-validation and a completely independent DamID experiment performed with replicates. In addition to identifying previously described NKX2-5-interacting proteins, including GATA, HAND and TBX family members, a number of novel interactors were identified, with direct protein : protein interactions between NKX2-5 and retinoid X receptor (RXR), paired-related homeobox (PRRX) and Ikaros zinc fingers (IKZF) validated using the yeast two-hybrid assay. We also found that the interaction of RXRα with NKX2-5 mutations found in congenital heart disease (Q187H, R189G and R190H) was altered. These findings highlight an intuitive approach to accessing protein–protein interaction information of transcription factors in DNA-binding experiments.
format Online
Article
Text
id pubmed-5043580
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-50435802016-10-05 Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach Waardenberg, Ashley J. Homan, Bernou Mohamed, Stephanie Harvey, Richard P. Bouveret, Romaric Open Biol Research The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network and is essential for normal heart development. We previously identified genomic targets for NKX2-5 in mouse HL-1 atrial cardiomyocytes using DNA-adenine methyltransferase identification (DamID). Here, we apply machine learning algorithms and propose a knowledge-based feature selection method for predicting NKX2-5 protein : protein interactions based on motif grammar in genome-wide DNA-binding data. We assessed model performance using leave-one-out cross-validation and a completely independent DamID experiment performed with replicates. In addition to identifying previously described NKX2-5-interacting proteins, including GATA, HAND and TBX family members, a number of novel interactors were identified, with direct protein : protein interactions between NKX2-5 and retinoid X receptor (RXR), paired-related homeobox (PRRX) and Ikaros zinc fingers (IKZF) validated using the yeast two-hybrid assay. We also found that the interaction of RXRα with NKX2-5 mutations found in congenital heart disease (Q187H, R189G and R190H) was altered. These findings highlight an intuitive approach to accessing protein–protein interaction information of transcription factors in DNA-binding experiments. The Royal Society 2016-09-28 /pmc/articles/PMC5043580/ /pubmed/27683156 http://dx.doi.org/10.1098/rsob.160183 Text en © 2016 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Research
Waardenberg, Ashley J.
Homan, Bernou
Mohamed, Stephanie
Harvey, Richard P.
Bouveret, Romaric
Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
title Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
title_full Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
title_fullStr Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
title_full_unstemmed Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
title_short Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
title_sort prediction and validation of protein–protein interactors from genome-wide dna-binding data using a knowledge-based machine-learning approach
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5043580/
https://www.ncbi.nlm.nih.gov/pubmed/27683156
http://dx.doi.org/10.1098/rsob.160183
work_keys_str_mv AT waardenbergashleyj predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach
AT homanbernou predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach
AT mohamedstephanie predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach
AT harveyrichardp predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach
AT bouveretromaric predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach