Cargando…
Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach
The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5043580/ https://www.ncbi.nlm.nih.gov/pubmed/27683156 http://dx.doi.org/10.1098/rsob.160183 |
_version_ | 1782456779916967936 |
---|---|
author | Waardenberg, Ashley J. Homan, Bernou Mohamed, Stephanie Harvey, Richard P. Bouveret, Romaric |
author_facet | Waardenberg, Ashley J. Homan, Bernou Mohamed, Stephanie Harvey, Richard P. Bouveret, Romaric |
author_sort | Waardenberg, Ashley J. |
collection | PubMed |
description | The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network and is essential for normal heart development. We previously identified genomic targets for NKX2-5 in mouse HL-1 atrial cardiomyocytes using DNA-adenine methyltransferase identification (DamID). Here, we apply machine learning algorithms and propose a knowledge-based feature selection method for predicting NKX2-5 protein : protein interactions based on motif grammar in genome-wide DNA-binding data. We assessed model performance using leave-one-out cross-validation and a completely independent DamID experiment performed with replicates. In addition to identifying previously described NKX2-5-interacting proteins, including GATA, HAND and TBX family members, a number of novel interactors were identified, with direct protein : protein interactions between NKX2-5 and retinoid X receptor (RXR), paired-related homeobox (PRRX) and Ikaros zinc fingers (IKZF) validated using the yeast two-hybrid assay. We also found that the interaction of RXRα with NKX2-5 mutations found in congenital heart disease (Q187H, R189G and R190H) was altered. These findings highlight an intuitive approach to accessing protein–protein interaction information of transcription factors in DNA-binding experiments. |
format | Online Article Text |
id | pubmed-5043580 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-50435802016-10-05 Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach Waardenberg, Ashley J. Homan, Bernou Mohamed, Stephanie Harvey, Richard P. Bouveret, Romaric Open Biol Research The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network and is essential for normal heart development. We previously identified genomic targets for NKX2-5 in mouse HL-1 atrial cardiomyocytes using DNA-adenine methyltransferase identification (DamID). Here, we apply machine learning algorithms and propose a knowledge-based feature selection method for predicting NKX2-5 protein : protein interactions based on motif grammar in genome-wide DNA-binding data. We assessed model performance using leave-one-out cross-validation and a completely independent DamID experiment performed with replicates. In addition to identifying previously described NKX2-5-interacting proteins, including GATA, HAND and TBX family members, a number of novel interactors were identified, with direct protein : protein interactions between NKX2-5 and retinoid X receptor (RXR), paired-related homeobox (PRRX) and Ikaros zinc fingers (IKZF) validated using the yeast two-hybrid assay. We also found that the interaction of RXRα with NKX2-5 mutations found in congenital heart disease (Q187H, R189G and R190H) was altered. These findings highlight an intuitive approach to accessing protein–protein interaction information of transcription factors in DNA-binding experiments. The Royal Society 2016-09-28 /pmc/articles/PMC5043580/ /pubmed/27683156 http://dx.doi.org/10.1098/rsob.160183 Text en © 2016 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Research Waardenberg, Ashley J. Homan, Bernou Mohamed, Stephanie Harvey, Richard P. Bouveret, Romaric Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach |
title | Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach |
title_full | Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach |
title_fullStr | Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach |
title_full_unstemmed | Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach |
title_short | Prediction and validation of protein–protein interactors from genome-wide DNA-binding data using a knowledge-based machine-learning approach |
title_sort | prediction and validation of protein–protein interactors from genome-wide dna-binding data using a knowledge-based machine-learning approach |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5043580/ https://www.ncbi.nlm.nih.gov/pubmed/27683156 http://dx.doi.org/10.1098/rsob.160183 |
work_keys_str_mv | AT waardenbergashleyj predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach AT homanbernou predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach AT mohamedstephanie predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach AT harveyrichardp predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach AT bouveretromaric predictionandvalidationofproteinproteininteractorsfromgenomewidednabindingdatausingaknowledgebasedmachinelearningapproach |