Cargando…

Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy

We report an experimental design issue in recent machine learning formulations of the enhancer-promoter interaction problem arising from the fact that many enhancer-promoter pairs share features. Cross-fold validation schemes which do not correctly separate these feature sharing enhancer-promoter pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Xi, Wang, Beer, Michael A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6298642/
https://www.ncbi.nlm.nih.gov/pubmed/30562350
http://dx.doi.org/10.1371/journal.pcbi.1006625
_version_ 1783381341252878336
author Xi, Wang
Beer, Michael A.
author_facet Xi, Wang
Beer, Michael A.
author_sort Xi, Wang
collection PubMed
description We report an experimental design issue in recent machine learning formulations of the enhancer-promoter interaction problem arising from the fact that many enhancer-promoter pairs share features. Cross-fold validation schemes which do not correctly separate these feature sharing enhancer-promoter pairs into one test set report high accuracy, which is actually arising from high training set accuracy and a failure to properly evaluate generalization performance. Cross-fold validation schemes which properly segregate pairs with shared features show markedly reduced ability to predict enhancer-promoter interactions from epigenomic state. Parameter scans with multiple models indicate that local epigenomic features of individual pairs of enhancers and promoters cannot distinguish those pairs that interact from those which do with high accuracy, suggesting that additional information is required to predict enhancer-promoter interactions.
format Online
Article
Text
id pubmed-6298642
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62986422018-12-28 Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy Xi, Wang Beer, Michael A. PLoS Comput Biol Perspective We report an experimental design issue in recent machine learning formulations of the enhancer-promoter interaction problem arising from the fact that many enhancer-promoter pairs share features. Cross-fold validation schemes which do not correctly separate these feature sharing enhancer-promoter pairs into one test set report high accuracy, which is actually arising from high training set accuracy and a failure to properly evaluate generalization performance. Cross-fold validation schemes which properly segregate pairs with shared features show markedly reduced ability to predict enhancer-promoter interactions from epigenomic state. Parameter scans with multiple models indicate that local epigenomic features of individual pairs of enhancers and promoters cannot distinguish those pairs that interact from those which do with high accuracy, suggesting that additional information is required to predict enhancer-promoter interactions. Public Library of Science 2018-12-18 /pmc/articles/PMC6298642/ /pubmed/30562350 http://dx.doi.org/10.1371/journal.pcbi.1006625 Text en © 2018 Xi, Beer http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Perspective
Xi, Wang
Beer, Michael A.
Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
title Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
title_full Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
title_fullStr Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
title_full_unstemmed Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
title_short Local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
title_sort local epigenomic state cannot discriminate interacting and non-interacting enhancer–promoter pairs with high accuracy
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6298642/
https://www.ncbi.nlm.nih.gov/pubmed/30562350
http://dx.doi.org/10.1371/journal.pcbi.1006625
work_keys_str_mv AT xiwang localepigenomicstatecannotdiscriminateinteractingandnoninteractingenhancerpromoterpairswithhighaccuracy
AT beermichaela localepigenomicstatecannotdiscriminateinteractingandnoninteractingenhancerpromoterpairswithhighaccuracy