Cargando…

Enhancer/gene relationships: Need for more reliable genome-wide reference sets

Differences in cells’ functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoellinger, Tristan, Mestre, Camille, Aschard, Hugues, Le Goff, Wilfried, Foissac, Sylvain, Faraut, Thomas, Djebali, Sarah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9999192/
https://www.ncbi.nlm.nih.gov/pubmed/36909938
http://dx.doi.org/10.3389/fbinf.2023.1092853
_version_ 1784903614330830848
author Hoellinger, Tristan
Mestre, Camille
Aschard, Hugues
Le Goff, Wilfried
Foissac, Sylvain
Faraut, Thomas
Djebali, Sarah
author_facet Hoellinger, Tristan
Mestre, Camille
Aschard, Hugues
Le Goff, Wilfried
Foissac, Sylvain
Faraut, Thomas
Djebali, Sarah
author_sort Hoellinger, Tristan
collection PubMed
description Differences in cells’ functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods.
format Online
Article
Text
id pubmed-9999192
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99991922023-03-11 Enhancer/gene relationships: Need for more reliable genome-wide reference sets Hoellinger, Tristan Mestre, Camille Aschard, Hugues Le Goff, Wilfried Foissac, Sylvain Faraut, Thomas Djebali, Sarah Front Bioinform Bioinformatics Differences in cells’ functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods. Frontiers Media S.A. 2023-02-24 /pmc/articles/PMC9999192/ /pubmed/36909938 http://dx.doi.org/10.3389/fbinf.2023.1092853 Text en Copyright © 2023 Hoellinger, Mestre, Aschard, Le Goff, Foissac, Faraut and Djebali. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Hoellinger, Tristan
Mestre, Camille
Aschard, Hugues
Le Goff, Wilfried
Foissac, Sylvain
Faraut, Thomas
Djebali, Sarah
Enhancer/gene relationships: Need for more reliable genome-wide reference sets
title Enhancer/gene relationships: Need for more reliable genome-wide reference sets
title_full Enhancer/gene relationships: Need for more reliable genome-wide reference sets
title_fullStr Enhancer/gene relationships: Need for more reliable genome-wide reference sets
title_full_unstemmed Enhancer/gene relationships: Need for more reliable genome-wide reference sets
title_short Enhancer/gene relationships: Need for more reliable genome-wide reference sets
title_sort enhancer/gene relationships: need for more reliable genome-wide reference sets
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9999192/
https://www.ncbi.nlm.nih.gov/pubmed/36909938
http://dx.doi.org/10.3389/fbinf.2023.1092853
work_keys_str_mv AT hoellingertristan enhancergenerelationshipsneedformorereliablegenomewidereferencesets
AT mestrecamille enhancergenerelationshipsneedformorereliablegenomewidereferencesets
AT aschardhugues enhancergenerelationshipsneedformorereliablegenomewidereferencesets
AT legoffwilfried enhancergenerelationshipsneedformorereliablegenomewidereferencesets
AT foissacsylvain enhancergenerelationshipsneedformorereliablegenomewidereferencesets
AT farautthomas enhancergenerelationshipsneedformorereliablegenomewidereferencesets
AT djebalisarah enhancergenerelationshipsneedformorereliablegenomewidereferencesets