Cargando…
Enhancer/gene relationships: Need for more reliable genome-wide reference sets
Differences in cells’ functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in th...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9999192/ https://www.ncbi.nlm.nih.gov/pubmed/36909938 http://dx.doi.org/10.3389/fbinf.2023.1092853 |
_version_ | 1784903614330830848 |
---|---|
author | Hoellinger, Tristan Mestre, Camille Aschard, Hugues Le Goff, Wilfried Foissac, Sylvain Faraut, Thomas Djebali, Sarah |
author_facet | Hoellinger, Tristan Mestre, Camille Aschard, Hugues Le Goff, Wilfried Foissac, Sylvain Faraut, Thomas Djebali, Sarah |
author_sort | Hoellinger, Tristan |
collection | PubMed |
description | Differences in cells’ functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods. |
format | Online Article Text |
id | pubmed-9999192 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-99991922023-03-11 Enhancer/gene relationships: Need for more reliable genome-wide reference sets Hoellinger, Tristan Mestre, Camille Aschard, Hugues Le Goff, Wilfried Foissac, Sylvain Faraut, Thomas Djebali, Sarah Front Bioinform Bioinformatics Differences in cells’ functions arise from differential activity of regulatory elements, including enhancers. Enhancers are cis-regulatory elements that cooperate with promoters through transcription factors to activate the expression of one or several genes by getting physically close to them in the 3D space of the nucleus. There is increasing evidence that genetic variants associated with common diseases are enriched in enhancers active in cell types relevant to these diseases. Identifying the enhancers associated with genes and conversely, the sets of genes activated by each enhancer (the so-called enhancer/gene or E/G relationships) across cell types, can help understanding the genetic mechanisms underlying human diseases. There are three broad approaches for the genome-wide identification of E/G relationships in a cell type: 1) genetic link methods or eQTL, 2) functional link methods based on 1D functional data such as open chromatin, histone mark or gene expression and 3) spatial link methods based on 3D data such as HiC. Since 1) and 3) are costly, the current strategy is to develop functional link methods and to use data from 1) and 3) as reference to evaluate them. However, there is still no consensus on the best functional link method to date, and method comparison remain seldom. Here, we compared the relative performances of three recent methods for the identification of enhancer-gene links, TargetFinder, Average-Rank, and the ABC model, using the three latest benchmarks from the field: a reference that combines 3D and eQTL data, called BENGI, and two genetic screening references, called CRiFF and CRiSPRi. Overall, none of the three methods performed best on the three references. CRiFF and CRISPRi reference sets are likely more reliable, but CRiFF is not genome-wide and CRiFF and CRISPRi are mostly available on the K562 cancer cell line. The BENGI reference set is genome-wide but likely contains many false positives. This study therefore calls for new reliable and genome-wide E/G reference data rather than new functional link E/G identification methods. Frontiers Media S.A. 2023-02-24 /pmc/articles/PMC9999192/ /pubmed/36909938 http://dx.doi.org/10.3389/fbinf.2023.1092853 Text en Copyright © 2023 Hoellinger, Mestre, Aschard, Le Goff, Foissac, Faraut and Djebali. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Hoellinger, Tristan Mestre, Camille Aschard, Hugues Le Goff, Wilfried Foissac, Sylvain Faraut, Thomas Djebali, Sarah Enhancer/gene relationships: Need for more reliable genome-wide reference sets |
title | Enhancer/gene relationships: Need for more reliable genome-wide reference sets |
title_full | Enhancer/gene relationships: Need for more reliable genome-wide reference sets |
title_fullStr | Enhancer/gene relationships: Need for more reliable genome-wide reference sets |
title_full_unstemmed | Enhancer/gene relationships: Need for more reliable genome-wide reference sets |
title_short | Enhancer/gene relationships: Need for more reliable genome-wide reference sets |
title_sort | enhancer/gene relationships: need for more reliable genome-wide reference sets |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9999192/ https://www.ncbi.nlm.nih.gov/pubmed/36909938 http://dx.doi.org/10.3389/fbinf.2023.1092853 |
work_keys_str_mv | AT hoellingertristan enhancergenerelationshipsneedformorereliablegenomewidereferencesets AT mestrecamille enhancergenerelationshipsneedformorereliablegenomewidereferencesets AT aschardhugues enhancergenerelationshipsneedformorereliablegenomewidereferencesets AT legoffwilfried enhancergenerelationshipsneedformorereliablegenomewidereferencesets AT foissacsylvain enhancergenerelationshipsneedformorereliablegenomewidereferencesets AT farautthomas enhancergenerelationshipsneedformorereliablegenomewidereferencesets AT djebalisarah enhancergenerelationshipsneedformorereliablegenomewidereferencesets |