Cargando…

Revisiting Assessment of Computational Methods for Hi-C Data Analysis

The performances of algorithms for Hi-C data preprocessing, the identification of topologically associating domains, and the detection of chromatin interactions and promoter–enhancer interactions have been mostly evaluated using semi-quantitative or synthetic data approaches, without utilizing the m...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Jing, Zhu, Xingxing, Wang, Rui, Li, Mingzhou, Tang, Qianzi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531246/
https://www.ncbi.nlm.nih.gov/pubmed/37762117
http://dx.doi.org/10.3390/ijms241813814
_version_ 1785111673487491072
author Yang, Jing
Zhu, Xingxing
Wang, Rui
Li, Mingzhou
Tang, Qianzi
author_facet Yang, Jing
Zhu, Xingxing
Wang, Rui
Li, Mingzhou
Tang, Qianzi
author_sort Yang, Jing
collection PubMed
description The performances of algorithms for Hi-C data preprocessing, the identification of topologically associating domains, and the detection of chromatin interactions and promoter–enhancer interactions have been mostly evaluated using semi-quantitative or synthetic data approaches, without utilizing the most recent methods, since 2017. In this study, we comprehensively evaluated 24 popular state-of-the-art methods for the complete end-to-end pipeline of Hi-C data analysis, using manually curated or experimentally validated benchmark datasets, including a CRISPR dataset for promoter–enhancer interaction validation. Our results indicate that, although no single method exhibited superior performance in all situations, HiC-Pro, DomainCaller, and Fit-Hi-C2 showed relatively balanced performances of most evaluation metrics for preprocessing, topologically associating domain identification, and chromatin interaction/promoter–enhancer interaction detection, respectively. The comprehensive comparison presented in this manuscript provides a reference for researchers to choose Hi-C analysis tools that best suit their needs.
format Online
Article
Text
id pubmed-10531246
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-105312462023-09-28 Revisiting Assessment of Computational Methods for Hi-C Data Analysis Yang, Jing Zhu, Xingxing Wang, Rui Li, Mingzhou Tang, Qianzi Int J Mol Sci Article The performances of algorithms for Hi-C data preprocessing, the identification of topologically associating domains, and the detection of chromatin interactions and promoter–enhancer interactions have been mostly evaluated using semi-quantitative or synthetic data approaches, without utilizing the most recent methods, since 2017. In this study, we comprehensively evaluated 24 popular state-of-the-art methods for the complete end-to-end pipeline of Hi-C data analysis, using manually curated or experimentally validated benchmark datasets, including a CRISPR dataset for promoter–enhancer interaction validation. Our results indicate that, although no single method exhibited superior performance in all situations, HiC-Pro, DomainCaller, and Fit-Hi-C2 showed relatively balanced performances of most evaluation metrics for preprocessing, topologically associating domain identification, and chromatin interaction/promoter–enhancer interaction detection, respectively. The comprehensive comparison presented in this manuscript provides a reference for researchers to choose Hi-C analysis tools that best suit their needs. MDPI 2023-09-07 /pmc/articles/PMC10531246/ /pubmed/37762117 http://dx.doi.org/10.3390/ijms241813814 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yang, Jing
Zhu, Xingxing
Wang, Rui
Li, Mingzhou
Tang, Qianzi
Revisiting Assessment of Computational Methods for Hi-C Data Analysis
title Revisiting Assessment of Computational Methods for Hi-C Data Analysis
title_full Revisiting Assessment of Computational Methods for Hi-C Data Analysis
title_fullStr Revisiting Assessment of Computational Methods for Hi-C Data Analysis
title_full_unstemmed Revisiting Assessment of Computational Methods for Hi-C Data Analysis
title_short Revisiting Assessment of Computational Methods for Hi-C Data Analysis
title_sort revisiting assessment of computational methods for hi-c data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10531246/
https://www.ncbi.nlm.nih.gov/pubmed/37762117
http://dx.doi.org/10.3390/ijms241813814
work_keys_str_mv AT yangjing revisitingassessmentofcomputationalmethodsforhicdataanalysis
AT zhuxingxing revisitingassessmentofcomputationalmethodsforhicdataanalysis
AT wangrui revisitingassessmentofcomputationalmethodsforhicdataanalysis
AT limingzhou revisitingassessmentofcomputationalmethodsforhicdataanalysis
AT tangqianzi revisitingassessmentofcomputationalmethodsforhicdataanalysis