Cargando…

Deep learning identifies and quantifies recombination hotspot determinants

MOTIVATION: Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contribution...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yu, Chen, Siyuan, Rapakoulia, Trisevgeni, Kuwahara, Hiroyuki, Yip, Kevin Y, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113300/
https://www.ncbi.nlm.nih.gov/pubmed/35561158
http://dx.doi.org/10.1093/bioinformatics/btac234
_version_ 1784709559146774528
author Li, Yu
Chen, Siyuan
Rapakoulia, Trisevgeni
Kuwahara, Hiroyuki
Yip, Kevin Y
Gao, Xin
author_facet Li, Yu
Chen, Siyuan
Rapakoulia, Trisevgeni
Kuwahara, Hiroyuki
Yip, Kevin Y
Gao, Xin
author_sort Li, Yu
collection PubMed
description MOTIVATION: Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contributions to the recombination hotspots have not been quantified, and other determinants are yet to be elucidated. Here, we propose a computational method, RHSNet, based on deep learning and signal processing, to identify and quantify the hotspot determinants in a purely data-driven manner, utilizing datasets from various studies, populations, sexes and species. RESULTS: RHSNet can significantly outperform other sequence-based methods on multiple datasets across different species, sexes and studies. In addition to being able to identify hotspot regions and the well-known determinants accurately, more importantly, RHSNet can quantify the determinants that contribute significantly to the recombination hotspot formation in the relation between PRDM9 binding motif, histone modification and GC content. Further cross-sex, cross-population and cross-species studies suggest that the proposed method has the generalization power and potential to identify and quantify the evolutionary determinant motifs. AVAILABILITY AND IMPLEMENTATION: https://github.com/frankchen121212/RHSNet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9113300
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91133002022-05-18 Deep learning identifies and quantifies recombination hotspot determinants Li, Yu Chen, Siyuan Rapakoulia, Trisevgeni Kuwahara, Hiroyuki Yip, Kevin Y Gao, Xin Bioinformatics Original Papers MOTIVATION: Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contributions to the recombination hotspots have not been quantified, and other determinants are yet to be elucidated. Here, we propose a computational method, RHSNet, based on deep learning and signal processing, to identify and quantify the hotspot determinants in a purely data-driven manner, utilizing datasets from various studies, populations, sexes and species. RESULTS: RHSNet can significantly outperform other sequence-based methods on multiple datasets across different species, sexes and studies. In addition to being able to identify hotspot regions and the well-known determinants accurately, more importantly, RHSNet can quantify the determinants that contribute significantly to the recombination hotspot formation in the relation between PRDM9 binding motif, histone modification and GC content. Further cross-sex, cross-population and cross-species studies suggest that the proposed method has the generalization power and potential to identify and quantify the evolutionary determinant motifs. AVAILABILITY AND IMPLEMENTATION: https://github.com/frankchen121212/RHSNet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-04-12 /pmc/articles/PMC9113300/ /pubmed/35561158 http://dx.doi.org/10.1093/bioinformatics/btac234 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Li, Yu
Chen, Siyuan
Rapakoulia, Trisevgeni
Kuwahara, Hiroyuki
Yip, Kevin Y
Gao, Xin
Deep learning identifies and quantifies recombination hotspot determinants
title Deep learning identifies and quantifies recombination hotspot determinants
title_full Deep learning identifies and quantifies recombination hotspot determinants
title_fullStr Deep learning identifies and quantifies recombination hotspot determinants
title_full_unstemmed Deep learning identifies and quantifies recombination hotspot determinants
title_short Deep learning identifies and quantifies recombination hotspot determinants
title_sort deep learning identifies and quantifies recombination hotspot determinants
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113300/
https://www.ncbi.nlm.nih.gov/pubmed/35561158
http://dx.doi.org/10.1093/bioinformatics/btac234
work_keys_str_mv AT liyu deeplearningidentifiesandquantifiesrecombinationhotspotdeterminants
AT chensiyuan deeplearningidentifiesandquantifiesrecombinationhotspotdeterminants
AT rapakouliatrisevgeni deeplearningidentifiesandquantifiesrecombinationhotspotdeterminants
AT kuwaharahiroyuki deeplearningidentifiesandquantifiesrecombinationhotspotdeterminants
AT yipkeviny deeplearningidentifiesandquantifiesrecombinationhotspotdeterminants
AT gaoxin deeplearningidentifiesandquantifiesrecombinationhotspotdeterminants