Cargando…

Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity

Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficien...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhihan, Huang, Yi-Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491176/
https://www.ncbi.nlm.nih.gov/pubmed/37693607
http://dx.doi.org/10.1101/2023.08.29.555384
_version_ 1785104008717795328
author Liu, Zhihan
Huang, Yi-Fei
author_facet Liu, Zhihan
Huang, Yi-Fei
author_sort Liu, Zhihan
collection PubMed
description Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficiency at the gene level, limiting their power to pinpoint deleterious deletions associated with genetic disorders. Here we introduce DosaCNV, a deep multiple-instance learning framework that, for the first time, models deletion pathogenicity jointly with gene haploinsufficiency. By integrating over 30 gene-level features potentially predictive of haploinsufficiency, DosaCNV shows unmatched performance in prioritizing pathogenic deletions associated with a broad spectrum of genetic disorders. Furthermore, DosaCNV outperforms existing methods in predicting gene haploinsufficiency even though it is not trained on known haploinsufficient genes. Finally, DosaCNV leverages a state-of-the-art technique to quantify the contributions of individual gene-level features to haploinsufficiency, allowing for human-understandable explanations of model predictions. Altogether, DosaCNV is a powerful computational tool for both fundamental and translational research.
format Online
Article
Text
id pubmed-10491176
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104911762023-09-09 Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity Liu, Zhihan Huang, Yi-Fei bioRxiv Article Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficiency at the gene level, limiting their power to pinpoint deleterious deletions associated with genetic disorders. Here we introduce DosaCNV, a deep multiple-instance learning framework that, for the first time, models deletion pathogenicity jointly with gene haploinsufficiency. By integrating over 30 gene-level features potentially predictive of haploinsufficiency, DosaCNV shows unmatched performance in prioritizing pathogenic deletions associated with a broad spectrum of genetic disorders. Furthermore, DosaCNV outperforms existing methods in predicting gene haploinsufficiency even though it is not trained on known haploinsufficient genes. Finally, DosaCNV leverages a state-of-the-art technique to quantify the contributions of individual gene-level features to haploinsufficiency, allowing for human-understandable explanations of model predictions. Altogether, DosaCNV is a powerful computational tool for both fundamental and translational research. Cold Spring Harbor Laboratory 2023-10-05 /pmc/articles/PMC10491176/ /pubmed/37693607 http://dx.doi.org/10.1101/2023.08.29.555384 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Liu, Zhihan
Huang, Yi-Fei
Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
title Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
title_full Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
title_fullStr Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
title_full_unstemmed Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
title_short Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
title_sort deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491176/
https://www.ncbi.nlm.nih.gov/pubmed/37693607
http://dx.doi.org/10.1101/2023.08.29.555384
work_keys_str_mv AT liuzhihan deepmultipleinstancelearningaccuratelypredictsgenehaploinsufficiencyanddeletionpathogenicity
AT huangyifei deepmultipleinstancelearningaccuratelypredictsgenehaploinsufficiencyanddeletionpathogenicity