Cargando…
Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity
Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficien...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491176/ https://www.ncbi.nlm.nih.gov/pubmed/37693607 http://dx.doi.org/10.1101/2023.08.29.555384 |
_version_ | 1785104008717795328 |
---|---|
author | Liu, Zhihan Huang, Yi-Fei |
author_facet | Liu, Zhihan Huang, Yi-Fei |
author_sort | Liu, Zhihan |
collection | PubMed |
description | Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficiency at the gene level, limiting their power to pinpoint deleterious deletions associated with genetic disorders. Here we introduce DosaCNV, a deep multiple-instance learning framework that, for the first time, models deletion pathogenicity jointly with gene haploinsufficiency. By integrating over 30 gene-level features potentially predictive of haploinsufficiency, DosaCNV shows unmatched performance in prioritizing pathogenic deletions associated with a broad spectrum of genetic disorders. Furthermore, DosaCNV outperforms existing methods in predicting gene haploinsufficiency even though it is not trained on known haploinsufficient genes. Finally, DosaCNV leverages a state-of-the-art technique to quantify the contributions of individual gene-level features to haploinsufficiency, allowing for human-understandable explanations of model predictions. Altogether, DosaCNV is a powerful computational tool for both fundamental and translational research. |
format | Online Article Text |
id | pubmed-10491176 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104911762023-09-09 Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity Liu, Zhihan Huang, Yi-Fei bioRxiv Article Copy number losses (deletions) are a major contributor to the etiology of severe genetic disorders. Although haploinsufficient genes play a critical role in deletion pathogenicity, current methods for deletion pathogenicity prediction fail to integrate multiple lines of evidence for haploinsufficiency at the gene level, limiting their power to pinpoint deleterious deletions associated with genetic disorders. Here we introduce DosaCNV, a deep multiple-instance learning framework that, for the first time, models deletion pathogenicity jointly with gene haploinsufficiency. By integrating over 30 gene-level features potentially predictive of haploinsufficiency, DosaCNV shows unmatched performance in prioritizing pathogenic deletions associated with a broad spectrum of genetic disorders. Furthermore, DosaCNV outperforms existing methods in predicting gene haploinsufficiency even though it is not trained on known haploinsufficient genes. Finally, DosaCNV leverages a state-of-the-art technique to quantify the contributions of individual gene-level features to haploinsufficiency, allowing for human-understandable explanations of model predictions. Altogether, DosaCNV is a powerful computational tool for both fundamental and translational research. Cold Spring Harbor Laboratory 2023-10-05 /pmc/articles/PMC10491176/ /pubmed/37693607 http://dx.doi.org/10.1101/2023.08.29.555384 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Liu, Zhihan Huang, Yi-Fei Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
title | Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
title_full | Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
title_fullStr | Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
title_full_unstemmed | Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
title_short | Deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
title_sort | deep multiple-instance learning accurately predicts gene haploinsufficiency and deletion pathogenicity |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491176/ https://www.ncbi.nlm.nih.gov/pubmed/37693607 http://dx.doi.org/10.1101/2023.08.29.555384 |
work_keys_str_mv | AT liuzhihan deepmultipleinstancelearningaccuratelypredictsgenehaploinsufficiencyanddeletionpathogenicity AT huangyifei deepmultipleinstancelearningaccuratelypredictsgenehaploinsufficiencyanddeletionpathogenicity |