Cargando…
Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
MOTIVATION: Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9438956/ https://www.ncbi.nlm.nih.gov/pubmed/35861390 http://dx.doi.org/10.1093/bioinformatics/btac519 |
_version_ | 1784781942074376192 |
---|---|
author | Al-Anzi, Bader F Khajah, Mohammad Fakhraldeen, Saja A |
author_facet | Al-Anzi, Bader F Khajah, Mohammad Fakhraldeen, Saja A |
author_sort | Al-Anzi, Bader F |
collection | PubMed |
description | MOTIVATION: Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively interacting or not). Third, most are uninterpretable black box machine learning models. RESULTS: Budding yeast datasets were used to develop high-performance Multinomial Regression (MN) models capable of predicting the impact of single, double and triple genetic disruptions on viability. These models are interpretable and give realistic non-binary predictions and can predict negative genetic interactions (GIs) in triple-gene knockouts. They are based on a limited set of gene features and their predictions are influenced by the probability of target gene participating in molecular complexes or pathways. Furthermore, the MN models have utility in other organisms such as fission yeast, fruit flies and humans, with the single gene fitness MN model being able to distinguish essential genes necessary for cell-autonomous viability from those required for multicellular survival. Finally, our models exceed the performance of previous models, without sacrificing interpretability. AVAILABILITY AND IMPLEMENTATION: All code and processed datasets used to generate results and figures in this manuscript are available at our Github repository at https://github.com/KISRDevelopment/cell_viability_paper. The repository also contains a link to the GI prediction website that lets users search for GIs using the MN models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9438956 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-94389562022-09-06 Predicting and explaining the impact of genetic disruptions and interactions on organismal viability Al-Anzi, Bader F Khajah, Mohammad Fakhraldeen, Saja A Bioinformatics Original Papers MOTIVATION: Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively interacting or not). Third, most are uninterpretable black box machine learning models. RESULTS: Budding yeast datasets were used to develop high-performance Multinomial Regression (MN) models capable of predicting the impact of single, double and triple genetic disruptions on viability. These models are interpretable and give realistic non-binary predictions and can predict negative genetic interactions (GIs) in triple-gene knockouts. They are based on a limited set of gene features and their predictions are influenced by the probability of target gene participating in molecular complexes or pathways. Furthermore, the MN models have utility in other organisms such as fission yeast, fruit flies and humans, with the single gene fitness MN model being able to distinguish essential genes necessary for cell-autonomous viability from those required for multicellular survival. Finally, our models exceed the performance of previous models, without sacrificing interpretability. AVAILABILITY AND IMPLEMENTATION: All code and processed datasets used to generate results and figures in this manuscript are available at our Github repository at https://github.com/KISRDevelopment/cell_viability_paper. The repository also contains a link to the GI prediction website that lets users search for GIs using the MN models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-07-21 /pmc/articles/PMC9438956/ /pubmed/35861390 http://dx.doi.org/10.1093/bioinformatics/btac519 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Al-Anzi, Bader F Khajah, Mohammad Fakhraldeen, Saja A Predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
title | Predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
title_full | Predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
title_fullStr | Predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
title_full_unstemmed | Predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
title_short | Predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
title_sort | predicting and explaining the impact of genetic disruptions and interactions on organismal viability |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9438956/ https://www.ncbi.nlm.nih.gov/pubmed/35861390 http://dx.doi.org/10.1093/bioinformatics/btac519 |
work_keys_str_mv | AT alanzibaderf predictingandexplainingtheimpactofgeneticdisruptionsandinteractionsonorganismalviability AT khajahmohammad predictingandexplainingtheimpactofgeneticdisruptionsandinteractionsonorganismalviability AT fakhraldeensajaa predictingandexplainingtheimpactofgeneticdisruptionsandinteractionsonorganismalviability |