Cargando…

Predicting and explaining the impact of genetic disruptions and interactions on organismal viability

MOTIVATION: Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively i...

Descripción completa

Detalles Bibliográficos
Autores principales: Al-Anzi, Bader F, Khajah, Mohammad, Fakhraldeen, Saja A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9438956/
https://www.ncbi.nlm.nih.gov/pubmed/35861390
http://dx.doi.org/10.1093/bioinformatics/btac519
_version_ 1784781942074376192
author Al-Anzi, Bader F
Khajah, Mohammad
Fakhraldeen, Saja A
author_facet Al-Anzi, Bader F
Khajah, Mohammad
Fakhraldeen, Saja A
author_sort Al-Anzi, Bader F
collection PubMed
description MOTIVATION: Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively interacting or not). Third, most are uninterpretable black box machine learning models. RESULTS: Budding yeast datasets were used to develop high-performance Multinomial Regression (MN) models capable of predicting the impact of single, double and triple genetic disruptions on viability. These models are interpretable and give realistic non-binary predictions and can predict negative genetic interactions (GIs) in triple-gene knockouts. They are based on a limited set of gene features and their predictions are influenced by the probability of target gene participating in molecular complexes or pathways. Furthermore, the MN models have utility in other organisms such as fission yeast, fruit flies and humans, with the single gene fitness MN model being able to distinguish essential genes necessary for cell-autonomous viability from those required for multicellular survival. Finally, our models exceed the performance of previous models, without sacrificing interpretability. AVAILABILITY AND IMPLEMENTATION: All code and processed datasets used to generate results and figures in this manuscript are available at our Github repository at https://github.com/KISRDevelopment/cell_viability_paper. The repository also contains a link to the GI prediction website that lets users search for GIs using the MN models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9438956
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-94389562022-09-06 Predicting and explaining the impact of genetic disruptions and interactions on organismal viability Al-Anzi, Bader F Khajah, Mohammad Fakhraldeen, Saja A Bioinformatics Original Papers MOTIVATION: Existing computational models can predict single- and double-mutant fitness but they do have limitations. First, they are often tested via evaluation metrics that are inappropriate for imbalanced datasets. Second, all of them only predict a binary outcome (viable or not, and negatively interacting or not). Third, most are uninterpretable black box machine learning models. RESULTS: Budding yeast datasets were used to develop high-performance Multinomial Regression (MN) models capable of predicting the impact of single, double and triple genetic disruptions on viability. These models are interpretable and give realistic non-binary predictions and can predict negative genetic interactions (GIs) in triple-gene knockouts. They are based on a limited set of gene features and their predictions are influenced by the probability of target gene participating in molecular complexes or pathways. Furthermore, the MN models have utility in other organisms such as fission yeast, fruit flies and humans, with the single gene fitness MN model being able to distinguish essential genes necessary for cell-autonomous viability from those required for multicellular survival. Finally, our models exceed the performance of previous models, without sacrificing interpretability. AVAILABILITY AND IMPLEMENTATION: All code and processed datasets used to generate results and figures in this manuscript are available at our Github repository at https://github.com/KISRDevelopment/cell_viability_paper. The repository also contains a link to the GI prediction website that lets users search for GIs using the MN models. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-07-21 /pmc/articles/PMC9438956/ /pubmed/35861390 http://dx.doi.org/10.1093/bioinformatics/btac519 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Al-Anzi, Bader F
Khajah, Mohammad
Fakhraldeen, Saja A
Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
title Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
title_full Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
title_fullStr Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
title_full_unstemmed Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
title_short Predicting and explaining the impact of genetic disruptions and interactions on organismal viability
title_sort predicting and explaining the impact of genetic disruptions and interactions on organismal viability
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9438956/
https://www.ncbi.nlm.nih.gov/pubmed/35861390
http://dx.doi.org/10.1093/bioinformatics/btac519
work_keys_str_mv AT alanzibaderf predictingandexplainingtheimpactofgeneticdisruptionsandinteractionsonorganismalviability
AT khajahmohammad predictingandexplainingtheimpactofgeneticdisruptionsandinteractionsonorganismalviability
AT fakhraldeensajaa predictingandexplainingtheimpactofgeneticdisruptionsandinteractionsonorganismalviability