Cargando…

Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning

Experimental studies of Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular and cellular processes in metazoans at large. Since the publication of their genomes, functional genomic investigations have identified genes that are essential...

Descripción completa

Detalles Bibliográficos
Autores principales: Campos, Tulio L., Korhonen, Pasi K., Young, Neil D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150380/
https://www.ncbi.nlm.nih.gov/pubmed/34064595
http://dx.doi.org/10.3390/ijms22105056
_version_ 1783698138338426880
author Campos, Tulio L.
Korhonen, Pasi K.
Young, Neil D.
author_facet Campos, Tulio L.
Korhonen, Pasi K.
Young, Neil D.
author_sort Campos, Tulio L.
collection PubMed
description Experimental studies of Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular and cellular processes in metazoans at large. Since the publication of their genomes, functional genomic investigations have identified genes that are essential or non-essential for survival in each species. Recently, a range of features linked to gene essentiality have been inferred using a machine learning (ML)-based approach, allowing essentiality predictions within a species. Nevertheless, predictions between species are still elusive. Here, we undertake a comprehensive study using ML to discover and validate features of essential genes common to both C. elegans and D. melanogaster. We demonstrate that the cross-species prediction of gene essentiality is possible using a subset of features linked to nucleotide/protein sequences, protein orthology and subcellular localisation, single-cell RNA-seq, and histone methylation markers. Complementary analyses showed that essential genes are enriched for transcription and translation functions and are preferentially located away from heterochromatin regions of C. elegans and D. melanogaster chromosomes. The present work should enable the cross-prediction of essential genes between model and non-model metazoans.
format Online
Article
Text
id pubmed-8150380
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81503802021-05-27 Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning Campos, Tulio L. Korhonen, Pasi K. Young, Neil D. Int J Mol Sci Article Experimental studies of Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular and cellular processes in metazoans at large. Since the publication of their genomes, functional genomic investigations have identified genes that are essential or non-essential for survival in each species. Recently, a range of features linked to gene essentiality have been inferred using a machine learning (ML)-based approach, allowing essentiality predictions within a species. Nevertheless, predictions between species are still elusive. Here, we undertake a comprehensive study using ML to discover and validate features of essential genes common to both C. elegans and D. melanogaster. We demonstrate that the cross-species prediction of gene essentiality is possible using a subset of features linked to nucleotide/protein sequences, protein orthology and subcellular localisation, single-cell RNA-seq, and histone methylation markers. Complementary analyses showed that essential genes are enriched for transcription and translation functions and are preferentially located away from heterochromatin regions of C. elegans and D. melanogaster chromosomes. The present work should enable the cross-prediction of essential genes between model and non-model metazoans. MDPI 2021-05-11 /pmc/articles/PMC8150380/ /pubmed/34064595 http://dx.doi.org/10.3390/ijms22105056 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Campos, Tulio L.
Korhonen, Pasi K.
Young, Neil D.
Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
title Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
title_full Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
title_fullStr Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
title_full_unstemmed Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
title_short Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
title_sort cross-predicting essential genes between two model eukaryotic species using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150380/
https://www.ncbi.nlm.nih.gov/pubmed/34064595
http://dx.doi.org/10.3390/ijms22105056
work_keys_str_mv AT campostuliol crosspredictingessentialgenesbetweentwomodeleukaryoticspeciesusingmachinelearning
AT korhonenpasik crosspredictingessentialgenesbetweentwomodeleukaryoticspeciesusingmachinelearning
AT youngneild crosspredictingessentialgenesbetweentwomodeleukaryoticspeciesusingmachinelearning