Cargando…
Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning
Experimental studies of Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular and cellular processes in metazoans at large. Since the publication of their genomes, functional genomic investigations have identified genes that are essential...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150380/ https://www.ncbi.nlm.nih.gov/pubmed/34064595 http://dx.doi.org/10.3390/ijms22105056 |
_version_ | 1783698138338426880 |
---|---|
author | Campos, Tulio L. Korhonen, Pasi K. Young, Neil D. |
author_facet | Campos, Tulio L. Korhonen, Pasi K. Young, Neil D. |
author_sort | Campos, Tulio L. |
collection | PubMed |
description | Experimental studies of Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular and cellular processes in metazoans at large. Since the publication of their genomes, functional genomic investigations have identified genes that are essential or non-essential for survival in each species. Recently, a range of features linked to gene essentiality have been inferred using a machine learning (ML)-based approach, allowing essentiality predictions within a species. Nevertheless, predictions between species are still elusive. Here, we undertake a comprehensive study using ML to discover and validate features of essential genes common to both C. elegans and D. melanogaster. We demonstrate that the cross-species prediction of gene essentiality is possible using a subset of features linked to nucleotide/protein sequences, protein orthology and subcellular localisation, single-cell RNA-seq, and histone methylation markers. Complementary analyses showed that essential genes are enriched for transcription and translation functions and are preferentially located away from heterochromatin regions of C. elegans and D. melanogaster chromosomes. The present work should enable the cross-prediction of essential genes between model and non-model metazoans. |
format | Online Article Text |
id | pubmed-8150380 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81503802021-05-27 Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning Campos, Tulio L. Korhonen, Pasi K. Young, Neil D. Int J Mol Sci Article Experimental studies of Caenorhabditis elegans and Drosophila melanogaster have contributed substantially to our understanding of molecular and cellular processes in metazoans at large. Since the publication of their genomes, functional genomic investigations have identified genes that are essential or non-essential for survival in each species. Recently, a range of features linked to gene essentiality have been inferred using a machine learning (ML)-based approach, allowing essentiality predictions within a species. Nevertheless, predictions between species are still elusive. Here, we undertake a comprehensive study using ML to discover and validate features of essential genes common to both C. elegans and D. melanogaster. We demonstrate that the cross-species prediction of gene essentiality is possible using a subset of features linked to nucleotide/protein sequences, protein orthology and subcellular localisation, single-cell RNA-seq, and histone methylation markers. Complementary analyses showed that essential genes are enriched for transcription and translation functions and are preferentially located away from heterochromatin regions of C. elegans and D. melanogaster chromosomes. The present work should enable the cross-prediction of essential genes between model and non-model metazoans. MDPI 2021-05-11 /pmc/articles/PMC8150380/ /pubmed/34064595 http://dx.doi.org/10.3390/ijms22105056 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Campos, Tulio L. Korhonen, Pasi K. Young, Neil D. Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning |
title | Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning |
title_full | Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning |
title_fullStr | Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning |
title_full_unstemmed | Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning |
title_short | Cross-Predicting Essential Genes between Two Model Eukaryotic Species Using Machine Learning |
title_sort | cross-predicting essential genes between two model eukaryotic species using machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150380/ https://www.ncbi.nlm.nih.gov/pubmed/34064595 http://dx.doi.org/10.3390/ijms22105056 |
work_keys_str_mv | AT campostuliol crosspredictingessentialgenesbetweentwomodeleukaryoticspeciesusingmachinelearning AT korhonenpasik crosspredictingessentialgenesbetweentwomodeleukaryoticspeciesusingmachinelearning AT youngneild crosspredictingessentialgenesbetweentwomodeleukaryoticspeciesusingmachinelearning |