Cargando…

Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning

BACKGROUND: A virus-infected cell triggers a signalling cascade, resulting in the secretion of interferons (IFNs), which in turn induces the upregulation of the IFN-stimulated genes (ISGs) that play a role in antipathogen host defence. Here, we conducted analyses on large-scale data relating to evol...

Descripción completa

Detalles Bibliográficos
Autores principales: Chai, Haiting, Gu, Quan, Robertson, David L, Hughes, Joseph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9673497/
https://www.ncbi.nlm.nih.gov/pubmed/36399061
http://dx.doi.org/10.1093/gigascience/giac103
_version_ 1784832953054920704
author Chai, Haiting
Gu, Quan
Robertson, David L
Hughes, Joseph
author_facet Chai, Haiting
Gu, Quan
Robertson, David L
Hughes, Joseph
author_sort Chai, Haiting
collection PubMed
description BACKGROUND: A virus-infected cell triggers a signalling cascade, resulting in the secretion of interferons (IFNs), which in turn induces the upregulation of the IFN-stimulated genes (ISGs) that play a role in antipathogen host defence. Here, we conducted analyses on large-scale data relating to evolutionary gene expression, sequence composition, and network properties to elucidate factors associated with the stimulation of human genes in response to IFN-α. RESULTS: We find that ISGs are less evolutionary conserved than genes that are not significantly stimulated in IFN experiments (non-ISGs). ISGs show obvious depletion of GC content in the coding region. This influences the representation of some compositions following the translation process. IFN-repressed human genes (IRGs), downregulated genes in IFN experiments, can have similar properties to the ISGs. Additionally, we design a machine learning framework integrating the support vector machine and novel feature selection algorithm that achieves an area under the receiver operating characteristic curve (AUC) of 0.7455 for ISG prediction. Its application in other IFN systems suggests the similarity between the ISGs triggered by type I and III IFNs. CONCLUSIONS: ISGs have some unique properties that make them different from the non-ISGs. The representation of some properties has a strong correlation with gene expression following IFN-α stimulation, which can be used as a predictive feature in machine learning. Our model predicts several genes as putative ISGs that so far have shown no significant differential expression when stimulated with IFN-α in the cell/tissue types in the available databases. A web server implementing our method is accessible at http://isgpre.cvr.gla.ac.uk/. The docker image at https://hub.docker.com/r/hchai01/isgpre can be downloaded to reproduce the prediction.
format Online
Article
Text
id pubmed-9673497
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96734972022-11-21 Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning Chai, Haiting Gu, Quan Robertson, David L Hughes, Joseph Gigascience Research BACKGROUND: A virus-infected cell triggers a signalling cascade, resulting in the secretion of interferons (IFNs), which in turn induces the upregulation of the IFN-stimulated genes (ISGs) that play a role in antipathogen host defence. Here, we conducted analyses on large-scale data relating to evolutionary gene expression, sequence composition, and network properties to elucidate factors associated with the stimulation of human genes in response to IFN-α. RESULTS: We find that ISGs are less evolutionary conserved than genes that are not significantly stimulated in IFN experiments (non-ISGs). ISGs show obvious depletion of GC content in the coding region. This influences the representation of some compositions following the translation process. IFN-repressed human genes (IRGs), downregulated genes in IFN experiments, can have similar properties to the ISGs. Additionally, we design a machine learning framework integrating the support vector machine and novel feature selection algorithm that achieves an area under the receiver operating characteristic curve (AUC) of 0.7455 for ISG prediction. Its application in other IFN systems suggests the similarity between the ISGs triggered by type I and III IFNs. CONCLUSIONS: ISGs have some unique properties that make them different from the non-ISGs. The representation of some properties has a strong correlation with gene expression following IFN-α stimulation, which can be used as a predictive feature in machine learning. Our model predicts several genes as putative ISGs that so far have shown no significant differential expression when stimulated with IFN-α in the cell/tissue types in the available databases. A web server implementing our method is accessible at http://isgpre.cvr.gla.ac.uk/. The docker image at https://hub.docker.com/r/hchai01/isgpre can be downloaded to reproduce the prediction. Oxford University Press 2022-11-18 /pmc/articles/PMC9673497/ /pubmed/36399061 http://dx.doi.org/10.1093/gigascience/giac103 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Chai, Haiting
Gu, Quan
Robertson, David L
Hughes, Joseph
Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
title Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
title_full Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
title_fullStr Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
title_full_unstemmed Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
title_short Defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
title_sort defining the characteristics of interferon-alpha–stimulated human genes: insight from expression data and machine learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9673497/
https://www.ncbi.nlm.nih.gov/pubmed/36399061
http://dx.doi.org/10.1093/gigascience/giac103
work_keys_str_mv AT chaihaiting definingthecharacteristicsofinterferonalphastimulatedhumangenesinsightfromexpressiondataandmachinelearning
AT guquan definingthecharacteristicsofinterferonalphastimulatedhumangenesinsightfromexpressiondataandmachinelearning
AT robertsondavidl definingthecharacteristicsofinterferonalphastimulatedhumangenesinsightfromexpressiondataandmachinelearning
AT hughesjoseph definingthecharacteristicsofinterferonalphastimulatedhumangenesinsightfromexpressiondataandmachinelearning