Cargando…

A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans

Gene fusions have a very important role in the study of cancer development. In this regard, predicting the probability of protein fusion transcripts of developing into a cancer is a very challenging and yet not fully explored research problem. To this date, all the available approaches in literature...

Descripción completa

Detalles Bibliográficos
Autores principales: Lovino, Marta, Urgese, Gianvito, Macii, Enrico, Di Cataldo, Santa, Ficarra, Elisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480333/
https://www.ncbi.nlm.nih.gov/pubmed/30987060
http://dx.doi.org/10.3390/ijms20071645
_version_ 1783413550201438208
author Lovino, Marta
Urgese, Gianvito
Macii, Enrico
Di Cataldo, Santa
Ficarra, Elisa
author_facet Lovino, Marta
Urgese, Gianvito
Macii, Enrico
Di Cataldo, Santa
Ficarra, Elisa
author_sort Lovino, Marta
collection PubMed
description Gene fusions have a very important role in the study of cancer development. In this regard, predicting the probability of protein fusion transcripts of developing into a cancer is a very challenging and yet not fully explored research problem. To this date, all the available approaches in literature try to explain the oncogenic potential of gene fusions based on protein domain analysis, that is cancer-specific and not easy to adapt to newly developed information. In our work, we choose the raw protein sequences as the input baseline, and propose the use of deep learning, and more specifically Convolutional Neural Networks, to infer the oncogenity probability score of gene fusion transcripts and to group them into a number of categories (e.g., oncogenic/not oncogenic). This is an inherently flexible methodology that, unlike previous approaches, can be re-trained with very less efforts on newly available data (for example, from a different cancer). Based on experimental results on a large dataset of pre-annotated gene fusions, our method is able to predict the oncogenity potential of gene fusion transcripts with accuracy of about 72%, which increases to 86% if we consider the only instances that are classified with a high confidence level.
format Online
Article
Text
id pubmed-6480333
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64803332019-04-29 A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans Lovino, Marta Urgese, Gianvito Macii, Enrico Di Cataldo, Santa Ficarra, Elisa Int J Mol Sci Article Gene fusions have a very important role in the study of cancer development. In this regard, predicting the probability of protein fusion transcripts of developing into a cancer is a very challenging and yet not fully explored research problem. To this date, all the available approaches in literature try to explain the oncogenic potential of gene fusions based on protein domain analysis, that is cancer-specific and not easy to adapt to newly developed information. In our work, we choose the raw protein sequences as the input baseline, and propose the use of deep learning, and more specifically Convolutional Neural Networks, to infer the oncogenity probability score of gene fusion transcripts and to group them into a number of categories (e.g., oncogenic/not oncogenic). This is an inherently flexible methodology that, unlike previous approaches, can be re-trained with very less efforts on newly available data (for example, from a different cancer). Based on experimental results on a large dataset of pre-annotated gene fusions, our method is able to predict the oncogenity potential of gene fusion transcripts with accuracy of about 72%, which increases to 86% if we consider the only instances that are classified with a high confidence level. MDPI 2019-04-02 /pmc/articles/PMC6480333/ /pubmed/30987060 http://dx.doi.org/10.3390/ijms20071645 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lovino, Marta
Urgese, Gianvito
Macii, Enrico
Di Cataldo, Santa
Ficarra, Elisa
A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
title A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
title_full A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
title_fullStr A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
title_full_unstemmed A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
title_short A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
title_sort deep learning approach to the screening of oncogenic gene fusions in humans
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480333/
https://www.ncbi.nlm.nih.gov/pubmed/30987060
http://dx.doi.org/10.3390/ijms20071645
work_keys_str_mv AT lovinomarta adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT urgesegianvito adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT maciienrico adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT dicataldosanta adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT ficarraelisa adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT lovinomarta deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT urgesegianvito deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT maciienrico deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT dicataldosanta deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans
AT ficarraelisa deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans