Cargando…
A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans
Gene fusions have a very important role in the study of cancer development. In this regard, predicting the probability of protein fusion transcripts of developing into a cancer is a very challenging and yet not fully explored research problem. To this date, all the available approaches in literature...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480333/ https://www.ncbi.nlm.nih.gov/pubmed/30987060 http://dx.doi.org/10.3390/ijms20071645 |
_version_ | 1783413550201438208 |
---|---|
author | Lovino, Marta Urgese, Gianvito Macii, Enrico Di Cataldo, Santa Ficarra, Elisa |
author_facet | Lovino, Marta Urgese, Gianvito Macii, Enrico Di Cataldo, Santa Ficarra, Elisa |
author_sort | Lovino, Marta |
collection | PubMed |
description | Gene fusions have a very important role in the study of cancer development. In this regard, predicting the probability of protein fusion transcripts of developing into a cancer is a very challenging and yet not fully explored research problem. To this date, all the available approaches in literature try to explain the oncogenic potential of gene fusions based on protein domain analysis, that is cancer-specific and not easy to adapt to newly developed information. In our work, we choose the raw protein sequences as the input baseline, and propose the use of deep learning, and more specifically Convolutional Neural Networks, to infer the oncogenity probability score of gene fusion transcripts and to group them into a number of categories (e.g., oncogenic/not oncogenic). This is an inherently flexible methodology that, unlike previous approaches, can be re-trained with very less efforts on newly available data (for example, from a different cancer). Based on experimental results on a large dataset of pre-annotated gene fusions, our method is able to predict the oncogenity potential of gene fusion transcripts with accuracy of about 72%, which increases to 86% if we consider the only instances that are classified with a high confidence level. |
format | Online Article Text |
id | pubmed-6480333 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-64803332019-04-29 A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans Lovino, Marta Urgese, Gianvito Macii, Enrico Di Cataldo, Santa Ficarra, Elisa Int J Mol Sci Article Gene fusions have a very important role in the study of cancer development. In this regard, predicting the probability of protein fusion transcripts of developing into a cancer is a very challenging and yet not fully explored research problem. To this date, all the available approaches in literature try to explain the oncogenic potential of gene fusions based on protein domain analysis, that is cancer-specific and not easy to adapt to newly developed information. In our work, we choose the raw protein sequences as the input baseline, and propose the use of deep learning, and more specifically Convolutional Neural Networks, to infer the oncogenity probability score of gene fusion transcripts and to group them into a number of categories (e.g., oncogenic/not oncogenic). This is an inherently flexible methodology that, unlike previous approaches, can be re-trained with very less efforts on newly available data (for example, from a different cancer). Based on experimental results on a large dataset of pre-annotated gene fusions, our method is able to predict the oncogenity potential of gene fusion transcripts with accuracy of about 72%, which increases to 86% if we consider the only instances that are classified with a high confidence level. MDPI 2019-04-02 /pmc/articles/PMC6480333/ /pubmed/30987060 http://dx.doi.org/10.3390/ijms20071645 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lovino, Marta Urgese, Gianvito Macii, Enrico Di Cataldo, Santa Ficarra, Elisa A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans |
title | A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans |
title_full | A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans |
title_fullStr | A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans |
title_full_unstemmed | A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans |
title_short | A Deep Learning Approach to the Screening of Oncogenic Gene Fusions in Humans |
title_sort | deep learning approach to the screening of oncogenic gene fusions in humans |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480333/ https://www.ncbi.nlm.nih.gov/pubmed/30987060 http://dx.doi.org/10.3390/ijms20071645 |
work_keys_str_mv | AT lovinomarta adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT urgesegianvito adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT maciienrico adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT dicataldosanta adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT ficarraelisa adeeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT lovinomarta deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT urgesegianvito deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT maciienrico deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT dicataldosanta deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans AT ficarraelisa deeplearningapproachtothescreeningofoncogenicgenefusionsinhumans |