Cargando…
Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients
BACKGROUND: Deciphering the meaning of the human DNA is an outstanding goal which would revolutionize medicine and our way for treating diseases. In recent years, non-coding RNAs have attracted much attention and shown to be functional in part. Yet the importance of these RNAs especially for higher...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892207/ https://www.ncbi.nlm.nih.gov/pubmed/31796020 http://dx.doi.org/10.1186/s12885-019-6338-1 |
_version_ | 1783475984054353920 |
---|---|
author | Smolander, Johannes Stupnikov, Alexey Glazko, Galina Dehmer, Matthias Emmert-Streib, Frank |
author_facet | Smolander, Johannes Stupnikov, Alexey Glazko, Galina Dehmer, Matthias Emmert-Streib, Frank |
author_sort | Smolander, Johannes |
collection | PubMed |
description | BACKGROUND: Deciphering the meaning of the human DNA is an outstanding goal which would revolutionize medicine and our way for treating diseases. In recent years, non-coding RNAs have attracted much attention and shown to be functional in part. Yet the importance of these RNAs especially for higher biological functions remains under investigation. METHODS: In this paper, we analyze RNA-seq data, including non-coding and protein coding RNAs, from lung adenocarcinoma patients, a histologic subtype of non-small-cell lung cancer, with deep learning neural networks and other state-of-the-art classification methods. The purpose of our paper is three-fold. First, we compare the classification performance of different versions of deep belief networks with SVMs, decision trees and random forests. Second, we compare the classification capabilities of protein coding and non-coding RNAs. Third, we study the influence of feature selection on the classification performance. RESULTS: As a result, we find that deep belief networks perform at least competitively to other state-of-the-art classifiers. Second, data from non-coding RNAs perform better than coding RNAs across a number of different classification methods. This demonstrates the equivalence of predictive information as captured by non-coding RNAs compared to protein coding RNAs, conventionally used in computational diagnostics tasks. Third, we find that feature selection has in general a negative effect on the classification performance which means that unfiltered data with all features give the best classification results. CONCLUSIONS: Our study is the first to use ncRNAs beyond miRNAs for the computational classification of cancer and for performing a direct comparison of the classification capabilities of protein coding RNAs and non-coding RNAs. |
format | Online Article Text |
id | pubmed-6892207 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68922072019-12-11 Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients Smolander, Johannes Stupnikov, Alexey Glazko, Galina Dehmer, Matthias Emmert-Streib, Frank BMC Cancer Research Article BACKGROUND: Deciphering the meaning of the human DNA is an outstanding goal which would revolutionize medicine and our way for treating diseases. In recent years, non-coding RNAs have attracted much attention and shown to be functional in part. Yet the importance of these RNAs especially for higher biological functions remains under investigation. METHODS: In this paper, we analyze RNA-seq data, including non-coding and protein coding RNAs, from lung adenocarcinoma patients, a histologic subtype of non-small-cell lung cancer, with deep learning neural networks and other state-of-the-art classification methods. The purpose of our paper is three-fold. First, we compare the classification performance of different versions of deep belief networks with SVMs, decision trees and random forests. Second, we compare the classification capabilities of protein coding and non-coding RNAs. Third, we study the influence of feature selection on the classification performance. RESULTS: As a result, we find that deep belief networks perform at least competitively to other state-of-the-art classifiers. Second, data from non-coding RNAs perform better than coding RNAs across a number of different classification methods. This demonstrates the equivalence of predictive information as captured by non-coding RNAs compared to protein coding RNAs, conventionally used in computational diagnostics tasks. Third, we find that feature selection has in general a negative effect on the classification performance which means that unfiltered data with all features give the best classification results. CONCLUSIONS: Our study is the first to use ncRNAs beyond miRNAs for the computational classification of cancer and for performing a direct comparison of the classification capabilities of protein coding RNAs and non-coding RNAs. BioMed Central 2019-12-03 /pmc/articles/PMC6892207/ /pubmed/31796020 http://dx.doi.org/10.1186/s12885-019-6338-1 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Smolander, Johannes Stupnikov, Alexey Glazko, Galina Dehmer, Matthias Emmert-Streib, Frank Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients |
title | Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients |
title_full | Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients |
title_fullStr | Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients |
title_full_unstemmed | Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients |
title_short | Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients |
title_sort | comparing biological information contained in mrna and non-coding rnas for classification of lung cancer patients |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892207/ https://www.ncbi.nlm.nih.gov/pubmed/31796020 http://dx.doi.org/10.1186/s12885-019-6338-1 |
work_keys_str_mv | AT smolanderjohannes comparingbiologicalinformationcontainedinmrnaandnoncodingrnasforclassificationoflungcancerpatients AT stupnikovalexey comparingbiologicalinformationcontainedinmrnaandnoncodingrnasforclassificationoflungcancerpatients AT glazkogalina comparingbiologicalinformationcontainedinmrnaandnoncodingrnasforclassificationoflungcancerpatients AT dehmermatthias comparingbiologicalinformationcontainedinmrnaandnoncodingrnasforclassificationoflungcancerpatients AT emmertstreibfrank comparingbiologicalinformationcontainedinmrnaandnoncodingrnasforclassificationoflungcancerpatients |