Cargando…

voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data

RNA-Seq is a recent and efficient technique that uses the capabilities of next-generation sequencing technology for characterizing and quantifying transcriptomes. One important task using gene-expression data is to identify a small subset of genes that can be used to build diagnostic classifiers par...

Descripción completa

Detalles Bibliográficos
Autores principales: Zararsiz, Gokmen, Goksuluk, Dincer, Klaus, Bernd, Korkmaz, Selcuk, Eldem, Vahap, Karabulut, Erdem, Ozturk, Ahmet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5633036/
https://www.ncbi.nlm.nih.gov/pubmed/29018623
http://dx.doi.org/10.7717/peerj.3890
_version_ 1783269817065668608
author Zararsiz, Gokmen
Goksuluk, Dincer
Klaus, Bernd
Korkmaz, Selcuk
Eldem, Vahap
Karabulut, Erdem
Ozturk, Ahmet
author_facet Zararsiz, Gokmen
Goksuluk, Dincer
Klaus, Bernd
Korkmaz, Selcuk
Eldem, Vahap
Karabulut, Erdem
Ozturk, Ahmet
author_sort Zararsiz, Gokmen
collection PubMed
description RNA-Seq is a recent and efficient technique that uses the capabilities of next-generation sequencing technology for characterizing and quantifying transcriptomes. One important task using gene-expression data is to identify a small subset of genes that can be used to build diagnostic classifiers particularly for cancer diseases. Microarray based classifiers are not directly applicable to RNA-Seq data due to its discrete nature. Overdispersion is another problem that requires careful modeling of mean and variance relationship of the RNA-Seq data. In this study, we present voomDDA classifiers: variance modeling at the observational level (voom) extensions of the nearest shrunken centroids (NSC) and the diagonal discriminant classifiers. VoomNSC is one of these classifiers and brings voom and NSC approaches together for the purpose of gene-expression based classification. For this purpose, we propose weighted statistics and put these weighted statistics into the NSC algorithm. The VoomNSC is a sparse classifier that models the mean-variance relationship using the voom method and incorporates voom’s precision weights into the NSC classifier via weighted statistics. A comprehensive simulation study was designed and four real datasets are used for performance assessment. The overall results indicate that voomNSC performs as the sparsest classifier. It also provides the most accurate results together with power-transformed Poisson linear discriminant analysis, rlog transformed support vector machines and random forests algorithms. In addition to prediction purposes, the voomNSC classifier can be used to identify the potential diagnostic biomarkers for a condition of interest. Through this work, statistical learning methods proposed for microarrays can be reused for RNA-Seq data. An interactive web application is freely available at http://www.biosoft.hacettepe.edu.tr/voomDDA/.
format Online
Article
Text
id pubmed-5633036
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-56330362017-10-10 voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data Zararsiz, Gokmen Goksuluk, Dincer Klaus, Bernd Korkmaz, Selcuk Eldem, Vahap Karabulut, Erdem Ozturk, Ahmet PeerJ Bioinformatics RNA-Seq is a recent and efficient technique that uses the capabilities of next-generation sequencing technology for characterizing and quantifying transcriptomes. One important task using gene-expression data is to identify a small subset of genes that can be used to build diagnostic classifiers particularly for cancer diseases. Microarray based classifiers are not directly applicable to RNA-Seq data due to its discrete nature. Overdispersion is another problem that requires careful modeling of mean and variance relationship of the RNA-Seq data. In this study, we present voomDDA classifiers: variance modeling at the observational level (voom) extensions of the nearest shrunken centroids (NSC) and the diagonal discriminant classifiers. VoomNSC is one of these classifiers and brings voom and NSC approaches together for the purpose of gene-expression based classification. For this purpose, we propose weighted statistics and put these weighted statistics into the NSC algorithm. The VoomNSC is a sparse classifier that models the mean-variance relationship using the voom method and incorporates voom’s precision weights into the NSC classifier via weighted statistics. A comprehensive simulation study was designed and four real datasets are used for performance assessment. The overall results indicate that voomNSC performs as the sparsest classifier. It also provides the most accurate results together with power-transformed Poisson linear discriminant analysis, rlog transformed support vector machines and random forests algorithms. In addition to prediction purposes, the voomNSC classifier can be used to identify the potential diagnostic biomarkers for a condition of interest. Through this work, statistical learning methods proposed for microarrays can be reused for RNA-Seq data. An interactive web application is freely available at http://www.biosoft.hacettepe.edu.tr/voomDDA/. PeerJ Inc. 2017-10-06 /pmc/articles/PMC5633036/ /pubmed/29018623 http://dx.doi.org/10.7717/peerj.3890 Text en ©2017 Zararsiz et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Zararsiz, Gokmen
Goksuluk, Dincer
Klaus, Bernd
Korkmaz, Selcuk
Eldem, Vahap
Karabulut, Erdem
Ozturk, Ahmet
voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data
title voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data
title_full voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data
title_fullStr voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data
title_full_unstemmed voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data
title_short voomDDA: discovery of diagnostic biomarkers and classification of RNA-seq data
title_sort voomdda: discovery of diagnostic biomarkers and classification of rna-seq data
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5633036/
https://www.ncbi.nlm.nih.gov/pubmed/29018623
http://dx.doi.org/10.7717/peerj.3890
work_keys_str_mv AT zararsizgokmen voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata
AT goksulukdincer voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata
AT klausbernd voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata
AT korkmazselcuk voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata
AT eldemvahap voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata
AT karabuluterdem voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata
AT ozturkahmet voomddadiscoveryofdiagnosticbiomarkersandclassificationofrnaseqdata