Cargando…

DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes

Intron-containing and intronless genes have different biological properties and statistical characteristics. Here we propose a new computational method to distinguish between intron-containing and intronless gene sequences. Seven feature parameters [Image: see text], [Image: see text], [Image: see t...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Chenglong, Deng, Mo, Zheng, Lu, He, Rong Lucy, Yang, Jie, Yau, Stephen S.-T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103774/
https://www.ncbi.nlm.nih.gov/pubmed/25036549
http://dx.doi.org/10.1371/journal.pone.0101363
_version_ 1782327191377281024
author Yu, Chenglong
Deng, Mo
Zheng, Lu
He, Rong Lucy
Yang, Jie
Yau, Stephen S.-T.
author_facet Yu, Chenglong
Deng, Mo
Zheng, Lu
He, Rong Lucy
Yang, Jie
Yau, Stephen S.-T.
author_sort Yu, Chenglong
collection PubMed
description Intron-containing and intronless genes have different biological properties and statistical characteristics. Here we propose a new computational method to distinguish between intron-containing and intronless gene sequences. Seven feature parameters [Image: see text], [Image: see text], [Image: see text], [Image: see text], [Image: see text], [Image: see text], and [Image: see text] based on detrended fluctuation analysis (DFA) are fully used, and thus we can compute a 7-dimensional feature vector for any given gene sequence to be discriminated. Furthermore, support vector machine (SVM) classifier with Gaussian radial basis kernel function is performed on this feature space to classify the genes into intron-containing and intronless. We investigate the performance of the proposed method in comparison with other state-of-the-art algorithms on biological datasets. The experimental results show that our new method significantly improves the accuracy over those existing techniques.
format Online
Article
Text
id pubmed-4103774
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41037742014-07-21 DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes Yu, Chenglong Deng, Mo Zheng, Lu He, Rong Lucy Yang, Jie Yau, Stephen S.-T. PLoS One Research Article Intron-containing and intronless genes have different biological properties and statistical characteristics. Here we propose a new computational method to distinguish between intron-containing and intronless gene sequences. Seven feature parameters [Image: see text], [Image: see text], [Image: see text], [Image: see text], [Image: see text], [Image: see text], and [Image: see text] based on detrended fluctuation analysis (DFA) are fully used, and thus we can compute a 7-dimensional feature vector for any given gene sequence to be discriminated. Furthermore, support vector machine (SVM) classifier with Gaussian radial basis kernel function is performed on this feature space to classify the genes into intron-containing and intronless. We investigate the performance of the proposed method in comparison with other state-of-the-art algorithms on biological datasets. The experimental results show that our new method significantly improves the accuracy over those existing techniques. Public Library of Science 2014-07-18 /pmc/articles/PMC4103774/ /pubmed/25036549 http://dx.doi.org/10.1371/journal.pone.0101363 Text en © 2014 Yu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yu, Chenglong
Deng, Mo
Zheng, Lu
He, Rong Lucy
Yang, Jie
Yau, Stephen S.-T.
DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes
title DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes
title_full DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes
title_fullStr DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes
title_full_unstemmed DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes
title_short DFA7, a New Method to Distinguish between Intron-Containing and Intronless Genes
title_sort dfa7, a new method to distinguish between intron-containing and intronless genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4103774/
https://www.ncbi.nlm.nih.gov/pubmed/25036549
http://dx.doi.org/10.1371/journal.pone.0101363
work_keys_str_mv AT yuchenglong dfa7anewmethodtodistinguishbetweenintroncontainingandintronlessgenes
AT dengmo dfa7anewmethodtodistinguishbetweenintroncontainingandintronlessgenes
AT zhenglu dfa7anewmethodtodistinguishbetweenintroncontainingandintronlessgenes
AT heronglucy dfa7anewmethodtodistinguishbetweenintroncontainingandintronlessgenes
AT yangjie dfa7anewmethodtodistinguishbetweenintroncontainingandintronlessgenes
AT yaustephenst dfa7anewmethodtodistinguishbetweenintroncontainingandintronlessgenes