Cargando…

Computational Detection of piRNA in Human Using Support Vector Machine

BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs), with a length of about 24–32 nucleotides, which have been discovered recently. These ncRNAs play an important role in germline development, transposon silencing, epigenetic regulation, protecting the genome from invasive...

Descripción completa

Detalles Bibliográficos
Autores principales: Seyeddokht, Atefeh, Aslaminejad, Ali Asghar, Masoudi-Nejad, Ali, Nassiri, Mohammadreza, Zahiri, Javad, Sadeghi, Balal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Avicenna Research Institute 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4717465/
https://www.ncbi.nlm.nih.gov/pubmed/26855734
_version_ 1782410663617888256
author Seyeddokht, Atefeh
Aslaminejad, Ali Asghar
Masoudi-Nejad, Ali
Nassiri, Mohammadreza
Zahiri, Javad
Sadeghi, Balal
author_facet Seyeddokht, Atefeh
Aslaminejad, Ali Asghar
Masoudi-Nejad, Ali
Nassiri, Mohammadreza
Zahiri, Javad
Sadeghi, Balal
author_sort Seyeddokht, Atefeh
collection PubMed
description BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs), with a length of about 24–32 nucleotides, which have been discovered recently. These ncRNAs play an important role in germline development, transposon silencing, epigenetic regulation, protecting the genome from invasive transposable elements, and the pathophysiology of diseases such as cancer. piRNA identification is challenging due to the lack of conserved piRNA sequences and structural elements. METHODS: To detect piRNAs, an appropriate feature set, including 8 diverse feature groups to encode each RNA was applied. In addition, a Support Vector Machine (SVM) classifier was used with optimized parameters for RNA classification. According to the obtained results, the classification performance using the optimized feature subsets was much higher than the one in previously published studies. RESULTS: Our results revealed 98% accuracy, Mathew’ correlation coefficient of 98% and 99% specificity in discriminating piRNAs from the other RNAs. Also, the obtained results show that the proposed method outperforms its competitors. CONCLUSION: In this paper, a prediction method was proposed to identify piRNA in human. Also, 48 heterogeneous features (sequence and structural features) were used to encode RNAs. To assess the performance of the method, a benchmark dataset containing 515 piRNAs and 1206 types of other RNAs was constructed. Our method reached the accuracy of 99% on the benchmark dataset. Also, our analysis revealed that the structural features are the most contributing features in piRNA prediction.
format Online
Article
Text
id pubmed-4717465
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Avicenna Research Institute
record_format MEDLINE/PubMed
spelling pubmed-47174652016-02-05 Computational Detection of piRNA in Human Using Support Vector Machine Seyeddokht, Atefeh Aslaminejad, Ali Asghar Masoudi-Nejad, Ali Nassiri, Mohammadreza Zahiri, Javad Sadeghi, Balal Avicenna J Med Biotechnol Original Article BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs), with a length of about 24–32 nucleotides, which have been discovered recently. These ncRNAs play an important role in germline development, transposon silencing, epigenetic regulation, protecting the genome from invasive transposable elements, and the pathophysiology of diseases such as cancer. piRNA identification is challenging due to the lack of conserved piRNA sequences and structural elements. METHODS: To detect piRNAs, an appropriate feature set, including 8 diverse feature groups to encode each RNA was applied. In addition, a Support Vector Machine (SVM) classifier was used with optimized parameters for RNA classification. According to the obtained results, the classification performance using the optimized feature subsets was much higher than the one in previously published studies. RESULTS: Our results revealed 98% accuracy, Mathew’ correlation coefficient of 98% and 99% specificity in discriminating piRNAs from the other RNAs. Also, the obtained results show that the proposed method outperforms its competitors. CONCLUSION: In this paper, a prediction method was proposed to identify piRNA in human. Also, 48 heterogeneous features (sequence and structural features) were used to encode RNAs. To assess the performance of the method, a benchmark dataset containing 515 piRNAs and 1206 types of other RNAs was constructed. Our method reached the accuracy of 99% on the benchmark dataset. Also, our analysis revealed that the structural features are the most contributing features in piRNA prediction. Avicenna Research Institute 2016 /pmc/articles/PMC4717465/ /pubmed/26855734 Text en Copyright© 2016 Avicenna Research Institute This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License which allows users to read, copy, distribute and make derivative works for non-commercial purposes from the material, as long as the author of the original work is cited properly.
spellingShingle Original Article
Seyeddokht, Atefeh
Aslaminejad, Ali Asghar
Masoudi-Nejad, Ali
Nassiri, Mohammadreza
Zahiri, Javad
Sadeghi, Balal
Computational Detection of piRNA in Human Using Support Vector Machine
title Computational Detection of piRNA in Human Using Support Vector Machine
title_full Computational Detection of piRNA in Human Using Support Vector Machine
title_fullStr Computational Detection of piRNA in Human Using Support Vector Machine
title_full_unstemmed Computational Detection of piRNA in Human Using Support Vector Machine
title_short Computational Detection of piRNA in Human Using Support Vector Machine
title_sort computational detection of pirna in human using support vector machine
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4717465/
https://www.ncbi.nlm.nih.gov/pubmed/26855734
work_keys_str_mv AT seyeddokhtatefeh computationaldetectionofpirnainhumanusingsupportvectormachine
AT aslaminejadaliasghar computationaldetectionofpirnainhumanusingsupportvectormachine
AT masoudinejadali computationaldetectionofpirnainhumanusingsupportvectormachine
AT nassirimohammadreza computationaldetectionofpirnainhumanusingsupportvectormachine
AT zahirijavad computationaldetectionofpirnainhumanusingsupportvectormachine
AT sadeghibalal computationaldetectionofpirnainhumanusingsupportvectormachine