Cargando…
Computational Detection of piRNA in Human Using Support Vector Machine
BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs), with a length of about 24–32 nucleotides, which have been discovered recently. These ncRNAs play an important role in germline development, transposon silencing, epigenetic regulation, protecting the genome from invasive...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Avicenna Research Institute
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4717465/ https://www.ncbi.nlm.nih.gov/pubmed/26855734 |
_version_ | 1782410663617888256 |
---|---|
author | Seyeddokht, Atefeh Aslaminejad, Ali Asghar Masoudi-Nejad, Ali Nassiri, Mohammadreza Zahiri, Javad Sadeghi, Balal |
author_facet | Seyeddokht, Atefeh Aslaminejad, Ali Asghar Masoudi-Nejad, Ali Nassiri, Mohammadreza Zahiri, Javad Sadeghi, Balal |
author_sort | Seyeddokht, Atefeh |
collection | PubMed |
description | BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs), with a length of about 24–32 nucleotides, which have been discovered recently. These ncRNAs play an important role in germline development, transposon silencing, epigenetic regulation, protecting the genome from invasive transposable elements, and the pathophysiology of diseases such as cancer. piRNA identification is challenging due to the lack of conserved piRNA sequences and structural elements. METHODS: To detect piRNAs, an appropriate feature set, including 8 diverse feature groups to encode each RNA was applied. In addition, a Support Vector Machine (SVM) classifier was used with optimized parameters for RNA classification. According to the obtained results, the classification performance using the optimized feature subsets was much higher than the one in previously published studies. RESULTS: Our results revealed 98% accuracy, Mathew’ correlation coefficient of 98% and 99% specificity in discriminating piRNAs from the other RNAs. Also, the obtained results show that the proposed method outperforms its competitors. CONCLUSION: In this paper, a prediction method was proposed to identify piRNA in human. Also, 48 heterogeneous features (sequence and structural features) were used to encode RNAs. To assess the performance of the method, a benchmark dataset containing 515 piRNAs and 1206 types of other RNAs was constructed. Our method reached the accuracy of 99% on the benchmark dataset. Also, our analysis revealed that the structural features are the most contributing features in piRNA prediction. |
format | Online Article Text |
id | pubmed-4717465 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Avicenna Research Institute |
record_format | MEDLINE/PubMed |
spelling | pubmed-47174652016-02-05 Computational Detection of piRNA in Human Using Support Vector Machine Seyeddokht, Atefeh Aslaminejad, Ali Asghar Masoudi-Nejad, Ali Nassiri, Mohammadreza Zahiri, Javad Sadeghi, Balal Avicenna J Med Biotechnol Original Article BACKGROUND: Piwi-interacting RNAs (piRNAs) are small non-coding RNAs (ncRNAs), with a length of about 24–32 nucleotides, which have been discovered recently. These ncRNAs play an important role in germline development, transposon silencing, epigenetic regulation, protecting the genome from invasive transposable elements, and the pathophysiology of diseases such as cancer. piRNA identification is challenging due to the lack of conserved piRNA sequences and structural elements. METHODS: To detect piRNAs, an appropriate feature set, including 8 diverse feature groups to encode each RNA was applied. In addition, a Support Vector Machine (SVM) classifier was used with optimized parameters for RNA classification. According to the obtained results, the classification performance using the optimized feature subsets was much higher than the one in previously published studies. RESULTS: Our results revealed 98% accuracy, Mathew’ correlation coefficient of 98% and 99% specificity in discriminating piRNAs from the other RNAs. Also, the obtained results show that the proposed method outperforms its competitors. CONCLUSION: In this paper, a prediction method was proposed to identify piRNA in human. Also, 48 heterogeneous features (sequence and structural features) were used to encode RNAs. To assess the performance of the method, a benchmark dataset containing 515 piRNAs and 1206 types of other RNAs was constructed. Our method reached the accuracy of 99% on the benchmark dataset. Also, our analysis revealed that the structural features are the most contributing features in piRNA prediction. Avicenna Research Institute 2016 /pmc/articles/PMC4717465/ /pubmed/26855734 Text en Copyright© 2016 Avicenna Research Institute This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License which allows users to read, copy, distribute and make derivative works for non-commercial purposes from the material, as long as the author of the original work is cited properly. |
spellingShingle | Original Article Seyeddokht, Atefeh Aslaminejad, Ali Asghar Masoudi-Nejad, Ali Nassiri, Mohammadreza Zahiri, Javad Sadeghi, Balal Computational Detection of piRNA in Human Using Support Vector Machine |
title | Computational Detection of piRNA in Human Using Support Vector Machine |
title_full | Computational Detection of piRNA in Human Using Support Vector Machine |
title_fullStr | Computational Detection of piRNA in Human Using Support Vector Machine |
title_full_unstemmed | Computational Detection of piRNA in Human Using Support Vector Machine |
title_short | Computational Detection of piRNA in Human Using Support Vector Machine |
title_sort | computational detection of pirna in human using support vector machine |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4717465/ https://www.ncbi.nlm.nih.gov/pubmed/26855734 |
work_keys_str_mv | AT seyeddokhtatefeh computationaldetectionofpirnainhumanusingsupportvectormachine AT aslaminejadaliasghar computationaldetectionofpirnainhumanusingsupportvectormachine AT masoudinejadali computationaldetectionofpirnainhumanusingsupportvectormachine AT nassirimohammadreza computationaldetectionofpirnainhumanusingsupportvectormachine AT zahirijavad computationaldetectionofpirnainhumanusingsupportvectormachine AT sadeghibalal computationaldetectionofpirnainhumanusingsupportvectormachine |