Cargando…
HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349872/ https://www.ncbi.nlm.nih.gov/pubmed/30692561 http://dx.doi.org/10.1038/s41598-018-37309-9 |
_version_ | 1783390338028666880 |
---|---|
author | Meher, Prabina Kumar Sahu, Tanmaya Kumar Raghunandan, K. Gahoi, Shachi Choudhury, Nalini Kanta Rao, Atmakuri Ramakrishna |
author_facet | Meher, Prabina Kumar Sahu, Tanmaya Kumar Raghunandan, K. Gahoi, Shachi Choudhury, Nalini Kanta Rao, Atmakuri Ramakrishna |
author_sort | Meher, Prabina Kumar |
collection | PubMed |
description | Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is time consuming and expensive. Thus, a supervised learning-based computational model has been proposed in this study, which is first of its kind for the prediction of seven classes of GETS. The cDNA sequences of the genes were initially transformed into numeric features based on the k-mer compositions and then supplied as input to the support vector machine. In the proposed SVM-based model, the prediction occurs in two stages, where a binary classifier in the first stage discriminates the genes involved in conferring the resistance to herbicides from other genes, followed by a multi-class classifier in the second stage that categorizes the predicted herbicide resistant genes in the first stage into any one of the seven resistant classes. Overall classification accuracies were observed to be ~89% and >97% for binary and multi-class classifications respectively. The proposed model confirmed higher accuracy than the homology-based algorithms viz., BLAST and Hidden Markov Model. Besides, the developed computational model achieved ~87% accuracy, while tested with an independent dataset. An online prediction server HRGPred (http://cabgrid.res.in:8080/hrgpred) has also been established to facilitate the prediction of GETS by the scientific community. |
format | Online Article Text |
id | pubmed-6349872 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-63498722019-01-30 HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine Meher, Prabina Kumar Sahu, Tanmaya Kumar Raghunandan, K. Gahoi, Shachi Choudhury, Nalini Kanta Rao, Atmakuri Ramakrishna Sci Rep Article Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is time consuming and expensive. Thus, a supervised learning-based computational model has been proposed in this study, which is first of its kind for the prediction of seven classes of GETS. The cDNA sequences of the genes were initially transformed into numeric features based on the k-mer compositions and then supplied as input to the support vector machine. In the proposed SVM-based model, the prediction occurs in two stages, where a binary classifier in the first stage discriminates the genes involved in conferring the resistance to herbicides from other genes, followed by a multi-class classifier in the second stage that categorizes the predicted herbicide resistant genes in the first stage into any one of the seven resistant classes. Overall classification accuracies were observed to be ~89% and >97% for binary and multi-class classifications respectively. The proposed model confirmed higher accuracy than the homology-based algorithms viz., BLAST and Hidden Markov Model. Besides, the developed computational model achieved ~87% accuracy, while tested with an independent dataset. An online prediction server HRGPred (http://cabgrid.res.in:8080/hrgpred) has also been established to facilitate the prediction of GETS by the scientific community. Nature Publishing Group UK 2019-01-28 /pmc/articles/PMC6349872/ /pubmed/30692561 http://dx.doi.org/10.1038/s41598-018-37309-9 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Meher, Prabina Kumar Sahu, Tanmaya Kumar Raghunandan, K. Gahoi, Shachi Choudhury, Nalini Kanta Rao, Atmakuri Ramakrishna HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
title | HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
title_full | HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
title_fullStr | HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
title_full_unstemmed | HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
title_short | HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
title_sort | hrgpred: prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349872/ https://www.ncbi.nlm.nih.gov/pubmed/30692561 http://dx.doi.org/10.1038/s41598-018-37309-9 |
work_keys_str_mv | AT meherprabinakumar hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine AT sahutanmayakumar hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine AT raghunandank hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine AT gahoishachi hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine AT choudhurynalinikanta hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine AT raoatmakuriramakrishna hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine |