Cargando…

HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine

Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is...

Descripción completa

Detalles Bibliográficos
Autores principales: Meher, Prabina Kumar, Sahu, Tanmaya Kumar, Raghunandan, K., Gahoi, Shachi, Choudhury, Nalini Kanta, Rao, Atmakuri Ramakrishna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349872/
https://www.ncbi.nlm.nih.gov/pubmed/30692561
http://dx.doi.org/10.1038/s41598-018-37309-9
_version_ 1783390338028666880
author Meher, Prabina Kumar
Sahu, Tanmaya Kumar
Raghunandan, K.
Gahoi, Shachi
Choudhury, Nalini Kanta
Rao, Atmakuri Ramakrishna
author_facet Meher, Prabina Kumar
Sahu, Tanmaya Kumar
Raghunandan, K.
Gahoi, Shachi
Choudhury, Nalini Kanta
Rao, Atmakuri Ramakrishna
author_sort Meher, Prabina Kumar
collection PubMed
description Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is time consuming and expensive. Thus, a supervised learning-based computational model has been proposed in this study, which is first of its kind for the prediction of seven classes of GETS. The cDNA sequences of the genes were initially transformed into numeric features based on the k-mer compositions and then supplied as input to the support vector machine. In the proposed SVM-based model, the prediction occurs in two stages, where a binary classifier in the first stage discriminates the genes involved in conferring the resistance to herbicides from other genes, followed by a multi-class classifier in the second stage that categorizes the predicted herbicide resistant genes in the first stage into any one of the seven resistant classes. Overall classification accuracies were observed to be ~89% and >97% for binary and multi-class classifications respectively. The proposed model confirmed higher accuracy than the homology-based algorithms viz., BLAST and Hidden Markov Model. Besides, the developed computational model achieved ~87% accuracy, while tested with an independent dataset. An online prediction server HRGPred (http://cabgrid.res.in:8080/hrgpred) has also been established to facilitate the prediction of GETS by the scientific community.
format Online
Article
Text
id pubmed-6349872
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63498722019-01-30 HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine Meher, Prabina Kumar Sahu, Tanmaya Kumar Raghunandan, K. Gahoi, Shachi Choudhury, Nalini Kanta Rao, Atmakuri Ramakrishna Sci Rep Article Herbicide resistance (HR) is a major concern for the agricultural producers as well as environmentalists. Resistance to commonly used herbicides are conferred due to mutation(s) in the genes encoding herbicide target sites/proteins (GETS). Identification of these genes through wet-lab experiments is time consuming and expensive. Thus, a supervised learning-based computational model has been proposed in this study, which is first of its kind for the prediction of seven classes of GETS. The cDNA sequences of the genes were initially transformed into numeric features based on the k-mer compositions and then supplied as input to the support vector machine. In the proposed SVM-based model, the prediction occurs in two stages, where a binary classifier in the first stage discriminates the genes involved in conferring the resistance to herbicides from other genes, followed by a multi-class classifier in the second stage that categorizes the predicted herbicide resistant genes in the first stage into any one of the seven resistant classes. Overall classification accuracies were observed to be ~89% and >97% for binary and multi-class classifications respectively. The proposed model confirmed higher accuracy than the homology-based algorithms viz., BLAST and Hidden Markov Model. Besides, the developed computational model achieved ~87% accuracy, while tested with an independent dataset. An online prediction server HRGPred (http://cabgrid.res.in:8080/hrgpred) has also been established to facilitate the prediction of GETS by the scientific community. Nature Publishing Group UK 2019-01-28 /pmc/articles/PMC6349872/ /pubmed/30692561 http://dx.doi.org/10.1038/s41598-018-37309-9 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Meher, Prabina Kumar
Sahu, Tanmaya Kumar
Raghunandan, K.
Gahoi, Shachi
Choudhury, Nalini Kanta
Rao, Atmakuri Ramakrishna
HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
title HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
title_full HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
title_fullStr HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
title_full_unstemmed HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
title_short HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
title_sort hrgpred: prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6349872/
https://www.ncbi.nlm.nih.gov/pubmed/30692561
http://dx.doi.org/10.1038/s41598-018-37309-9
work_keys_str_mv AT meherprabinakumar hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine
AT sahutanmayakumar hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine
AT raghunandank hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine
AT gahoishachi hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine
AT choudhurynalinikanta hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine
AT raoatmakuriramakrishna hrgpredpredictionofherbicideresistantgeneswithkmernucleotidecompositionalfeaturesandsupportvectormachine