Cargando…

Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE

BACKGROUND: Identifying genes with essential roles in resisting environmental stress rates high in agronomic importance. Although massive DNA microarray gene expression data have been generated for plants, current computational approaches underutilize these data for studying genotype-trait relations...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Yanchun, Zhang, Fan, Wang, Juexin, Joshi, Trupti, Wang, Yan, Xu, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3137602/
https://www.ncbi.nlm.nih.gov/pubmed/21789178
http://dx.doi.org/10.1371/journal.pone.0021750
_version_ 1782208301109346304
author Liang, Yanchun
Zhang, Fan
Wang, Juexin
Joshi, Trupti
Wang, Yan
Xu, Dong
author_facet Liang, Yanchun
Zhang, Fan
Wang, Juexin
Joshi, Trupti
Wang, Yan
Xu, Dong
author_sort Liang, Yanchun
collection PubMed
description BACKGROUND: Identifying genes with essential roles in resisting environmental stress rates high in agronomic importance. Although massive DNA microarray gene expression data have been generated for plants, current computational approaches underutilize these data for studying genotype-trait relationships. Some advanced gene identification methods have been explored for human diseases, but typically these methods have not been converted into publicly available software tools and cannot be applied to plants for identifying genes with agronomic traits. METHODOLOGY: In this study, we used 22 sets of Arabidopsis thaliana gene expression data from GEO to predict the key genes involved in water tolerance. We applied an SVM-RFE (Support Vector Machine-Recursive Feature Elimination) feature selection method for the prediction. To address small sample sizes, we developed a modified approach for SVM-RFE by using bootstrapping and leave-one-out cross-validation. We also expanded our study to predict genes involved in water susceptibility. CONCLUSIONS: We analyzed the top 10 genes predicted to be involved in water tolerance. Seven of them are connected to known biological processes in drought resistance. We also analyzed the top 100 genes in terms of their biological functions. Our study shows that the SVM-RFE method is a highly promising method in analyzing plant microarray data for studying genotype-phenotype relationships. The software is freely available with source code at http://ccst.jlu.edu.cn/JCSB/RFET/.
format Online
Article
Text
id pubmed-3137602
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31376022011-07-25 Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE Liang, Yanchun Zhang, Fan Wang, Juexin Joshi, Trupti Wang, Yan Xu, Dong PLoS One Research Article BACKGROUND: Identifying genes with essential roles in resisting environmental stress rates high in agronomic importance. Although massive DNA microarray gene expression data have been generated for plants, current computational approaches underutilize these data for studying genotype-trait relationships. Some advanced gene identification methods have been explored for human diseases, but typically these methods have not been converted into publicly available software tools and cannot be applied to plants for identifying genes with agronomic traits. METHODOLOGY: In this study, we used 22 sets of Arabidopsis thaliana gene expression data from GEO to predict the key genes involved in water tolerance. We applied an SVM-RFE (Support Vector Machine-Recursive Feature Elimination) feature selection method for the prediction. To address small sample sizes, we developed a modified approach for SVM-RFE by using bootstrapping and leave-one-out cross-validation. We also expanded our study to predict genes involved in water susceptibility. CONCLUSIONS: We analyzed the top 10 genes predicted to be involved in water tolerance. Seven of them are connected to known biological processes in drought resistance. We also analyzed the top 100 genes in terms of their biological functions. Our study shows that the SVM-RFE method is a highly promising method in analyzing plant microarray data for studying genotype-phenotype relationships. The software is freely available with source code at http://ccst.jlu.edu.cn/JCSB/RFET/. Public Library of Science 2011-07-15 /pmc/articles/PMC3137602/ /pubmed/21789178 http://dx.doi.org/10.1371/journal.pone.0021750 Text en Liang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Liang, Yanchun
Zhang, Fan
Wang, Juexin
Joshi, Trupti
Wang, Yan
Xu, Dong
Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
title Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
title_full Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
title_fullStr Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
title_full_unstemmed Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
title_short Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
title_sort prediction of drought-resistant genes in arabidopsis thaliana using svm-rfe
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3137602/
https://www.ncbi.nlm.nih.gov/pubmed/21789178
http://dx.doi.org/10.1371/journal.pone.0021750
work_keys_str_mv AT liangyanchun predictionofdroughtresistantgenesinarabidopsisthalianausingsvmrfe
AT zhangfan predictionofdroughtresistantgenesinarabidopsisthalianausingsvmrfe
AT wangjuexin predictionofdroughtresistantgenesinarabidopsisthalianausingsvmrfe
AT joshitrupti predictionofdroughtresistantgenesinarabidopsisthalianausingsvmrfe
AT wangyan predictionofdroughtresistantgenesinarabidopsisthalianausingsvmrfe
AT xudong predictionofdroughtresistantgenesinarabidopsisthalianausingsvmrfe