Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE
Source: Figshare | Updated: 2016-01-18 | Indexed: 2026-04-29
Download link:
https://figshare.com/articles/dataset/Prediction_of_Drought_Resistant_Genes_in_Arabidopsis_thaliana_Using_SVM_RFE/135134
Description:
Background: Identifying genes with essential roles in resisting environmental stress is of high agronomic importance. Although massive DNA microarray gene expression data have been generated for plants, current computational approaches underutilize these data for studying genotype-trait relationships. Some advanced gene identification methods have been explored for human diseases, but these methods have typically not been converted into publicly available software tools and cannot be applied to plants to identify genes associated with agronomic traits.

Methodology: In this study, we used 22 sets of Arabidopsis thaliana gene expression data from GEO to predict the key genes involved in water tolerance. We applied an SVM-RFE (Support Vector Machine-Recursive Feature Elimination) feature selection method for the prediction. To address small sample sizes, we developed a modified SVM-RFE approach that uses bootstrapping and leave-one-out cross-validation. We also expanded our study to predict genes involved in water susceptibility.

Conclusions: We analyzed the top 10 genes predicted to be involved in water tolerance. Seven of them are connected to known biological processes in drought resistance. We also analyzed the top 100 genes in terms of their biological functions. Our study shows that SVM-RFE is a highly promising method for analyzing plant microarray data to study genotype-phenotype relationships. The software is freely available with source code at http://ccst.jlu.edu.cn/JCSB/RFET/.
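The general idea of SVM-RFE with bootstrap resampling can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' RFET software, and the abstract does not say how bootstrapping and leave-one-out cross-validation are combined, so everything here is an assumption made for illustration: the function names (`train_linear_svm`, `svm_rfe`, `bootstrap_svm_rfe`, `make_toy_data`), the hyperparameters, the rank-averaging scheme, and the synthetic data in which column 0 stands in for an informative gene. The leave-one-out step is not reproduced.

```python
import random

def train_linear_svm(X, y, lam=0.01, epochs=200, lr=0.1):
    """Linear SVM fitted by full-batch subgradient descent on the
    L2-regularised hinge loss. Returns the weight vector only."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [lam * wj for wj in w], 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # only margin violators contribute to the subgradient
                for j in range(d):
                    gw[j] -= yi * xi[j] / n
                gb -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w

def svm_rfe(X, y):
    """Recursive feature elimination: repeatedly refit the SVM and drop the
    feature with the smallest squared weight. Returns feature indices
    ordered from most to least important."""
    remaining = list(range(len(X[0])))
    eliminated = []
    while len(remaining) > 1:
        Xs = [[row[j] for j in remaining] for row in X]
        w = train_linear_svm(Xs, y)
        worst = min(range(len(remaining)), key=lambda k: w[k] ** 2)
        eliminated.append(remaining.pop(worst))
    return remaining + eliminated[::-1]

def bootstrap_svm_rfe(X, y, n_boot=20, seed=0):
    """Assumed aggregation scheme: run SVM-RFE on bootstrap resamples and
    order features by their summed rank (lower is better)."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    rank_sum = [0] * d
    done = 0
    while done < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        if len({y[i] for i in idx}) < 2:
            continue  # a resample needs both classes to train an SVM
        order = svm_rfe([X[i] for i in idx], [y[i] for i in idx])
        for rank, feat in enumerate(order):
            rank_sum[feat] += rank
        done += 1
    return sorted(range(d), key=lambda j: rank_sum[j])

def make_toy_data(seed=1, n=20, d=6):
    """Synthetic stand-in for expression data: column 0 tracks the class
    label, the remaining columns are pure noise."""
    rng = random.Random(seed)
    y = [1 if i % 2 == 0 else -1 for i in range(n)]
    X = [[2.0 * yi + rng.gauss(0, 0.3)] + [rng.gauss(0, 1) for _ in range(d - 1)]
         for yi in y]
    return X, y

X, y = make_toy_data()
ranking = bootstrap_svm_rfe(X, y)
print("features ranked from most to least important:", ranking)
```

Summing ranks across resamples is one simple way to stabilise a ranking when there are few samples. With real microarray data, an established SVM implementation and elimination of features in batches rather than one at a time would be the more practical choice.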
Created:
2016-01-18



