Table_5_Accurate Prediction of a Quantitative Trait Using the Genes Controlling the Trait for Gene-Based Breeding in Cotton.XLSX
收藏frontiersin.figshare.com2023-06-01 更新2025-01-22 收录
下载链接:
https://frontiersin.figshare.com/articles/dataset/Table_5_Accurate_Prediction_of_a_Quantitative_Trait_Using_the_Genes_Controlling_the_Trait_for_Gene-Based_Breeding_in_Cotton_XLSX/13206431/1
下载链接
链接失效反馈官方服务:
资源简介:
Accurate phenotype prediction of quantitative traits is paramount to enhanced plant research and breeding. Here, we report the accurate prediction of cotton fiber length, a typical quantitative trait, using 474 cotton (Gossypium ssp.) fiber length (GFL) genes and nine prediction models. When the SNPs/InDels contained in 226 of the GFL genes or the expressions of all 474 GFL genes was used for fiber length prediction, a prediction accuracy of r = 0.83 was obtained, approaching the maximally possible prediction accuracy of a quantitative trait. This has improved by 116%, the prediction accuracies of the fiber length thus far achieved for genomic selection using genome-wide random DNA markers. Moreover, analysis of the GFL genes identified 125 of the GFL genes that are key to accurate prediction of fiber length, with which a prediction accuracy similar to that of all 474 GFL genes was obtained. The fiber lengths of the plants predicted with expressions of the 125 key GFL genes were significantly correlated with those predicted with the SNPs/InDels of the above 226 SNP/InDel-containing GFL genes (r = 0.892, P = 0.000). The prediction accuracies of fiber length using both genic datasets were highly consistent across environments or generations. Finally, we found that a training population consisting of 100–120 plants was sufficient to train a model for accurate prediction of a quantitative trait using the genes controlling the trait. Therefore, the genes controlling a quantitative trait are capable of accurately predicting its phenotype, thereby dramatically improving the ability, accuracy, and efficiency of phenotype prediction and promoting gene-based breeding in cotton and other species.
精确预测数量性状的表型对于提升植物研究和育种至关重要。本研究报道了利用474个棉花(Gossypium ssp.)纤维长度(GFL)基因和九种预测模型,对棉花纤维长度这一典型数量性状的精确预测。当使用包含于226个GFL基因中的SNPs/InDels或所有474个GFL基因的表达进行纤维长度预测时,获得了r = 0.83的预测精度,接近数量性状可能达到的最大预测精度。这一精度相较于以往基于全基因组随机DNA标记进行基因组选择所达到的纤维长度预测精度提高了116%。对GFL基因的分析识别出125个对纤维长度精确预测至关重要的基因,使用这些基因的表达进行预测,其精度与使用所有474个GFL基因的表达相似。预测的125个关键GFL基因表达的纤维长度与上述226个包含SNPs/InDel的GFL基因的SNPs/InDels预测的纤维长度显著相关(r = 0.892,P = 0.000)。使用这两种基因数据集预测纤维长度的精度在环境和世代间高度一致。最终,我们发现由100-120个植物组成的训练群体足以训练一个模型,以利用控制该性状的基因对数量性状进行精确预测。因此,控制数量性状的基因能够准确预测其表型,从而显著提升表型预测的能力、精度和效率,并促进棉花及其他物种的基于基因的育种。
提供机构:
Frontiers



