five

Data from: The effect of close relatives on unsupervised Bayesian clustering algorithms in population genetic structure analysis

收藏
DataONE2012-06-13 更新2024-06-27 收录
下载链接:
https://search.dataone.org/view/null
下载链接
链接失效反馈
官方服务:
资源简介:
The inference of population genetic structures is essential in many research areas in population genetics, conservation biology and evolutionary biology. Recently, unsupervised Bayesian clustering algorithms have been developed to detect a hidden population structure from genotypic data, assuming among others that individuals taken from the population are unrelated. Because of this hypothesis, markers in a sample taken from a subpopulation can be considered to be in Hardy-Weinberg and linkage equilibrium. However, close relatives might be sampled from the same subpopulation, and consequently, might cause Hardy-Weinberg and linkage disequilibrium and thus bias a population genetic structure analysis. In this study, we used simulated and real data to investigate the impact of close relatives in a sample on Bayesian population structure analysis. We also showed that, when close relatives were identified by a pedigree reconstruction approach and removed, the accuracy of a population genetic structure analysis can be greatly improved. The results indicate that unsupervised Bayesian clustering algorithms cannot be used blindly to detect genetic structure in a sample with closely related individuals. Rather, when closely related individuals are suspected to be frequent in a sample, these individuals should be first identified and removed before conducting a population structure analysis.

种群遗传结构推断在种群遗传学、保护生物学与进化生物学的诸多研究领域中至关重要。近年来,研究者已开发出无监督贝叶斯聚类算法(unsupervised Bayesian clustering algorithms),用于从基因型数据(genotypic data)中检测隐匿的种群遗传结构,其前提假设之一为:从种群中采集的个体均无亲缘关系。基于该假设,从亚种群采集的样本内的遗传标记可被认为处于哈迪-温伯格平衡(Hardy-Weinberg equilibrium)与连锁平衡(linkage equilibrium)状态。然而,若从同一亚种群中采集到近缘个体,则可能引发哈迪-温伯格失衡与连锁不平衡(linkage disequilibrium),进而对种群遗传结构分析造成偏倚。本研究通过模拟数据与真实数据,探究了样本中存在近缘个体时对贝叶斯种群结构分析的影响。研究同时证实,若通过谱系重建方法(pedigree reconstruction approach)识别并移除样本中的近缘个体,种群遗传结构分析的准确性可得到显著提升。本研究结果表明,不可盲目使用无监督贝叶斯聚类算法对包含近缘个体的样本开展遗传结构检测。反之,若怀疑样本中存在大量近缘个体,则应先识别并移除这些个体后再开展种群遗传结构分析。
创建时间:
2012-06-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作