
Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes

Source: Figshare; updated 2016-05-05; indexed 2026-04-29
Download link:
https://figshare.com/articles/dataset/Efficient_Coalescent_Simulation_and_Genealogical_Analysis_for_Large_Sample_Sizes/3249088
Description:
A central challenge in the analysis of genetic variation is to provide realistic genome simulation across millions of samples. Present-day coalescent simulations do not scale well, or use approximations that fail to capture important long-range linkage properties. Analysing the results of simulations also presents a substantial challenge, as current methods to store genealogies consume a great deal of space, are slow to parse, and do not take advantage of shared structure in correlated trees. We solve these problems by introducing sparse trees and coalescence records as the key units of genealogical analysis. Using these tools, exact simulation of the coalescent with recombination for chromosome-sized regions over hundreds of thousands of samples is possible, and substantially faster than present-day approximate methods. We can also analyse the results orders of magnitude more quickly than with existing methods.
Created:
2016-05-05