five

Examples comparisons for random partition algorithms

收藏
DataCite Commons2020-09-05 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/dataset/Examples_comparisons_for_random_partition_algorithms/96310/1
下载链接
链接失效反馈
官方服务:
资源简介:
<strong>Topic:</strong> generating uniform random samples from the set of all integer partitions for a given total (N) and a number of parts (S)<br><br><strong>Problem:</strong> existing random integer partitioning functions can take a long time to generate a single partition for a given N (regardless of S). If one is interested in generating random integer partitions for N having S parts, then one must waste time generating random partitions of N and rejecting those not matching S. Those partitions of N matching S can have low probabilities of being random generated (e.g. p &lt; 10^-10 ).<br><br><strong>Solution:</strong> generate a single random partition of N and randomly manipulate it until its length (i.e. number of parts) equals S. General method for manipulating the partition: randomly choose 2 summands, remove them from the partition, conjugate the partition, append the sum of the two removed summand, repeat until partition has S parts. Why? Because randomly perturbing a partition is much faster than randomly generating a new one, and this advantage grows as the time of generating random partitions for N increases. <br><br><strong>Lingering problem:</strong> I can’t mathematically prove why this works to generate uniform random partitions of N having S parts. Not because it's potentially difficult, but because I'm an ecologist and not a mathematician. <strong> What follows:</strong> 486 visual comparisons of 500 random samples generated from the new function derived by myself, Kenneth J. Locey, (red curves) against 500 random samples generated using the random partition function found in the Sage mathematical environment (black curves). <br><br>Kernel density curves (red ones and black ones) are for evenness across the partition. Here, evenness is estimated using Evar, a transform of the variance of log summand values. Evar is standardized to take values between 0.0 (no evenness) and 1.0 (perfect evenness). <strong>N</strong> is the total and <strong>S</strong> is the number of parts. This same close agreement was also found using other statistical characteristics (e.g. median summand, relative size of largest summand). These results reveal that the statistical qualities of the two samples of randomly generated partitions (My algorithm and that of the Sage mathematical environment) are in high agreement. <br><br>There appears to be no systematic bias despite the sensitivity of Evar. So far, only algorithms that, by definition, return uniform random samples have shown such close agreement.<br><br>
提供机构:
figshare
创建时间:
2016-01-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作