VFUSE
收藏DataCite Commons2025-06-01 更新2024-07-27 收录
下载链接:
https://figshare.com/articles/dataset/VFUSE/4798000/3
下载链接
链接失效反馈官方服务:
资源简介:
<br>FUSE is a reproducible, internet-scale corpus, and contains 249,376 unique spreadsheets that were extracted from over 26.83 billion pages. We applied SpreadCluster to the FUSE and manually validated 200 groups that were randomly selected from the clustering result. Based on the validated result, we built the VFUSE corpus, containing 188 evolution groups and 1,143 spreadsheets.VFUSE is published associated with our MSR 2017 paper in May 2017. <br>Liang Xu, Wensheng Dou, Chushu Gao, Jie Wang, Jun Wei, Hua Zhong, Tao Huang. SpreadCluster: Recovering Versioned Spreadsheets through Similarity-Based Clustering. In <i>Proceedings of the 14th International Conference on Mining Software Repositories</i> (<b><i>MSR 2017</i></b>), May 2017.<br>
提供机构:
figshare
创建时间:
2017-03-29



