
Datasets for Itemset, Sequence and Tree Mining

Source: NIAID Data Ecosystem (indexed 2026-03-11)
Download link:
https://zenodo.org/record/3785364
Description:
Three datasets are included, for use with itemset, sequence, and tree mining methods.

dense_db.zip: Various real itemset datasets such as chess, connect, mushroom, pumsb, T10I4D100K, and T40I10D100K, among others, used in papers on frequent, closed, and maximal itemset mining. For example: Mohammed J. Zaki and Ching-Jui Hsiao. Efficient algorithms for mining closed itemsets and their lattice structure. IEEE Transactions on Knowledge and Data Engineering, 17(4):462–478, April 2005. doi:10.1109/69.846291. Also: Karam Gouda and Mohammed J. Zaki. Genmax: an efficient algorithm for mining maximal frequent itemsets. Data Mining and Knowledge Discovery: An International Journal, 11(3):223–242, November 2005. doi:10.1007/s10618-005-0002-x.

plandata.zip: Planning dataset for sequence mining. It was used in Mohammed J. Zaki, Neal Lesh, and Mitsunori Ogihara. PLANMINE: predicting plan failures using sequence mining. Artificial Intelligence Review, 14(6):421–446, December 2000. Special issue on Applications of Data Mining. doi:10.1023/A:1006612804250.

cslogs.zip: The CSLOGS data was used for tree mining, e.g., in Mohammed J. Zaki. Efficiently mining frequent trees in a forest: algorithms and applications. IEEE Transactions on Knowledge and Data Engineering, 17(8):1021–1035, August 2005. Special issue on Mining Biological Data. doi:10.1109/TKDE.2005.125.
Created:
2020-05-04