five

PI2I/PI2I

收藏
Hugging Face2026-01-30 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/PI2I/PI2I
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 tags: - recommender-system size_categories: - 1B<n<10B --- # Dataset Overview The dataset presented in our paper *"PI2I: A Personalized Item-Based Collaborative Filtering Retrieval Framework"*, which has been accepted by the **Industry Track of TheWebConf 2026**, comprises **130 million real-world user-item interactions** collected from Taobao. Below is a summary of key statistics (<time,userid,itemid>): | Description | Value | |---------------------------------------------|---------------| | Total number of interactions (rows) | 130,828,023 | | Number of distinct users (`userid`) | 705,647 | | &nbsp;&nbsp;&nbsp;&nbsp;*Note:* Slight discrepancies may exist compared to the values reported in the paper due to hash collisions. | | | Number of distinct items (`itemid`) | 20,351,625 | | &nbsp;&nbsp;&nbsp;&nbsp;*Note:* Slight discrepancies may exist compared to the values reported in the paper due to hash collisions. | | | Time span | 23 days | | Average user interaction count | 185 | | Maximum user interaction count | 20,894 | | Minimum user interaction count | 1 | | Sparsity | 99.9% | | &nbsp;&nbsp;&nbsp;&nbsp;*(calculated as $1 - \frac{130,828,023}{20,351,625 \times 705,647}$)* | | Please cite the following paper if you find our code helpful: @article{wang2026pi2i, title={PI2I: A Personalized Item-Based Collaborative Filtering Retrieval Framework}, author={Wang, Shaoqing and Ma, Yingcai and Fu, Kairui and Wang, Ziyang and Huang, Dunxian and Yan, Yuliang and Wu, Jian}, journal={arXiv preprint arXiv:2601.16815}, year={2026} }
提供机构:
PI2I
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作