cx-cmu/ClueWeb-Reco
收藏Hugging Face2025-10-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/cx-cmu/ClueWeb-Reco
下载链接
链接失效反馈官方服务:
资源简介:
ClueWeb-Reco数据集是由真实的美国浏览历史映射到ClueWeb22-B数据集的英语子集中的公开可用的网站构建而成。它模拟了真实用户交互,并作为ORBIT基准的隐藏测试集使用。数据集通过顺序留一法分割成验证集和测试集,用于预测用户在浏览序列中的下一个交互项目。
ClueWeb-Reco is a dataset constructed by mapping real-life U.S. browsing history to publicly available websites in the English subset of the ClueWeb22-B dataset. It simulates real user interactions and is used as the hidden test set for the ORBIT benchmark. The dataset is split into validation and test sets using sequential leave-one-out method, for predicting the next user interaction in a browsing sequence.
提供机构:
cx-cmu



