Knowledge Unit dataset
收藏arXiv2025-09-30 收录
下载链接:
https://anonymousaaai2019.github.io
下载链接
链接失效反馈官方服务:
资源简介:
该数据集基于Stack Overflow数据集构建,包含了347,372对与Java相关的知识单元。数据集分为四个相关度类别,分布均匀,并划分为训练集(60%)、验证集(10%)和测试集(30%)。规模达到了347,372对,任务是对问题相关度进行预测。
This dataset is constructed based on the Stack Overflow dataset, containing 347,372 pairs of Java-related knowledge units. It is evenly divided into four relevance categories, and split into training set (60%), validation set (10%) and test set (30%). The total size of the dataset amounts to 347,372 pairs, with the task of predicting question relevance.
提供机构:
Shirani et al.



