seongil-dn/seongil-dn_mteb-stackexchange-title-body_perc
收藏Hugging Face2025-03-05 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/seongil-dn/seongil-dn_mteb-stackexchange-title-body_perc
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含查询及其相关和不相关样本的数据集。数据集由id(唯一标识符)、query(查询文本)、positives(相关样本列表,包含id、score(相关性得分)、text(样本文本))以及negatives(不相关样本列表,包含id、score、text以及topk_rank(样本在top-k排序中的位置))组成。数据集分为训练集,其中包含228542个示例,大小为3.47GB。
This dataset contains queries along with their relevant and irrelevant samples. The dataset consists of id (unique identifier), query (query text), positives (a list of relevant samples, including id, score (relevance score), text (sample text)), and negatives (a list of irrelevant samples, including id, score, text, and topk_rank (the position of the sample in the top-k ranking)). The dataset is split into a training set, which contains 228,542 examples and is 3.47GB in size.
提供机构:
seongil-dn



