bcai001/c-sts-quora-query-as-condition-group-as-pos
收藏Hugging Face2025-04-06 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/bcai001/c-sts-quora-query-as-condition-group-as-pos
下载链接
链接失效反馈官方服务:
资源简介:
这是一个关于Quora的检索数据集,包含condition和group两个字段。数据集分为训练集、验证集和测试集三个部分,共有11750句语料和3449条查询。平均每个查询对应3.4个语料,总共有大约14000条数据。但是,该数据集存在一些问题,比如query和corpus的质量不高,缺乏label为0的qerl,且数据量较少,不是理想的数据集。
This is a Quora retrieval dataset containing two fields: condition and group. The dataset is divided into three parts: training set, validation set, and test set. There are a total of 11,750 sentences of corpus and 3,449 queries. On average, each query corresponds to 3.4 corpora, with a total of about 14,000 pieces of data. However, there are some issues with this dataset, such as the poor quality of query and corpus, the lack of qerl with label 0, and the small amount of data, making it not an ideal dataset.
提供机构:
bcai001



