five

NetherlandsForensicInstitute/squad-nl-v2.0

收藏
Hugging Face2024-12-20 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/NetherlandsForensicInstitute/squad-nl-v2.0
下载链接
链接失效反馈
官方服务:
资源简介:
SQuAD-NL v2.0数据集是一个用于句子相似性和问答系统任务的数据集,特别适用于评估句子嵌入模型。该数据集包含问题、上下文、分数、ID、标题和答案等特征,其中分数用于指示问题是否在上下文中找到答案。数据集是从SQuAD和XQuAD英文数据集翻译而来,特别是通过Google Translate进行翻译,测试集还进行了人工校对。数据集分为训练集、开发集和测试集,建议仅使用测试集来测试荷兰语句子嵌入模型。

The SQuAD-NL v2.0 dataset is a Dutch translation of the original SQuAD and XQuAD datasets, specifically designed for sentence similarity tasks in Sentence Transformers. The dataset includes features such as question, context, score, id, title, and answers. The score column is added to indicate whether the question has an answer in the context, with 1.0 indicating an answer and 0.0 indicating no answer. The dataset is divided into train, dev, and test splits, with the test split recommended for evaluating Dutch sentence embedding models. The original license for the dataset is CC BY-SA 4.0.
提供机构:
NetherlandsForensicInstitute
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作