
zenless-lab/jsem

Source: Hugging Face. Updated 2024-12-19; indexed 2024-12-21.
Download link:
https://hf-mirror.com/datasets/zenless-lab/jsem
Description:
JSeM is a benchmark dataset for Japanese semantic testing, designed to test entailment relations between declarative sentences. Each example has three features: a premise, a hypothesis, and a label, where the label is one of three classes: entailment, neutral, or contradiction. The dataset is split into training, validation, and test sets of 12,667, 1,583, and 1,584 examples respectively. It was created by Daisuke Bekki, is in Japanese, and is released under the BSD 3-Clause license. Its design is based on the FraCaS test suite, extended to cover linguistic phenomena specific to Japanese.
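
The record layout and split sizes described above can be sketched in Python. This is a minimal illustration, not the dataset's official tooling: the helper names (`split_fractions`, `is_valid_example`, `load_jsem`) are hypothetical, the sample record is made up, and the loader assumes the repository loads through the Hugging Face `datasets` library without a configuration name. The hosted copy may also store labels as integer class ids rather than strings, in which case the label check would need adapting.

```python
# Split sizes and label names as stated in the description above.
SPLIT_SIZES = {"train": 12667, "validation": 1583, "test": 1584}
LABELS = ("entailment", "neutral", "contradiction")
FEATURES = ("premise", "hypothesis", "label")


def split_fractions(sizes):
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


def is_valid_example(example):
    """Check that a record has the three features and a known string label."""
    has_features = all(key in example for key in FEATURES)
    return has_features and example["label"] in LABELS


def load_jsem():
    """Fetch the dataset from the Hugging Face Hub (needs network access).

    Assumption: the repository loads without a configuration name.
    """
    from datasets import load_dataset  # third-party: pip install datasets

    return load_dataset("zenless-lab/jsem")


if __name__ == "__main__":
    # Hypothetical record for illustration only; not taken from the dataset.
    sample = {
        "premise": "太郎は学生だ。",
        "hypothesis": "太郎は学生だ。",
        "label": "entailment",
    }
    print(is_valid_example(sample))
    for name, share in split_fractions(SPLIT_SIZES).items():
        print(f"{name}: {share:.1%}")
```

The stated sizes sum to 15,834 examples, which works out to roughly an 80/10/10 split between training, validation, and test.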
Provider:
zenless-lab