five

Datasets in Spanish for short/long slots and short/long sentences

收藏
DataCite Commons2022-03-15 更新2024-08-18 收录
下载链接:
https://figshare.com/articles/dataset/Datasets_in_Spanish_for_short_long_slots_and_short_long_sentences/19361825/1
下载链接
链接失效反馈
官方服务:
资源简介:
Datasets generated in Spanish for medication management scenarios.<br>It consists of four datasets:- Short slots and short sentences- Short slots and long sentences- Long slots and short sentences- Long slots and long sentences<br>All the datasets include the train and the development data.<br>The files are the followings:- data: The sentence, the slot tags, and the intent (separated with tabulations).- label: The sentences' intents.- seq.in: The sentences.- seq.out: The slot tags (following the IOB format).

本数据集为面向药物管理场景生成的西班牙语语料数据集。该数据集包含四类子数据集:短槽位(slot)短句数据集、短槽位长句数据集、长槽位短句数据集以及长槽位长句数据集。所有数据集均涵盖训练集与开发集。各文件说明如下: - data文件:存储句子、槽位标签与意图,各项内容以制表符分隔; - label文件:存储各句子对应的意图标签; - seq.in文件:存储原始句子文本; - seq.out文件:存储槽位标签,遵循IOB标注格式。
提供机构:
figshare
创建时间:
2022-03-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作