Datasets in Spanish for short/long slots and short/long sentences
收藏DataCite Commons2022-03-15 更新2024-08-18 收录
下载链接:
https://figshare.com/articles/dataset/Datasets_in_Spanish_for_short_long_slots_and_short_long_sentences/19361825/1
下载链接
链接失效反馈官方服务:
资源简介:
Datasets generated in Spanish for medication management scenarios.<br>It consists of four datasets:- Short slots and short sentences- Short slots and long sentences- Long slots and short sentences- Long slots and long sentences<br>All the datasets include the train and the development data.<br>The files are the followings:- data: The sentence, the slot tags, and the intent (separated with tabulations).- label: The sentences' intents.- seq.in: The sentences.- seq.out: The slot tags (following the IOB format).
本数据集为面向药物管理场景生成的西班牙语语料数据集。该数据集包含四类子数据集:短槽位(slot)短句数据集、短槽位长句数据集、长槽位短句数据集以及长槽位长句数据集。所有数据集均涵盖训练集与开发集。各文件说明如下:
- data文件:存储句子、槽位标签与意图,各项内容以制表符分隔;
- label文件:存储各句子对应的意图标签;
- seq.in文件:存储原始句子文本;
- seq.out文件:存储槽位标签,遵循IOB标注格式。
提供机构:
figshare
创建时间:
2022-03-15



