NLP-FBK/e3c-sentences-IT-unrevised
Source: Hugging Face. Updated 2025-02-07; indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/NLP-FBK/e3c-sentences-IT-unrevised
Description:
The dataset contains sentences with entity annotations. Each entity is described by its offsets in the sentence, its text, and its type. The data is split into training (632 examples), validation (101 examples), and test (738 examples) sets and is suited to clinical entity recognition and relation extraction. Files are provided in .tsv format (BIO annotation of clinical entities) and .txt format (PubTator format for relation extraction).
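To illustrate how offset-based entity annotations relate to BIO tags, here is a minimal Python sketch. It is not the dataset's own conversion code: the field names `start`, `end`, and `type`, the end-exclusive character offsets, the whitespace tokenization, the `CLINENTITY` label, and the sample sentence are all assumptions made for illustration.

```python
import re


def bio_tags(sentence, entities):
    """Return (token, tag) pairs for the whitespace tokens of a sentence.

    entities: list of dicts with hypothetical keys 'start', 'end', 'type'
    (character offsets into the sentence, end exclusive).
    """
    tagged = []
    for match in re.finditer(r"\S+", sentence):
        tag = "O"
        for ent in entities:
            # A token is part of an entity if the two character spans overlap.
            if match.start() < ent["end"] and match.end() > ent["start"]:
                # The token that covers the entity's first character gets B-.
                prefix = "B" if match.start() <= ent["start"] else "I"
                tag = f"{prefix}-{ent['type']}"
                break
        tagged.append((match.group(), tag))
    return tagged


# Illustrative sentence and annotation, not taken from the dataset.
sentence = "Il paziente presenta febbre alta."
entities = [{"start": 21, "end": 32, "type": "CLINENTITY"}]

for token, tag in bio_tags(sentence, entities):
    print(f"{token}\t{tag}")
```

Each printed line holds a token and its tag separated by a tab, which is the general shape of a BIO-annotated .tsv file. The actual column layout of the dataset's files may differ.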
Provider:
NLP-FBK



