FrancophonIA/E3C-Corpus-2.0.0
收藏Hugging Face2025-03-30 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/FrancophonIA/E3C-Corpus-2.0.0
下载链接
链接失效反馈官方服务:
资源简介:
E3C是一个免费的多语言(英语、法语、意大利语、西班牙语和巴斯克语)语义注释临床叙事语料库,旨在用于语言分析、信息提取系统的基准测试和训练。该语料库包含两种类型的注释:临床实体(如疾病)和时间信息及事实性信息(如事件)。研究者可以使用我们语料库的基准训练和测试分割来开发和测试他们自己的模型。
E3C is a freely available multilingual (English, French, Italian, Spanish, and Basque) corpus of semantically annotated clinical narratives for linguistic analysis, benchmarking, and training of information extraction systems. It consists of two types of annotations: (i) clinical entities (e.g., pathologies), (ii) temporal information and factuality (e.g., events). Researchers can use the benchmark training and test splits of our corpus to develop and test their own models.
提供机构:
FrancophonIA



