sunilsah-447349/Hansel
收藏Hugging Face2025-12-12 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/sunilsah-447349/Hansel
下载链接
链接失效反馈官方服务:
资源简介:
Hansel是一个高质量的人工标注中文实体链接(EL)数据集,专注于尾部实体和新兴实体。测试集包含Few-shot(FS)和zero-shot(ZS)切片,有10K个示例,并使用Wikidata作为相应的知识库。训练和验证集来自Wikipedia超链接,可用于中文EL系统的大规模预训练。
Hansel is a high-quality human-annotated Chinese entity linking (EL) dataset, focusing on tail entities and emerging entities: The test set contains Few-shot (FS) and zero-shot (ZS) slices, has 10K examples and uses Wikidata as the corresponding knowledge base. The training and validation sets are from Wikipedia hyperlinks, useful for large-scale pretraining of Chinese EL systems.
提供机构:
sunilsah-447349



