ai4data/datause-train
Source: Hugging Face. Updated: 2025-08-15. Indexed: 2025-11-01.
Download link:
https://hf-mirror.com/datasets/ai4data/datause-train
Description:
This dataset contains real-world examples of dataset mentions designed to train and evaluate models for Named Entity Recognition (NER) and Relation Extraction (RE). Each example includes tokenized text and entity span annotations for NER. The dataset is suitable for training multitask models like GLiNER for dataset mention extraction, benchmarking models that jointly learn NER and RE, and testing generalization on synthetic scenarios before domain transfer.
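To make "tokenized text and entity span annotations" concrete, here is a minimal sketch of reading one record. The field names `tokenized_text` and `ner`, the inclusive `[start, end, label]` span layout, and the sample record itself are assumptions modeled on GLiNER-style training data. They are not confirmed by this listing, so check the actual dataset schema before relying on them.

```python
from typing import Dict, List, Tuple

# Hypothetical record for illustration only. Field names and span layout are
# assumptions (GLiNER-style), not taken from the dataset card.
example: Dict[str, list] = {
    "tokenized_text": [
        "We", "use", "the", "Demographic", "and", "Health", "Survey",
        "for", "2018", ".",
    ],
    # Each span is [start_token, end_token, label]; end_token is assumed inclusive.
    "ner": [[3, 6, "dataset"]],
}


def extract_mentions(record: Dict[str, list]) -> List[Tuple[str, str]]:
    """Return (mention_text, label) pairs by slicing tokens with each span."""
    tokens = record["tokenized_text"]
    mentions = []
    for start, end, label in record["ner"]:
        # end + 1 because the end index is treated as inclusive (assumption).
        mentions.append((" ".join(tokens[start:end + 1]), label))
    return mentions


print(extract_mentions(example))
```

If the real schema uses exclusive end indices or different field names, only the slice and the dictionary keys need to change.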
Provider:
ai4data



