SBB/ZEFYS2025
Source: Hugging Face. Updated 2025-09-11; indexed 2025-09-13.
Download link:
https://hf-mirror.com/datasets/SBB/ZEFYS2025
Description:
ZEFYS2025 is a dataset for Named Entity Recognition and Entity Linking on historical German newspapers. It contains 100 pages of German-language newspapers published between 1837 and 1940. The dataset is intended for training machine learning models to identify named entities correctly and link them to authority files such as Wikidata. It was compiled by a research project team at the Berlin State Library (SBB), with funding from the Federal Government Commissioner for Culture and the Media. The data is provided in .tsv format and includes the text, named entity tags, and Wikidata links.
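As a rough illustration of working with token-level .tsv annotations of this kind, the sketch below reads tab-separated rows and keeps those tagged as entities. The column names (`TOKEN`, `NE-TAG`, `ID`), the tag scheme, and the sample rows are assumptions made for the example and are not taken from the dataset description; check the actual file header before adapting it.

```python
import csv
import io

# Hypothetical sample rows; the real column names and tag scheme may differ.
SAMPLE_TSV = (
    "TOKEN\tNE-TAG\tID\n"
    "Berlin\tB-LOC\tQ64\n"
    "ist\tO\t-\n"
    "groß\tO\t-\n"
)

def read_entities(handle):
    """Collect (token, tag, wikidata_id) for every row tagged as an entity."""
    # QUOTE_NONE keeps quotation marks in newspaper text as literal characters.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [
        (row["TOKEN"], row["NE-TAG"], row["ID"])
        for row in reader
        if row["NE-TAG"] != "O"  # "O" is assumed to mark non-entity tokens
    ]

entities = read_entities(io.StringIO(SAMPLE_TSV))
print(entities)
```

To read a downloaded file instead of the in-memory sample, pass an open file handle, for example `open(path, encoding="utf-8", newline="")`, where `path` points to one of the dataset's .tsv files.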
Provider:
SBB



