five

Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4573312
下载链接
链接失效反馈
官方服务:
资源简介:
We introduce the development of the NewsEye resource, a multilingual dataset for named entity recognition and linking enriched with stances towards named entities. The dataset is comprised of diachronic historical newspaper material published between 1850 and 1950 in French, German, Finnish, and Swedish. Such historical resource is essential in the context of developing and evaluating named entity processing systems. It evenly allows enhancing the forcefulness of existing approaches on historical documents which enable adequate and efficient semantic indexing of historical documents on digital cultural heritage collections. These are the supplementary materials for the SIGIR 2021 paper A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers. If you end up using whole or parts of this resource, please cite this paper: @inproceedings{Hamdi2021multilingual, title={A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers}, author={Hamdi, Ahmed and Boro{\c{s}}, Emanuela and Pontes, Elvys Linhares and Nguyen, Thi Tuyet Hai and Hackl, G{\"u}nter and Moreno, Jose G and Doucet, Antoine}, booktitle={Proceedings of the 44rd International ACM SIGIR Conference on Research and Development in Information Retrieval}, year={2021} } Acknowledgments This work has been supported by the European Union's Horizon 2020 research and innovation programme under grant 770299 [NewsEye](https://www.newseye.eu/).
创建时间:
2021-05-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作