five

animacy data for animcay classification

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7589647
下载链接
链接失效反馈
官方服务:
资源简介:
This is the training data for an animacy classifier (see References LREC) 1) gold_actor:           7468 nouns denoting animate entities 2) gold_nonactor        5511 nouns denoting non-animate entities subsets of 1: gold_direct           6897 nouns directly denoting animate entities gold_metonym           587  metonymy trigger nouns gold_female           3738 nouns denoting female actors gold_male           2830 nouns denoting male actors gold_nogender           329 nouns denoting female or male actors (often plural) Format: just lists Note: although some person names are in the data, a separate NER for person names should be used . References: @inproceedings{LREC,            month = {Juni},           author = {Manfred Klenner and Anne G{\"o}hring},        booktitle = {Proceedings of the Language Resources and Evaluation Conference},          address = {Marseille, France},            title = {Animacy Denoting {G}erman Nouns: Annotation and Classification},        publisher = {European Language Resources Association},            pages = {1360--1364},             year = {2022},         language = {english},              url = {https://doi.org/10.5167/uzh-219148},         abstract = {In this paper, we introduce a gold standard for animacy detection comprising almost 14,500 German nouns that might be used to denote either animate entities or non-animate entities. We present inter-annotator agreement of our crowd-sourced seed annotations (9,000 nouns) and discuss the results of machine learning models applied to this data.} }
创建时间:
2023-02-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作