five

JobAds Ground Truth Dataset (FWF Project P35783)

收藏
SSH Open MarketPlace2025-04-16 更新2025-04-19 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/iIMgqy
下载链接
链接失效反馈
官方服务:
资源简介:
The file contains manuallty annotated coordinates of 14 985 job advertisements in 29 different newspaper titles from ANNO corpus. Each line contains a link to a page in the ANNO corpus and coordinates of all job advertisements on that page. Each job ad is labeled as one of the following categories: - job_offer for job offers, - job_search for job searches, - service_offer for somebody offering services, - vermittlung for job ads from mediation offices, - heading for a heading indicating that a job ads sections starts, - indicators for more general headings indicating that an ads section starts. Ground truth was created as part of the JobAds (FWF P35783) and published as part of the conference paper "Who Advertises in Newspapers? Data Criticism in Mining Historical Job Ads" presented at the CHR2024 conference. We thank the Austrian National Library for providing data from the ANNO corpus.
创建时间:
2025-04-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作