Twitter Job/Employment Corpus
收藏arXiv2019-01-30 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/1901.10619v1
下载链接
链接失效反馈官方服务:
资源简介:
Twitter Job/Employment Corpus是由Golisano College of Computing and Information Sciences Rochester Institute of Technology创建的数据集,包含约700万条推文,其中0.2百万条与工作相关,6.8百万条非工作相关。数据集通过人类在环的监督学习框架进行标注,结合众包和专业知识,旨在提取和分析公共社交媒体中的工作相关话题。该数据集的应用领域广泛,包括公共健康、心理学、雇主分析等,旨在解决工作相关压力监测、员工满意度提升等问题。
The Twitter Job/Employment Corpus is a dataset developed by the Golisano College of Computing and Information Sciences at the Rochester Institute of Technology. It contains approximately 7 million Tweets, among which 0.2 million are work-related and 6.8 million are non-work-related. Annotated via a human-in-the-loop supervised learning framework that combines crowdsourcing and professional expertise, this dataset aims to extract and analyze work-related topics from public social media. It has a wide range of application fields including public health, psychology, employer analysis and more, and is designed to address issues such as work-related stress monitoring and employee satisfaction improvement.
提供机构:
Golisano College of Computing and Information Sciences Rochester Institute of Technology
创建时间:
2019-01-30



