Industrial and Professional Occupations Dataset (IPOD)
Source: arXiv | Updated: 2020-04-27 | Indexed: 2024-06-21
Download link:
https://www.github.com/junhua/ipod
Description:
IPOD is a dataset of more than 190,000 job titles created by the Singapore University of Technology and Design. The titles were crawled from more than 56,000 LinkedIn profiles and cover occupations in Asia and the United States. The dataset is intended mainly for occupational data mining and analysis. During construction, the raw data was preprocessed, for example by converting text to lowercase and replacing meaningful punctuation. Applications include job title analysis and occupational named entity recognition, with the broader aim of supporting machine learning approaches to tasks such as job market analysis and prediction.
Provider:
Singapore University of Technology and Design
Created:
2019-10-22
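
The description mentions two preprocessing steps: lowercasing and replacing meaningful punctuation. The sketch below illustrates what such a normalization might look like. It is not the dataset authors' code: the function name `normalize_title`, the `PUNCT_REPLACEMENTS` mapping, and the choice to drop all remaining punctuation are assumptions made for illustration, since the description does not say which symbols are replaced or with what.

```python
import re

# Hypothetical mapping. The description only says "meaningful punctuation"
# is replaced; the actual symbols and replacements are not specified.
PUNCT_REPLACEMENTS = {
    "&": " and ",
    "/": " or ",
    "@": " at ",
}


def normalize_title(title: str) -> str:
    """Lowercase a job title and replace selected punctuation with words."""
    text = title.lower()
    for symbol, word in PUNCT_REPLACEMENTS.items():
        text = text.replace(symbol, word)
    # Assumption: any remaining punctuation carries no meaning and is dropped.
    text = re.sub(r"[^\w\s]", " ", text)
    # Collapse runs of whitespace left behind by the replacements.
    return " ".join(text.split())


if __name__ == "__main__":
    for raw in ["Sr. Software Engineer & Team Lead", "VP, Sales/Marketing @ Acme"]:
        print(normalize_title(raw))
```

Normalizing titles this way keeps symbols that change the meaning of a title (such as `&` joining two roles) as tokens, which may matter for downstream tasks like named entity recognition.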



