five

vacancy_skills_data

收藏
Figshare2021-11-24 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/vacancy_skills_data/17075717
下载链接
链接失效反馈
官方服务:
资源简介:
3 datasets representing processed skill-sets for job advertisements obtained from HeadHunter online hiring platform (collected with open API https://dev.hh.ru/) for specialists in Information Technologies (in accordance with classifier https://github.com/hhru/api/blob/master/docs_eng/specializations.md). Description of main fields for vacancies is available via link https://github.com/hhru/api/blob/master/docs_eng/vacancies.md.Datasets:1. "vacancy_skill.csv" - two column dataset, representing vacancy ID ("vacid") and processed skill name [in-demand skill] ("lv");2. "sh_soft_clusters.csv" - three column dataset, representing initial formulations (translated) of "soft" skills ("V1"), the frequency of occurrence in the sample ("V2"), the etalon name [generalized categories of "soft" skills] ("ETALON");3. "jaccard_matrix.csv" - dissimilarity square matrix between processed skill names (1,730 X 1,730) with Jaccard distances (computed by comparison of vacancy ID sets for each skill pair) [the first row and the first column contain skill names].
创建时间:
2021-11-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作