vacancy_skills_data
收藏Figshare2021-11-24 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/vacancy_skills_data/17075717
下载链接
链接失效反馈官方服务:
资源简介:
3 datasets representing processed skill-sets for job advertisements obtained from HeadHunter online hiring platform (collected with open API https://dev.hh.ru/) for specialists in Information Technologies (in accordance with classifier https://github.com/hhru/api/blob/master/docs_eng/specializations.md). Description of main fields for vacancies is available via link https://github.com/hhru/api/blob/master/docs_eng/vacancies.md.Datasets:1. "vacancy_skill.csv" - two column dataset, representing vacancy ID ("vacid") and processed skill name [in-demand skill] ("lv");2. "sh_soft_clusters.csv" - three column dataset, representing initial formulations (translated) of "soft" skills ("V1"), the frequency of occurrence in the sample ("V2"), the etalon name [generalized categories of "soft" skills] ("ETALON");3. "jaccard_matrix.csv" - dissimilarity square matrix between processed skill names (1,730 X 1,730) with Jaccard distances (computed by comparison of vacancy ID sets for each skill pair) [the first row and the first column contain skill names].
创建时间:
2021-11-24



