aselasperera/CAD-SResumeCredibilityAssessmentDataset

Hugging Face · Updated 2026-03-25 · Indexed 2026-03-29
Download link:
https://hf-mirror.com/datasets/aselasperera/CAD-SResumeCredibilityAssessmentDataset
Resource description:
---
license: cc-by-4.0
task_categories:
- text-classification
language:
- en
tags:
- HR
- Technical
- Resume
pretty_name: CAD-S
size_categories:
- 10K<n<100K
---

# CAD-S: Resume Credibility Assessment Dataset

## 📌 Overview

CAD-S (Credibility Assessment Dataset - Synthetic) is the first openly available dataset designed specifically for **resume credibility assessment** using Natural Language Processing techniques.

The dataset supports supervised learning for detecting inconsistencies between claimed skills and supporting evidence (e.g., projects, experience) within resumes.

This dataset is intended for:

- Resume verification systems
- Natural Language Inference (NLI) tasks
- AI-assisted recruitment tools
- Research in credibility assessment and fraud detection

---

## 🎯 Task Definition

CAD-S frames resume verification as a **Natural Language Inference (NLI)** problem, where:

- **Premise** → Claimed skill or statement
- **Hypothesis** → Supporting evidence from the resume

The model predicts the logical relationship:

- **Entailment** → Evidence supports the claim
- **Contradiction** → Evidence conflicts with the claim
- **Neutral** → No clear relationship

---

## 📊 Dataset Summary

- **Total Resumes:** 10,100
- **Format:** CSV
- **Language:** English
- **Domain:** Technical / IT resumes
- **Experience Level:** 0–5 years (students & early-career professionals)

---

## 📄 Data Format

Each row represents a labeled inference pair:

| Column Name | Description |
|------------|-------------|
| resume_id | Unique identifier for each resume |
| premise | Claimed skill or statement |
| hypothesis | Supporting project or experience description |
| label | Entailment / Contradiction / Neutral |

### Example

```csv
resume_id,premise,hypothesis,label
1,"Proficient in Python","Developed a machine learning model using Python",entailment
2,"Expert in Java","No projects related to Java listed",contradiction
```
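The column layout described above can be read with nothing more than Python's standard library. The following is a minimal sketch, not the dataset author's loader: it inlines the two sample rows from the card so it runs offline, and the `LABEL_TO_ID` encoding and the `load_pairs` helper are hypothetical names introduced here for illustration (the card does not specify an integer encoding or the name of the CSV file).

```python
import csv
import io

# The two sample rows shown on the dataset card, inlined so the sketch runs offline.
SAMPLE_CSV = '''resume_id,premise,hypothesis,label
1,"Proficient in Python","Developed a machine learning model using Python",entailment
2,"Expert in Java","No projects related to Java listed",contradiction
'''

# Hypothetical integer encoding; the card does not define one.
LABEL_TO_ID = {"entailment": 0, "neutral": 1, "contradiction": 2}


def load_pairs(text):
    """Parse CSV text into dicts and attach an integer `label_id` to each row."""
    pairs = []
    for row in csv.DictReader(io.StringIO(text)):
        # Normalize case: the card writes labels both capitalized and lowercase.
        label = row["label"].strip().lower()
        if label not in LABEL_TO_ID:
            raise ValueError(f"unexpected label: {row['label']!r}")
        pairs.append({
            "resume_id": row["resume_id"],
            "premise": row["premise"],
            "hypothesis": row["hypothesis"],
            "label_id": LABEL_TO_ID[label],
        })
    return pairs


pairs = load_pairs(SAMPLE_CSV)
for p in pairs:
    print(p["resume_id"], p["label_id"], p["premise"], "->", p["hypothesis"])
```

To apply the same helper to the downloaded file, pass its contents in place of `SAMPLE_CSV`; rejecting unknown labels early makes it easier to notice rows that do not match the three documented classes.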
Provider:
aselasperera