AhmedBou/ParsedResumes
收藏Hugging Face2024-05-24 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/AhmedBou/ParsedResumes
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: Category
dtype: string
- name: Resume
dtype: string
- name: Parsed_resume
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 1096035
num_examples: 267
download_size: 281067
dataset_size: 1096035
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
license: apache-2.0
task_categories:
- text-classification
- text-generation
- summarization
language:
- en
size_categories:
- n<1K
---
# Parsed Resume
## Overview
This dataset is designed for both text classification and text generation tasks.
It comprises more that 250 resumes from 25 IT professionals, along with their corresponding parsed information.
The parsed data adheres to a consistent schema, facilitating efficient analysis and utilization for various machine learning applications.
## Dataset Details
- **Parsing schema**:
```json
schema:
{
"name": "John Doe",
"email": "john.doe@example.com",
"phone": "123-456-7890",
"location": "San Francisco, CA",
"summary": "Highly motivated software engineer with 5+ years of experience in web development.",
"skills": {
"technical": ["Java", "Python", "JavaScript", "HTML/CSS"],
"soft": ["Teamwork", "Communication", "Problem-solving"]
},
"links": [
{
"network": "LinkedIn",
"url": "[https://www.linkedin.com/in/john-doe](https://www.linkedin.com/in/john-doe)"
}
]
}
提供机构:
AhmedBou
原始信息汇总
数据集概述
数据集信息
-
特征:
- Category: 字符串
- Resume: 字符串
- Parsed_resume: 字符串
- index_level_0: 整数
-
分割:
- train: 267个样本,大小为1096035字节
-
下载大小: 281067字节
-
数据集大小: 1096035字节
-
配置:
- default: 训练数据路径为
data/train-*
- default: 训练数据路径为
-
许可证: Apache-2.0
-
任务类别:
- 文本分类
- 文本生成
- 摘要生成
-
语言: 英语
-
大小类别: 小于1K
数据集详情
- 解析模式: json { "name": "字符串", "email": "字符串", "phone": "字符串", "location": "字符串", "summary": "字符串", "skills": { "technical": ["字符串"], "soft": ["字符串"] }, "links": [ { "network": "字符串", "url": "字符串" } ] }



