srilaxmii-d04/sample_resumes
收藏Hugging Face2024-03-01 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/srilaxmii-d04/sample_resumes
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: ID
dtype: int64
- name: Resume_str
dtype: string
- name: Resume_html
dtype: string
- name: Category
dtype: string
splits:
- name: train
num_bytes: 38342346.1489533
num_examples: 1738
- name: test
num_bytes: 16457646.851046698
num_examples: 746
download_size: 20336204
dataset_size: 54799993.0
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
---
The dataset includes four features: ID (int64), Resume_str (string, possibly the text format of a resume), Resume_html (string, possibly the HTML format of a resume), and Category (string, possibly the classification of the resume). The dataset is divided into a training set and a test set, with 1738 samples in the training set and 746 samples in the test set. The total download size of the dataset is 20336204 bytes, and the total size is 54799993.0 bytes. The dataset configuration is set to default, with the training and test set data files stored in the data/train-* and data/test-* paths respectively.
提供机构:
srilaxmii-d04
原始信息汇总
数据集信息
特征
- ID: 数据类型为
int64 - Resume_str: 数据类型为
string - Resume_html: 数据类型为
string - Category: 数据类型为
string
数据分割
- 训练集 (train):
- 字节数: 38342346.1489533
- 样本数: 1738
- 测试集 (test):
- 字节数: 16457646.851046698
- 样本数: 746
数据大小
- 下载大小: 20336204 字节
- 数据集大小: 54799993.0 字节
配置
- 默认配置 (default):
- 数据文件:
- 训练集路径:
data/train-* - 测试集路径:
data/test-*
- 训练集路径:
- 数据文件:



