mingfengxue/OccuQuest
收藏Hugging Face2023-10-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mingfengxue/OccuQuest
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
- split: estate
path: data/estate-*
- split: quora
path: data/quora-*
dataset_info:
features:
- name: category
dtype: string
- name: occupation
dtype: string
- name: topic
dtype: string
- name: messages
list:
- name: role
dtype: string
- name: content
dtype: string
splits:
- name: train
num_bytes: 330314955
num_examples: 114090
- name: validation
num_bytes: 7314741
num_examples: 2500
- name: test
num_bytes: 718046
num_examples: 250
- name: estate
num_bytes: 703613
num_examples: 250
- name: quora
num_bytes: 45540
num_examples: 250
download_size: 139074820
dataset_size: 339096895
---
# Dataset Card for "OccuQuest"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
mingfengxue
原始信息汇总
数据集概述
数据集配置
- 默认配置:
- 训练集:路径为
data/train-* - 验证集:路径为
data/validation-* - 测试集:路径为
data/test-* - 房产集:路径为
data/estate-* - Quora集:路径为
data/quora-*
- 训练集:路径为
数据集信息
-
特征:
category:字符串类型occupation:字符串类型topic:字符串类型messages:列表类型,包含以下子特征:role:字符串类型content:字符串类型
-
数据分割:
- 训练集:
- 字节数:330314955
- 样本数:114090
- 验证集:
- 字节数:7314741
- 样本数:2500
- 测试集:
- 字节数:718046
- 样本数:250
- 房产集:
- 字节数:703613
- 样本数:250
- Quora集:
- 字节数:45540
- 样本数:250
- 训练集:
-
数据集大小:
- 下载大小:139074820 字节
- 数据集大小:339096895 字节



