dev-senolys/categories_dataset
收藏Hugging Face2023-07-31 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/dev-senolys/categories_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: identifier
dtype: string
- name: categories
dtype:
class_label:
names:
'0': Desirability
'1': Engineering
'2': Financing
'3': Legitimacy
'4': Magic Team
'5': Market Impact
'6': Needs
'7': R.O.I.
'8': Securing
'9': Storytelling
'10': Uniqueness
'11': Value Chain
- name: article
dtype: string
- name: themas
sequence: string
- name: categorie_id
dtype: int64
splits:
- name: train_dataset_categories
num_bytes: 3903817
num_examples: 1010
- name: val_dataset_categories
num_bytes: 766808
num_examples: 216
- name: test_dataset_categories
num_bytes: 832229
num_examples: 217
download_size: 3174766
dataset_size: 5502854
---
# Dataset Card for "categories_dataset"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
dev-senolys
原始信息汇总
数据集概述
数据集名称
- 名称: categories_dataset
数据结构
- 特征:
- identifier: 字符串类型
- categories: 分类标签类型,包含以下类别:
- Desirability
- Engineering
- Financing
- Legitimacy
- Magic Team
- Market Impact
- Needs
- R.O.I.
- Securing
- Storytelling
- Uniqueness
- Value Chain
- article: 字符串类型
- themas: 字符串序列类型
- categorie_id: 整数类型(int64)
数据分割
- 训练集:
- 名称: train_dataset_categories
- 大小: 3903817字节
- 样本数: 1010
- 验证集:
- 名称: val_dataset_categories
- 大小: 766808字节
- 样本数: 216
- 测试集:
- 名称: test_dataset_categories
- 大小: 832229字节
- 样本数: 217
数据集大小
- 下载大小: 3174766字节
- 总大小: 5502854字节



