Weni/zeroshot-sft-3.7.0

Name: Weni/zeroshot-sft-3.7.0
Creator: Weni
Published: 2024-07-23 17:51:07
License: 暂无描述

Hugging Face2024-07-23 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/Weni/zeroshot-sft-3.7.0

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个特征，包括上下文、所有类别、输入、输出、输出ID、语言和数据类别。其中，所有类别是一个列表，包含类别、上下文和ID三个子特征。语言和数据类别是分类标签，分别表示语言类型（如葡萄牙语、英语、西班牙语）和数据的情感类别（如积极、消极）。数据集分为训练集，包含21268个样本，总大小为22475455字节。

The dataset includes multiple features such as context, all_classes, input, output, output_id, language, and data_category. Among these, all_classes is a list containing class, context, and id. The language feature is a class label including Portuguese (pt), English (en), and Spanish (es). The data_category is also a class label including positive and negative. The dataset is divided into a training set (train) with 21268 samples, totaling 22475455 bytes. The download size of the dataset is 6753279 bytes.

提供机构：

Weni

原始信息汇总

数据集概述

数据集特征

context: 类型为字符串。
all_classes: 包含以下子特征：
- class: 类型为字符串。
- context: 类型为字符串。
- id: 类型为整数（int64）。
input: 类型为字符串。
output: 类型为字符串。
output_id: 类型为整数（int64）。
language: 类型为分类标签，包含以下类别：
- 0: pt（葡萄牙语）
- 1: en（英语）
- 2: es（西班牙语）
data_category: 类型为分类标签，包含以下类别：
- 0: positive（正面）
- 1: negative（负面）

数据集分割

train: 包含21268个样本，总大小为22475455字节。

数据集大小

下载大小: 6753279字节
数据集总大小: 22475455字节

配置

default: 包含训练数据文件，路径为data/train-*。

5,000+

优质数据集

54 个

任务类型

进入经典数据集