surya-narayanan/labeled_aux_train_57_labels

Name: surya-narayanan/labeled_aux_train_57_labels
Creator: surya-narayanan
Published: 2024-06-26 00:24:18
License: No description available

Source: Hugging Face | Updated: 2024-06-26 | Indexed: 2024-06-29

Download link:

https://hf-mirror.com/datasets/surya-narayanan/labeled_aux_train_57_labels

Resource overview:

The dataset contains four features: question, choices, answer, and subject. The question is a string, choices is a sequence of strings, the answer is a class label corresponding to options A, B, C, and D, and the subject is a string. The dataset has a single split named auxiliary_train, which contains 99,842 samples. The download size is 47,643,581 bytes, and the total dataset size is 162,632,158 bytes.

Provided by:

surya-narayanan

Summary of original information

Dataset overview

Dataset information

Features

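question: the question text; data type is string.
choices: the list of answer options; data type is a sequence of strings.
answer: the correct answer; data type is a class label with the following label names:
- 0: A
- 1: B
- 2: C
- 3: D
subject: the subject; data type is string.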
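
Because answer is stored as an integer class label rather than a letter, a reader usually has to map it back to A through D before displaying a record. The following is a minimal sketch of that mapping, assuming each record is a plain dict with the four fields listed above. The helper names (answer_letter, format_record) and the sample record are hypothetical and invented for illustration; the record is not taken from the dataset.

```python
# Label names copied from the feature list above: index -> letter.
ANSWER_LABELS = ["A", "B", "C", "D"]


def answer_letter(answer: int) -> str:
    """Convert the integer class label (0-3) into its letter (A-D)."""
    if not 0 <= answer < len(ANSWER_LABELS):
        raise ValueError(f"answer index out of range: {answer}")
    return ANSWER_LABELS[answer]


def format_record(record: dict) -> str:
    """Render one record as a lettered multiple-choice question."""
    lines = [f"[{record['subject']}] {record['question']}"]
    for letter, choice in zip(ANSWER_LABELS, record["choices"]):
        lines.append(f"{letter}. {choice}")
    lines.append(f"Answer: {answer_letter(record['answer'])}")
    return "\n".join(lines)


# Hypothetical record, invented for illustration; not taken from the dataset.
example = {
    "question": "What is 2 + 2?",
    "choices": ["3", "4", "5", "6"],
    "answer": 1,
    "subject": "arithmetic",
}

print(format_record(example))
```

The range check is a design choice of this sketch: it fails loudly on an unexpected label instead of silently printing the wrong letter.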

Splits

auxiliary_train: auxiliary training set; contains 99,842 samples and occupies 162,632,158 bytes.

Dataset size

Download size: 47,643,581 bytes
Dataset size: 162,632,158 bytes
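
The figures above imply roughly 1.6 KB per sample and a download that is about 29% of the dataset size. The short calculation below reproduces that arithmetic from the listed numbers only. The constant names are chosen for this sketch, and the listing does not say why the download is smaller (compressed storage is a plausible but unconfirmed reason).

```python
# Figures copied from the listing above.
NUM_SAMPLES = 99_842
DATASET_BYTES = 162_632_158
DOWNLOAD_BYTES = 47_643_581

# Average size of one sample within the dataset.
avg_bytes_per_sample = DATASET_BYTES / NUM_SAMPLES

# Fraction of the dataset size that has to be downloaded.
download_ratio = DOWNLOAD_BYTES / DATASET_BYTES

print(f"average bytes per sample: {avg_bytes_per_sample:.1f}")
print(f"download / dataset size:  {download_ratio:.3f}")
```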

Configuration

config_name: default
data_files:
  - split: auxiliary_train
    path: data/auxiliary_train-*
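
The configuration above can be read as "the default config has one split, auxiliary_train, backed by every file matching data/auxiliary_train-*". The sketch below shows one way this might be used with the Hugging Face `datasets` library, assuming that package is installed and the repository is reachable. The helper names and the shard file name are hypothetical, and the `fnmatch` check only approximates how the Hub resolves the pattern.

```python
from fnmatch import fnmatch

# Values copied from the listing above.
REPO_ID = "surya-narayanan/labeled_aux_train_57_labels"
CONFIG_NAME = "default"
SPLIT = "auxiliary_train"
DATA_FILES_PATTERN = "data/auxiliary_train-*"


def belongs_to_split(path: str) -> bool:
    """Return True if a repository file path matches the split's glob pattern."""
    return fnmatch(path, DATA_FILES_PATTERN)


def load_auxiliary_train():
    """Download and return the auxiliary_train split (needs network access)."""
    # Imported here so the rest of the sketch runs without the package installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, CONFIG_NAME, split=SPLIT)


if __name__ == "__main__":
    # Hypothetical shard name, used only to illustrate the glob match.
    print(belongs_to_split("data/auxiliary_train-00000-of-00001.parquet"))
```

Calling load_auxiliary_train() triggers the actual download, so it is kept separate from the pattern check, which runs offline.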
