GamjaPower/baseline-prepare-dataset-500

Name: GamjaPower/baseline-prepare-dataset-500
Creator: GamjaPower
Published: 2024-11-27 10:30:52
License: 暂无描述

Hugging Face2024-11-27 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/GamjaPower/baseline-prepare-dataset-500

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含用于自然语言处理任务的文本数据，具体包括输入ID（input_ids）、令牌类型ID（token_type_ids）、注意力掩码（attention_mask）和下一句标签（next_sentence_label）等特征。数据集分为训练集和验证集，训练集包含1952个样本，验证集包含2259个样本。数据以序列形式存储，适用于模型训练和验证。

This dataset contains text data for natural language processing tasks, specifically including features such as input IDs (input_ids), token type IDs (token_type_ids), attention masks (attention_mask), and next sentence labels (next_sentence_label). The dataset is divided into a training set and a validation set, with the training set containing 1952 samples and the validation set containing 2259 samples. The data is stored in sequence format, suitable for model training and validation.

提供机构：

GamjaPower

5,000+

优质数据集

54 个

任务类型

进入经典数据集