bew/sciqshuffle-preprocessed-1024

Hugging Face | Updated 2025-02-09 | Indexed 2025-02-15

Download link:

https://hf-mirror.com/datasets/bew/sciqshuffle-preprocessed-1024

Description:

This dataset is a collection of tokenized text features for natural language processing tasks. It includes a training set and a validation set, totaling 11,679 samples (the original Chinese-language summary gives 11,779), with a dataset size of approximately 168 MB. Its features are input_ids (the token ID sequence of the text), attention_mask (a mask marking the positions of valid tokens), and labels (labels or target values).
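As a rough illustration of how the three fields relate to one another, the sketch below builds a single record from a toy token sequence. It is not taken from the dataset's own preprocessing code: the helper build_example, the padding ID 0, the fixed length of 1024 (guessed from the dataset name), and the use of -100 as the label value for padded positions (the default ignore_index of PyTorch's cross-entropy loss) are all assumptions made for the example.

```python
from typing import Dict, List

PAD_ID = 0            # hypothetical padding token ID (assumption)
IGNORE_INDEX = -100   # assumption: padded positions are excluded from the loss


def build_example(token_ids: List[int], max_length: int = 1024) -> Dict[str, List[int]]:
    """Truncate or pad a token sequence and derive the three fields."""
    ids = token_ids[:max_length]        # truncate sequences that are too long
    n_pad = max_length - len(ids)       # number of padding positions to add
    return {
        "input_ids": ids + [PAD_ID] * n_pad,
        "attention_mask": [1] * len(ids) + [0] * n_pad,   # 1 = real token, 0 = padding
        "labels": ids + [IGNORE_INDEX] * n_pad,           # copy of the tokens, padding ignored
    }


# Toy token IDs, padded to a short length so the result is easy to read.
example = build_example([101, 2054, 2003, 102], max_length=8)
print(example["input_ids"])
print(example["attention_mask"])
print(example["labels"])
```

In this sketch every field has the same length, the mask is 1 exactly where input_ids holds a real token, and labels mirrors input_ids except at padded positions. Whether the published dataset follows the same conventions would need to be checked against the data itself.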

Provider:

bew
