josecaloca/multiclass-text-classification-dataset
收藏Hugging Face2025-03-17 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/josecaloca/multiclass-text-classification-dataset
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本数据的数据集,具体应用场景未知。数据集由标题预处理(title_prepared)、标签(label)、输入ID序列(input_ids)和注意力掩码序列(attention_mask)组成。它被划分为训练集(337,935个示例)、验证集(42,242个示例)和测试集(42,242个示例)。数据集的总大小为93,281,293字节,下载大小为32,357,056字节。
This is a dataset containing text data, with an unknown specific application scenario. The dataset consists of title preprocessing (title_prepared), labels (label), input ID sequences (input_ids), and attention mask sequences (attention_mask). It is divided into a training set (337,935 examples), a validation set (42,242 examples), and a test set (42,242 examples). The total size of the dataset is 93,281,293 bytes, and the download size is 32,357,056 bytes.
提供机构:
josecaloca



