rchu233/ni-ood-dataset-20250131-modernbert-train-kmeans-dim128-20250312
收藏Hugging Face2025-03-13 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/rchu233/ni-ood-dataset-20250131-modernbert-train-kmeans-dim128-20250312
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本数据的数据集,其中包括文本内容(text)、标签(labels)、任务名称(task_name)、类别(category)、领域(domain)以及不同规模的聚类标识(cluster_30, cluster_60, cluster_120, cluster_240)。数据集分为训练集和测试集,训练集包含3,075,585个示例,大小为3.71GB,测试集包含484,006个示例,大小为485MB。
This dataset contains text data, including fields such as text content (text), labels (labels), task name (task_name), category (category), domain (domain), and cluster identifiers of different scales (cluster_30, cluster_60, cluster_120, cluster_240). The dataset is divided into training and test sets, with the training set containing 3,075,585 examples and a size of 3.71GB, and the test set containing 484,006 examples and a size of 485MB.
提供机构:
rchu233



