tsch00001/corpus_ka_nsp

Name: tsch00001/corpus_ka_nsp
Creator: tsch00001
Published: 2025-01-18 13:47:06
License: No description available

Hugging Face | Updated 2025-01-18 | Indexed 2025-02-15

Download link:

https://hf-mirror.com/datasets/tsch00001/corpus_ka_nsp

Description:

Each sample in the dataset consists of two sentence sequences and a label, which makes it suitable for training sentence-pair classification models. The training set contains 11,177,720 samples, and the total dataset size is approximately 10.1 GB.
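As an illustration of the record shape described above (two sentences plus a label), here is a minimal sketch of turning one sample into a BERT-style sentence-pair input. The field names `sentence_a`, `sentence_b`, and `label`, the whitespace tokenization, and the `[CLS]`/`[SEP]` markers are assumptions for illustration only; the listing does not name the actual columns or the intended tokenizer.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SentencePair:
    # Hypothetical field names; the listing does not document the real columns.
    sentence_a: str
    sentence_b: str
    label: int


def to_model_input(
    pair: SentencePair, cls: str = "[CLS]", sep: str = "[SEP]"
) -> Tuple[List[str], List[int], int]:
    """Join both sentences into one sequence and mark which segment each token belongs to."""
    # Whitespace splitting stands in for a real subword tokenizer.
    tokens_a = pair.sentence_a.split()
    tokens_b = pair.sentence_b.split()

    tokens = [cls, *tokens_a, sep, *tokens_b, sep]
    # Segment 0 covers [CLS], sentence A and the first [SEP];
    # segment 1 covers sentence B and the final [SEP].
    segment_ids = [0] * (len(tokens_a) + 2) + [1] * (len(tokens_b) + 1)
    return tokens, segment_ids, pair.label


if __name__ == "__main__":
    example = SentencePair("first sentence here", "second sentence", 1)
    tokens, segment_ids, label = to_model_input(example)
    print(tokens)
    print(segment_ids)
    print(label)
```

In practice a subword tokenizer would replace the whitespace split, and the label semantics (which value marks a matching pair) should be checked against the dataset itself.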

Provider:

tsch00001
