toramaru-u/test
收藏Hugging Face2025-10-16 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/toramaru-u/test
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个配置:cc100-ja_one-thousandth和cc100-ja_one-thousandth_nsp。cc100-ja_one-thousandth配置包含文本数据,而cc100-ja_one-thousandth_nsp配置包含了句子对和相关标签,用于判断两个句子是否是连续的。每个配置都有训练集划分,提供了相应的字节数和示例数信息。
The dataset consists of two configurations: cc100-ja_one-thousandth and cc100-ja_one-thousandth_nsp. The cc100-ja_one-thousandth configuration contains text data, while the cc100-ja_one-thousandth_nsp configuration includes pairs of sentences and related labels for determining if two sentences are consecutive. Each configuration has a training split with provided byte size and number of example information.
提供机构:
toramaru-u



