kp7742/YALM-pretrain2-envy2-tok
Source: Hugging Face. Updated 2025-04-03. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/kp7742/YALM-pretrain2-envy2-tok
Description:
This is a large-scale natural language processing dataset in which each sample has three fields: input_ids, token_type_ids, and attention_mask. It is suitable for training and testing machine learning models. The training set contains about 163 million samples, and the test set contains 24,000 samples.
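The three fields named above can be checked per sample before training. The following is a minimal sketch, not the provider's own tooling. It assumes that the splits are named `train` and `test`, that the three fields of one sample have equal length, and that `attention_mask` holds only 0 and 1, which is the usual tokenizer convention but is not stated in the listing. The helper names `validate_record` and `stream_records` are hypothetical.

```python
from typing import Mapping, Sequence

# Field names taken from the dataset description.
EXPECTED_FIELDS = ("input_ids", "token_type_ids", "attention_mask")


def validate_record(record: Mapping[str, Sequence[int]]) -> bool:
    """Return True if the record has all three fields with matching lengths.

    Assumption: the fields are equal-length integer sequences and the
    attention mask is binary. Adjust if the data is laid out differently.
    """
    if any(name not in record for name in EXPECTED_FIELDS):
        return False
    lengths = {len(record[name]) for name in EXPECTED_FIELDS}
    if len(lengths) != 1:
        return False
    return all(value in (0, 1) for value in record["attention_mask"])


def stream_records(repo_id: str = "kp7742/YALM-pretrain2-envy2-tok",
                   split: str = "train"):
    """Stream samples from the Hugging Face Hub without a full download.

    Assumption: the split names are "train" and "test". The import is
    local so that validate_record works without the datasets package.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, split=split, streaming=True)
```

With network access and the `datasets` package installed, `next(iter(stream_records(split="test")))` should yield one sample that can be passed to `validate_record`. Streaming is used here because a training split of about 163 million samples is expensive to download in full.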
Provider:
kp7742



