hcy-43/DeepMentor_PreTrain
收藏Hugging Face2025-05-22 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/hcy-43/DeepMentor_PreTrain
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个文本数据集,包含一个名为text的字符串类型特征。数据集分为训练集,共有约1.575亿个样本,总大小约为174GB。数据集遵循Apache-2.0许可证。
This dataset is a text dataset containing a feature named text of string type. The dataset is split into a training set with a total of approximately 157.5 million samples, with a total size of about 174GB. The dataset is licensed under Apache-2.0.
提供机构:
hcy-43



