kanishka/babylm2-rewritten-clean_hierarchical-adj_211_size-color_adj2-ablation
收藏Hugging Face2025-10-30 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/kanishka/babylm2-rewritten-clean_hierarchical-adj_211_size-color_adj2-ablation
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个文本数据集,包含一个名为text的字符串特征。它被分为训练集和验证集,总共包含约1200万条文本数据。
The dataset is a text dataset containing a feature named text of string type. It is split into a training set and a validation set, totaling approximately 12 million text entries.
提供机构:
kanishka



