kanishka/babylm2-rewritten-clean-spacy_hierarchical-adj_211_size-color_adj2-ablation
收藏Hugging Face2025-10-30 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/kanishka/babylm2-rewritten-clean-spacy_hierarchical-adj_211_size-color_adj2-ablation
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本特征,分为训练集和验证集两部分,共计约1200万个示例,总大小约为602.75MB。
The dataset includes text features, split into training and validation sets, totaling approximately 12 million examples, with a total size of about 602.75MB.
提供机构:
kanishka



