llm-slice/babylm-switchboard-preprocessed
收藏Hugging Face2025-06-28 更新2025-11-01 收录
下载链接:
https://hf-mirror.com/datasets/llm-slice/babylm-switchboard-preprocessed
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本信息,分为训练集、验证集和测试集三个部分。训练集包含161740个示例,验证集包含18000个示例,测试集包含20000个示例。数据集的总大小为8732424字节。
The dataset contains text data, split into three parts: training set, validation set, and test set. The training set includes 161,740 examples, the validation set includes 18,000 examples, and the test set includes 20,000 examples. The total size of the dataset is 8,732,424 bytes.
提供机构:
llm-slice



