Billyyy/cleaned-mongolian-dataset
Source: Hugging Face. Updated 2025-01-31. Indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/Billyyy/cleaned-mongolian-dataset
Description:
This is a training dataset of text data. Each sample has two fields: the text content (text) and the corresponding input ID sequence (input_ids). The dataset is 6706 MB in size and contains 997,125 training samples.
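As a rough illustration of the two fields described above, the sketch below streams a few samples with the Hugging Face `datasets` library and checks their shape. Several things here are assumptions rather than facts from this listing: that the split is named `train`, that `input_ids` is stored as a list of integers, and that the repository is reachable from the Hugging Face Hub under the ID shown. The helper names `check_record`, `stream_train_split`, and `preview` are hypothetical.

```python
from itertools import islice
from typing import Any, Dict, Iterable, List

# Repository ID taken from the listing title.
REPO_ID = "Billyyy/cleaned-mongolian-dataset"


def check_record(record: Dict[str, Any]) -> bool:
    """Return True if a sample has the two documented fields.

    Assumption: `text` is a string and `input_ids` is a list of ints.
    The listing names the fields but does not state their types.
    """
    text = record.get("text")
    ids = record.get("input_ids")
    return (
        isinstance(text, str)
        and isinstance(ids, list)
        and all(isinstance(i, int) for i in ids)
    )


def stream_train_split(repo_id: str = REPO_ID) -> Iterable[Dict[str, Any]]:
    """Stream samples lazily instead of downloading all 6706 MB up front.

    Assumption: the split is called "train".
    """
    # Imported here so check_record works without the library installed.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train", streaming=True)


def preview(n: int = 3) -> List[Dict[str, Any]]:
    """Fetch the first n samples and report whether each matches the schema."""
    samples = list(islice(stream_train_split(), n))
    for sample in samples:
        print(check_record(sample), len(sample["input_ids"]), sample["text"][:80])
    return samples
```

Calling `preview()` needs network access and the `datasets` package. Streaming is used here only because of the dataset's size; a regular `load_dataset` call without `streaming=True` would download the full data first.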
Provider:
Billyyy



