bismarck91/enA-frA-tokenised-part2
收藏Hugging Face2025-04-11 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/bismarck91/enA-frA-tokenised-part2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个序列数据集,包含三个字段:input_ids,labels和attention_mask。数据集包含一个训练集,共有500000个样本,数据大小为5042429112字节。提供了一个默认配置用于访问训练数据。
This dataset is a sequence dataset containing three fields: input_ids, labels, and attention_mask. The dataset includes a training set with a total of 500,000 samples, with a data size of 5042429112 bytes. A default configuration is provided for accessing the training data.
提供机构:
bismarck91



