ojo/orpheus_hui_dataset_tokenised
收藏Hugging Face2025-04-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/ojo/orpheus_hui_dataset_tokenised
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个特征字段:input_ids、labels和attention_mask,分别代表输入ID序列、标签序列和注意力掩码序列。数据集分为训练集,包含1505个示例,总大小为15433811字节。数据集的下载大小为4975800字节。具体的应用场景和数据集内容未在README中说明。
The dataset includes three feature fields: input_ids, labels, and attention_mask, representing the input ID sequence, label sequence, and attention mask sequence respectively. The dataset is split into a training set with 1505 examples, totaling 15433811 bytes in size. The download size of the dataset is 4975800 bytes. The specific application scenario and content of the dataset are not described in the README.
提供机构:
ojo



