junlinw/opc-sft-s2-annealing-ins3-python-precode0.5-og0.1_OG_var3
收藏Hugging Face2025-10-05 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/junlinw/opc-sft-s2-annealing-ins3-python-precode0.5-og0.1_OG_var3
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含以下字段:input_ids(int32类型的列表)、labels(int64类型的列表)、t(float64类型)、num_prompt_tokens(int64类型)、num_mask_tokens(int64类型)和original(int64类型的列表)。数据集分为训练集和测试集,其中训练集大小为6076413508字节,包含2219948个示例;测试集大小为61237164字节,包含22424个示例。数据集的总下载大小为1001500154字节,总数据大小为6137650672字节。
The dataset includes the following fields: input_ids (list of int32), labels (list of int64), t (float64), num_prompt_tokens (int64), num_mask_tokens (int64), and original (list of int64). The dataset is split into a training set and a test set, with the training set being 6076413508 bytes in size and containing 2219948 examples; the test set is 61237164 bytes in size and contains 22424 examples. The total download size of the dataset is 1001500154 bytes, and the total data size is 6137650672 bytes.
提供机构:
junlinw



