IntervitensInc/slimorca_50k_axo_qwen3
Source: Hugging Face. Updated: 2025-06-13. Indexed: 2025-07-05.
Download link:
https://hf-mirror.com/datasets/IntervitensInc/slimorca_50k_axo_qwen3
Description:
The dataset contains three sequences per example: input IDs (int32), attention masks (int8), and labels (int64). It has a single training split of 50,000 examples, and the total dataset size is 246,759,667 bytes. A default configuration is provided, specifying the path to the training set data files.
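The figures in the description allow a rough estimate of sequence length. The following is a minimal sketch, not part of the dataset card. It assumes the reported size is dominated by the raw values of the three sequences, ignoring any storage overhead, so that each token position costs 4 + 1 + 8 = 13 bytes. The helper names are hypothetical. `load_train_split` uses the `datasets` library's `load_dataset` call and needs that package plus network access; the arithmetic runs without it.

```python
# Figures quoted in the dataset description.
NUM_EXAMPLES = 50_000
DATASET_SIZE_BYTES = 246_759_667

# Bytes per token position: int32 input ID + int8 attention mask + int64 label.
BYTES_PER_TOKEN = 4 + 1 + 8


def average_bytes_per_example() -> float:
    """Average storage per example, from the reported totals."""
    return DATASET_SIZE_BYTES / NUM_EXAMPLES


def estimated_tokens_per_example() -> float:
    """Rough average sequence length.

    Assumption: the reported size consists almost entirely of raw values,
    so storage overhead is ignored.
    """
    return average_bytes_per_example() / BYTES_PER_TOKEN


def load_train_split():
    """Load the training split (requires `datasets` and network access)."""
    # Imported lazily so the arithmetic above works without the package.
    from datasets import load_dataset

    return load_dataset("IntervitensInc/slimorca_50k_axo_qwen3", split="train")


if __name__ == "__main__":
    print(f"{average_bytes_per_example():.1f} bytes per example")
    print(f"about {estimated_tokens_per_example():.0f} tokens per example")
```

Under these assumptions the average example is a few hundred tokens long. The actual length distribution can only be confirmed by loading the data.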
Provider:
IntervitensInc



