LLMTeamAkiyama/clean_openthought312_difficulty_9_qwentoken
收藏Hugging Face2025-08-22 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/LLMTeamAkiyama/clean_openthought312_difficulty_9_qwentoken
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个文本推理数据集,包含大约14339个数据项,平均每个项包含约13367个トークン,最大为16805个トークン。数据集总トーク数约为191,665,652个,以JSONL格式存储,分为3个文件,总大小为724.7MB。数据集已经使用Qwen235B-A22B模型进行了tokenize处理。
This dataset is a text reasoning dataset containing approximately 14,339 items, with an average of 13,367 tokens per item and a maximum of 16,805 tokens. The total number of tokens in the dataset is about 191,665,652, stored in JSONL format, split into 3 files, with a total size of 724.7MB. The dataset has been tokenized using the Qwen235B-A22B model.
提供机构:
LLMTeamAkiyama



