LLMTeamAkiyama/cleand_open-r1_codeforces-cots
收藏Hugging Face2025-08-06 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/LLMTeamAkiyama/cleand_open-r1_codeforces-cots
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本和推理标签的问答数据集,共有5334条数据。数据集的平均tokens数为11512,最大tokens数为30725,总tokens数为61406976。数据以JSONL格式存储,文件大小为213.1 MB。数据集在加工过程中去除了编辑器解决方案、限定停止理由、进行了token处理过滤、移除了think标签以及去除了重复内容。
This is a question-answering dataset with text and reasoning tags, containing a total of 5,334 entries. The dataset has an average of 11,512 tokens, a maximum of 30,725 tokens, and a total of 61,406,976 tokens. The data is stored in JSONL format with a file size of 213.1 MB. The dataset has been processed to remove editor solutions, limit stop reasons, filter for heavy token processing, remove entries with the think tag, and eliminate repetitions.
提供机构:
LLMTeamAkiyama



