MoLA-LLM/OpenHelix-R-86k-v2
Source: Hugging Face. Updated 2025-08-21; indexed 2025-10-18.
Download link:
https://hf-mirror.com/datasets/MoLA-LLM/OpenHelix-R-86k-v2
Description:
OpenHelix R 86k v2 is a diverse and balanced reasoning dataset compiled from 5 other datasets. It covers role-playing, creative writing, general QA, and STEM (science, technology, engineering, and mathematics) content. It is intended to produce general, well-reasoning models that are not overly focused on STEM. Prompts with high n-gram overlap were filtered out to make the data more diverse and balanced.
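The n-gram overlap filtering mentioned in the description can be illustrated with a small sketch. This is not the dataset authors' actual pipeline: the word-level trigrams, the Jaccard similarity measure, the 0.5 threshold, and the function names (word_ngrams, jaccard, filter_near_duplicates) are all assumptions chosen for illustration.

```python
def word_ngrams(text, n=3):
    """Return the set of lowercase word n-grams in text (assumed tokenization: whitespace)."""
    tokens = text.lower().split()
    if len(tokens) < n:
        # Short prompts: treat the whole prompt as a single n-gram.
        return {tuple(tokens)} if tokens else set()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity between two sets; 0.0 if either is empty."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def filter_near_duplicates(prompts, n=3, threshold=0.5):
    """Keep a prompt only if its n-gram overlap with every kept prompt is below threshold.

    Hypothetical parameters: n and threshold are illustrative, not taken from the dataset.
    This greedy pass is quadratic in the number of kept prompts, fine for a demo.
    """
    kept = []
    kept_ngrams = []
    for prompt in prompts:
        grams = word_ngrams(prompt, n)
        if all(jaccard(grams, seen) < threshold for seen in kept_ngrams):
            kept.append(prompt)
            kept_ngrams.append(grams)
    return kept


if __name__ == "__main__":
    sample = [
        "Write a short story about a dragon who loves tea",
        "Write a short story about a dragon who loves coffee",
        "Explain how photosynthesis works in plants",
    ]
    for p in filter_near_duplicates(sample):
        print(p)
```

In this sketch the second sample prompt shares most of its trigrams with the first and is dropped, while the unrelated third prompt is kept. At the scale of tens of thousands of prompts, an approximate method such as MinHash would likely be preferable to the pairwise comparison used here.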
Provider:
MoLA-LLM



