horus-ai-labs/smoltalk-filtered
收藏Hugging Face2025-01-11 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/horus-ai-labs/smoltalk-filtered
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含消息内容、角色、类别、难度、质量、奖励模型分数、会话令牌数、令牌数和类型等字段。数据集分为训练集和测试集,其中训练集包含778,415个示例,测试集包含40,972个示例。数据集的总大小为3,716,948,944字节。
The dataset includes fields such as message content, role, category, difficulty, quality, reward model score, conversation token count, token count, and type. The dataset is split into a training set and a test set, with the training set containing 778,415 examples and the test set containing 40,972 examples. The total size of the dataset is 3,716,948,944 bytes.
提供机构:
horus-ai-labs



