kshitijthakkar/loggenix-mc-oraca-agentinstruct-1m-moonshot-v1
收藏Hugging Face2025-08-06 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/kshitijthakkar/loggenix-mc-oraca-agentinstruct-1m-moonshot-v1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含不同类型的文本数据,经过过滤处理后,共有训练集941768条数据和测试集104642条数据。数据集中的字段包括消息内容、分割名称、格式化文本、总标记数和编码文本。
The dataset includes various types of text data, after filtering process, it consists of 941,768 training examples and 104,642 testing examples. The fields in the dataset include message content, split name, formatted text, total token count, and encoded text.
提供机构:
kshitijthakkar



