llm-lab/oryxtrain_fanarvisknow_5_en
收藏Hugging Face2025-08-03 更新2025-08-09 收录
下载链接:
https://hf-mirror.com/datasets/llm-lab/oryxtrain_fanarvisknow_5_en
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本消息和图像,文本消息由内容和发送者角色组成,还有一个表示数据有效性的布尔字段。数据集提供了一个训练集,其中包含超过117万个示例,总文件大小约为329MB。数据集默认配置下的训练数据文件以data/train-*的模式存储。
The dataset consists of text messages and images, with the text messages comprising content and sender roles, along with a boolean field indicating data validity. The dataset provides a training set with over 1.17 million examples, with a total file size of approximately 329MB. The training data files are stored under the default configuration in the pattern data/train-*.
提供机构:
llm-lab



