llm-lab/oryxtrain_country_3_en
收藏Hugging Face2025-08-03 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/llm-lab/oryxtrain_country_3_en
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含消息内容和对应的角色信息,以及图片链接和有效性标志。消息内容以字符串形式存储,并分为训练集。训练集包含超过113万示例,数据集总大小为约306MB。数据集适用于需要处理文本和图像信息的NLP任务。
The dataset includes message content and corresponding role information, as well as image URLs and a validity flag. The message content is stored as strings and is split into a training set. The training set contains over 1.13 million examples, with the total dataset size being approximately 306MB. The dataset is suitable for NLP tasks that require processing text and image information.
提供机构:
llm-lab



