One-RL-to-See-Them-All/Orsta-Data-47k
收藏Hugging Face2025-06-04 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/One-RL-to-See-Them-All/Orsta-Data-47k
下载链接
链接失效反馈官方服务:
资源简介:
Orsta-Data-47k是一个专门为使用V-Triune统一强化学习系统对视觉语言模型(VLMs)进行后训练而策划的专业数据集。其主要目的是支持在视觉推理和视觉感知任务之间进行强大的联合训练,使模型如Orsta实现高级多模态能力。这个数据集是从18个公开可用的数据集中精心挑选和精炼的,通过严格的过滤过程确保高质量和适合基于强化学习的微调。数据集涵盖了八个主要的任务类别,包括视觉推理任务(数学、拼图解决、科学问答、图表理解)和视觉感知任务(物体检测、视觉接地、物体计数、光学字符识别(OCR))。数据集总共有大约47,700个样本,内容分为视觉感知样本和视觉推理样本。所有数据都以高效的Parquet格式存储。该数据集旨在与V-Triune框架一起使用,用于基于强化学习的视觉语言模型的后训练。在Orsta模型的训练中,该数据集中的所有样本都被统一混合和使用。
Orsta-Data-47k is a specialized dataset curated for post-training of Vision-Language Models (VLMs) using the V-Triune unified reinforcement learning system. Its primary purpose is to enable robust joint training across a diverse spectrum of both visual reasoning and visual perception tasks, powering models like Orsta to achieve advanced multimodal capabilities. This dataset is a carefully selected aggregation from 18 publicly available datasets, refined through a rigorous filtering process to ensure high quality and suitability for RL-based fine-tuning.
提供机构:
One-RL-to-See-Them-All



