SI-Lab/StepCountQA-RL-Traj_11_50_Combined

Name: SI-Lab/StepCountQA-RL-Traj_11_50_Combined
Creator: SI-Lab
Published: 2026-04-29 11:33:46
License: 暂无描述

Hugging Face2026-04-29 更新2026-05-03 收录

下载链接：

https://hf-mirror.com/datasets/SI-Lab/StepCountQA-RL-Traj_11_50_Combined

下载链接

链接失效反馈

官方服务：

资源简介：

StepCountQA-RL-Traj_11_50_Combined是一个统一的视觉计数RL训练数据集，覆盖了从11到50的对象计数。它合并了两个互补的11-50数据集：StepCountQA-RL-Traj_11_50（9,508个样本，带有JSON注释轨迹，转换为数字）和StepCountQA-RL-Traj_11_50_NumericOnly（29,722个样本，纯数字答案来自SFT轨迹）。合并后的数据集共有39,230个样本，无重复。每个样本包含一个原始图像、一个计数问题和一个纯数字答案字符串。数据集的结构包括图像序列、问题字符串和答案字符串。数据集的统计信息显示，总样本数为39,230，计数范围为11到50，无重复图像，总大小约为8.95 GB。计数分布表详细列出了每个计数对应的样本数量。数据集的使用方法是通过HuggingFace的load_dataset函数加载。数据集的创建过程包括从两个源数据集提取图像路径字段，验证无重复，然后合并为4个输出分片。

StepCountQA-RL-Traj_11_50_Combined is a unified visual counting RL training dataset covering object counts from 11 to 50. It merges two complementary 11-50 datasets: StepCountQA-RL-Traj_11_50 (9,508 samples with `<point>` JSON annotation trajectories, converted to numeric) and StepCountQA-RL-Traj_11_50_NumericOnly (29,722 samples with pure numeric answers from SFT trajectories). The combined dataset contains 39,230 samples with zero overlap. Each sample pairs an original image with a counting question and a pure numeric answer string. The dataset structure includes a sequence of images, a problem string, and an answer string. Statistics show 39,230 total samples, count range 11–50, zero duplicate images, and ~8.95 GB total size. A count distribution table details sample counts per number. Usage involves loading via HuggingFaces `load_dataset`. Creation involved merging two sources with verified zero overlap into 4 output shards.

提供机构：

SI-Lab

5,000+

优质数据集

54 个

任务类型

进入经典数据集