JingHaoZ/RLFR-Dataset-VLM
收藏Hugging Face2025-10-14 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/JingHaoZ/RLFR-Dataset-VLM
下载链接
链接失效反馈官方服务:
资源简介:
RLFR-Dataset-VLM是一个包含170k数学样本的数据集,用于增强大型视觉语言模型(VLMs)的推理能力。该数据集包括来自MMPRv1.1的数学子集的离线部分,共115k样本,以及来自MM-Eureka-Dataset的强化学习部分,共55k样本。
The RLFR-Dataset-VLM is a collection of 170k math samples designed to enhance the reasoning capabilities of Large Vision Language Models (VLMs). This dataset includes an offline part from the math subsets of MMPRv1.1, containing 115k samples, and an RL part from the MM-Eureka-Dataset, containing 55k samples.
提供机构:
JingHaoZ



