RLAIF-V Dataset
Source: arXiv. Indexed 2025-09-30.
Download link:
https://huggingface.co/datasets/openbmb/rlaif-v-dataset
Description:
The RLAIF-V dataset is used here with the Chameleon-7B model for dataset ablation studies whose task is to evaluate how different filtering methods affect results. With cosine-similarity filtering, using only 10% of the data yields a performance score of 54.23, which is 125% of the score obtained with the full dataset.
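The description does not say which vectors are compared or how the 10% subset is chosen, so the following is only a minimal sketch of one common form of cosine-similarity filtering. It assumes each sample comes with two embedding vectors (for example, one for the image and one for the text) and that the highest-scoring samples are the ones kept. The function names `cosine_similarity` and `filter_top_fraction` and the `keep_fraction` parameter are hypothetical and not taken from the study.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_top_fraction(pairs, keep_fraction=0.10):
    """Return the indices of the highest-scoring pairs, in original order.

    `pairs` is a list of (embedding_a, embedding_b) tuples. Which two
    embeddings are compared, and keeping the highest scores rather than
    applying a fixed threshold, are assumptions made for illustration.
    """
    scores = [cosine_similarity(a, b) for a, b in pairs]
    # Keep at least one sample so a small input does not yield an empty subset.
    keep_count = max(1, math.ceil(len(pairs) * keep_fraction))
    ranked = sorted(range(len(pairs)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:keep_count])
```

Calling `filter_top_fraction(pairs, keep_fraction=0.10)` on a list of embedding pairs returns the indices of the retained 10%, which could then be used to select the corresponding rows of the dataset before training.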



