five

InterMT

收藏
arXiv2025-09-30 收录
下载链接:
https://pku-intermt.github.io
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集名为InterMT,专为多轮次、多模态的理解和生成任务设计,通过专家标注来捕捉人类偏好。它由10万个图像-文本示例构建而成,包含了32,459个人类偏好标注,这些标注以评分形式的成对比较组织,既包括局部级别也包括全局级别。数据集规模包括15.6千个提示、52.6千个多轮对话实例以及32.4千个人类标记的偏好对。其任务重点在于多轮次多模态交互和问答。

This dataset, named InterMT, is specifically designed for multi-turn, multimodal understanding and generation tasks, and captures human preferences via expert annotations. It is constructed with 100,000 image-text examples, and contains 32,459 human preference annotations, which are organized as pairwise comparisons in the form of ratings covering both local and global levels. The dataset has a scale comprising 15.6k prompts, 52.6k multi-turn dialogue instances, and 32.4k human-annotated preference pairs. Its core task focuses on multi-turn multimodal interaction and question answering.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作