five

RAP-260K

收藏
魔搭社区2025-09-02 更新2025-09-06 收录
下载链接:
https://modelscope.cn/datasets/lhn526/RAP-260K
下载链接
链接失效反馈
官方服务:
资源简介:
# RAP Instruct 260K ## Dataset details [`rap_train_260k.json`](https://huggingface.co/datasets/Hoar012/RAP-260K/blob/main/rap_train_260k.json) is the full dataset used for training RAP-Phi3-V; [`rap_train_210k.json`](https://huggingface.co/datasets/Hoar012/RAP-260K/blob/main/rap_train_210k.json) is a subset used for training RAP-LLaVA. We also provide the script [`generate_negative.py`](https://huggingface.co/datasets/Hoar012/RAP-260K/blob/main/generate_negative.py) for generating additional negative samples. **Paper Link:** https://arxiv.org/pdf/2410.13360 **Project Page:** https://hoar012.github.io/RAP-Project/ ## Intended Use **Primary intended uses:** The primary use is research on the personalization of multimodal LLMs (MLLMs). **Primary intended users:** The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence. ## Agreement - The RAP dataset is available for non-commercial research purposes only, we do not own the rights to these images. - You agree not to reproduce, duplicate, copy, sell, trade, resell or exploit for any commercial purposes, any portion of the images and any portion of derived data. ## Citation ``` @InProceedings{Hao_2025_CVPR, author = {Hao, Haoran and Han, Jiaming and Li, Changsheng and Li, Yu-Feng and Yue, Xiangyu}, title = {RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)}, month = {June}, year = {2025}, pages = {14538-14548} } ```

# RAP Instruct 260K 数据集 ## 数据集详情 [`rap_train_260k.json`](https://huggingface.co/datasets/Hoar012/RAP-260K/blob/main/rap_train_260k.json) 为用于训练 RAP-Phi3-V 的完整数据集;[`rap_train_210k.json`](https://huggingface.co/datasets/Hoar012/RAP-260K/blob/main/rap_train_210k.json) 为用于训练 RAP-LLaVA 的子集数据集。 我们还提供了脚本 [`generate_negative.py`](https://huggingface.co/datasets/Hoar012/RAP-260K/blob/main/generate_negative.py),用于生成额外的负样本。 **论文链接:** https://arxiv.org/pdf/2410.13360 **项目主页:** https://hoar012.github.io/RAP-Project/ ## 预期用途 **核心用途:** 本数据集主要用于多模态大语言模型(Multimodal Large Language Models,MLLMs)的个性化研究。 **核心适用人群:** 本数据集的核心适用人群为计算机视觉、自然语言处理、机器学习与人工智能领域的研究人员及爱好者。 ## 使用协议 - RAP 数据集仅可用于非商业性研究用途,我们不拥有这些图像的版权。 - 您同意不得出于任何商业目的,对图像或衍生数据的任何部分进行复制、拷贝、售卖、交易、转售或利用。 ## 引用格式 @InProceedings{Hao_2025_CVPR, author = {Hao, Haoran and Han, Jiaming and Li, Changsheng and Li, Yu-Feng and Yue, Xiangyu}, title = {RAP: Retrieval-Augmented Personalization for Multimodal Large Language Models}, booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)}, month = {June}, year = {2025}, pages = {14538-14548} }
提供机构:
maas
创建时间:
2025-09-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作