ALLaVA-4V-Arabic
收藏魔搭社区2025-11-02 更新2025-01-25 收录
下载链接:
https://modelscope.cn/datasets/FreedomIntelligence/ALLaVA-4V-Arabic
下载链接
链接失效反馈官方服务:
资源简介:
## ALLaVA-4V for Arabic
This is the Arabic version of the ALLaVA-4V data. We have translated the ALLaVA-4V data into Arabic through ChatGPT and instructed ChatGPT not to translate content related to OCR.
The original dataset can be found [here](https://huggingface.co/datasets/FreedomIntelligence/ALLaVA-4V), and the image data can be downloaded from [ALLaVA-4V](https://huggingface.co/datasets/FreedomIntelligence/ALLaVA-4V).
#### Citation
If you find our data useful, please consider citing our work! We are FreedomIntelligence from Shenzhen Research Institute of Big Data and The Chinese University of Hong Kong, Shenzhen.
```
@misc{chen2024allava,
title={ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model},
author={Guiming Hardy Chen and Shunian Chen and Ruifei Zhang and Junying Chen and Xiangbo Wu and Zhiyi Zhang and Zhihong Chen and Jianquan Li and Xiang Wan and Benyou Wang},
year={2024},
eprint={2402.11684},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
## ALLaVA-4V 阿拉伯语版本
本数据集为ALLaVA-4V数据集的阿拉伯语适配版本。我们通过ChatGPT将原版ALLaVA-4V数据集翻译为阿拉伯语,并明确指示ChatGPT不对与光学字符识别(OCR)相关的内容进行翻译。
原版数据集可通过以下链接获取:https://huggingface.co/datasets/FreedomIntelligence/ALLaVA-4V,图像数据亦可从标注为ALLaVA-4V的该链接下载:https://huggingface.co/datasets/FreedomIntelligence/ALLaVA-4V。
#### 引用
若您认为本数据集对您的研究有所帮助,请考虑引用我们的相关工作!本团队为来自深圳大数据研究院与香港中文大学(深圳)的FreedomIntelligence。
@misc{chen2024allava,
title={ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model},
author={Guiming Hardy Chen and Shunian Chen and Ruifei Zhang and Junying Chen and Xiangbo Wu and Zhiyi Zhang and Zhihong Chen and Jianquan Li and Xiang Wan and Benyou Wang},
year={2024},
eprint={2402.11684},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
提供机构:
maas
创建时间:
2025-01-20



