ALLaVA-4V-Chinese

Name: ALLaVA-4V-Chinese
Creator: maas
Published: 2025-12-05 16:21:15
License: No description available

ModelScope community · Updated 2025-12-05 · Indexed 2024-06-08

Download link:

https://modelscope.cn/datasets/FreedomIntelligence/ALLaVA-4V-Chinese

Resource overview:

## ALLaVA-4V for Chinese

This is the Chinese version of the ALLaVA-4V data. We have translated the ALLaVA-4V data into Chinese through ChatGPT and instructed ChatGPT not to translate content related to OCR.

The original dataset can be found [here](https://huggingface.co/datasets/FreedomIntelligence/ALLaVA-4V), and the image data can be downloaded from [ALLaVA-4V](https://huggingface.co/datasets/FreedomIntelligence/ALLaVA-4V).

#### Citation

If you find our data useful, please consider citing our work! We are FreedomIntelligence from Shenzhen Research Institute of Big Data and The Chinese University of Hong Kong, Shenzhen.

```
@misc{chen2024allava,
  title={ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model},
  author={Guiming Hardy Chen and Shunian Chen and Ruifei Zhang and Junying Chen and Xiangbo Wu and Zhiyi Zhang and Zhihong Chen and Jianquan Li and Xiang Wan and Benyou Wang},
  year={2024},
  eprint={2402.11684},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
```

Provider:

maas

Created:

2025-01-20

Collected summary

Dataset introduction