opendatalab/WanJuanSiLu-Multimodal-5Languages
收藏Hugging Face2025-04-23 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/opendatalab/WanJuanSiLu-Multimodal-5Languages
下载链接
链接失效反馈官方服务:
资源简介:
WanJuan·SiLu多模态多语言语料库是一个经过大幅改进的数据集,它包含了八种关键语言的丰富多模态数据,适用于多模态研究和低资源语言处理。数据集经过精细标注,质量达到了工业级标准,包含了超过20种细粒度的多维度分类标签和详细的文本描述,适用于文化旅游、商业贸易、科技教育等多种场景。
The WanJuan·SiLu Multimodal Multilingual Corpus is an improved dataset containing rich multimodal data in eight key languages, suitable for multimodal research and low-resource language processing. The dataset has been finely annotated and meets industrial quality standards, including more than 20 types of fine-grained multidimensional classification labels and detailed text descriptions, applicable to various scenarios such as cultural tourism, commercial trade, and science and technology education.
提供机构:
opendatalab



