zjunlp/OceanBenchMark

Name: zjunlp/OceanBenchMark
Creator: zjunlp
Published: 2026-04-10 09:43:02
License: 暂无描述

Hugging Face2026-04-10 更新2026-04-12 收录

下载链接：

https://hf-mirror.com/datasets/zjunlp/OceanBenchMark

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: input dtype: string - name: output dtype: string - name: image_path dtype: string splits: - name: test num_bytes: 0 num_examples: 0 configs: - config_name: science_qa description: "Marine Science QA (Text only)." - config_name: science_vqa description: "Marine Science VQA." - config_name: sonar_vqa description: "Sonar Image QA." - config_name: organism_vqa description: "Marine Organism VQA." license: mit language: - zh - en task_categories: - question-answering - image-text-to-text pretty_name: OceanBenchmark tags: - benchmark - evaluation - ocean - marine-science --- # OceanBenchmark ## 1. Dataset Description OceanBenchmark is a benchmark dataset designed to evaluate the comprehensive capabilities of marine-focused large models. It encompasses a diverse range of tasks, spanning from unimodal marine science knowledge question answering to complex multimodal visual question answering. ## 2. Sub-datasets | Subset Directory | Task Type | Sample Size | Description | |:---|---|---|---| | **Ocean Science QA** | QA | 102 | Text-only multiple-choice and open-ended questions in marine science. | | **Ocean Science VQA** | VQA | 99 | Visual question answering based on scientific diagrams and imagery. | | **Sonar VQA Marine** | VQA | 796 | Target detection and question answering evaluation on sonar imagery. | | **Marine Organisms VQA** | VQA | 472 | Classification and identification tests for marine organisms. | ## 3. Usage Example ```python from datasets import load_dataset ``` # Load the sonar evaluation subset ds_test = load_dataset("zjunlp/OceanBenchmark", "sonar_vqa", split="test") print(ds_test[0]['input']) ## 4. License License: MIT

提供机构：

zjunlp

5,000+

优质数据集

54 个

任务类型

进入经典数据集