five

techviewer00/AQUA20

收藏
Hugging Face2026-04-02 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/techviewer00/AQUA20
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en task_categories: - image-classification tags: - underwater - marine-biology - species-classification - benchmark dataset_info: features: - name: image dtype: image - name: label dtype: class_label: names: '0': coral '1': crab '2': diver '3': eel '4': fish '5': fishInGroups '6': flatworm '7': jellyfish '8': marine_dolphin '9': octopus '10': rayfish '11': seaAnemone '12': seaCucumber '13': seaSlug '14': seaUrchin '15': shark '16': shrimp '17': squid '18': starfish '19': turtle splits: - name: train num_bytes: 102853257.2 num_examples: 6559 - name: test num_bytes: 23452132.448 num_examples: 1612 download_size: 199773745 dataset_size: 126305389.648 configs: - config_name: default data_files: - split: train path: data/train-* - split: test path: data/test-* size_categories: - 1K<n<10K --- # AQUA20 Dataset The AQUA20 dataset is a comprehensive benchmark dataset designed for **underwater species classification** under challenging real-world conditions. It comprises 8,171 underwater images across 20 distinct marine species, specifically curated to reflect environmental complexities such as turbidity, low illumination, and occlusion, which commonly degrade the performance of standard vision systems. This dataset provides a valuable resource for advancing robust visual recognition in aquatic environments. The dataset was presented in the paper [AQUA20: A Benchmark Dataset for Underwater Species Classification under Challenging Conditions](https://huggingface.co/papers/2506.17455). ## Sample Usage You can easily load the AQUA20 dataset using the Hugging Face `datasets` library: ```python from datasets import load_dataset # Load the dataset dataset = load_dataset("AQUA20") # Access the training split train_dataset = dataset["train"] print(f"Number of examples in training set: {len(train_dataset)}") # Access the test split test_dataset = dataset["test"] print(f"Number of examples in test set: {len(test_dataset)}") # Example of accessing an image and its label example = train_dataset[0] image = example["image"] label = example["label"] print(f"Example label: {label} (Class Name: {train_dataset.features['label'].names[label]})") # You can optionally display the image if you have PIL and matplotlib installed # import matplotlib.pyplot as plt # plt.imshow(image) # plt.title(f"Label: {train_dataset.features['label'].names[label]}") # plt.axis('off') # plt.show() ``` ## Citation ```bibtex @article{fuadAQUA20BenchmarkDataset2026, title = {{AQUA20}: {A} {Benchmark} {Dataset} for {Underwater} {Species} {Classification} under {Challenging} {Conditions}}, issn = {2191-4281}, shorttitle = {{AQUA20}}, url = {https://doi.org/10.1007/s13369-026-11231-z}, doi = {10.1007/s13369-026-11231-z}, language = {en}, urldate = {2026-03-27}, journal = {Arabian Journal for Science and Engineering}, author = {Fuad, Taufikur Rahman and Ahmed, Sabbir and Ivan, Shahriar}, month = mar, year = {2026}, keywords = {AQUA20, Environmental challenges, Explainable deep learning, Marine species classification, Underwater visual recognition, Vision transformers and lightweight CNNs}, } ```
提供机构:
techviewer00
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作