five

jlext07/jaguars

收藏
Hugging Face2025-12-10 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/jlext07/jaguars
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: filename dtype: string - name: label dtype: string - name: image dtype: image splits: - name: raw_images num_bytes: 34945123504.804 num_examples: 3098 - name: cropped_body num_bytes: 2060540969.592 num_examples: 3098 - name: cropped_head num_bytes: 737914072.544 num_examples: 3098 - name: segmented_body num_bytes: 1265192956.906 num_examples: 3098 download_size: 38280262700 dataset_size: 39008771503.846 configs: - config_name: default data_files: - split: raw_images path: data/raw_images-* - split: cropped_body path: data/cropped_body-* - split: cropped_head path: data/cropped_head-* - split: segmented_body path: data/segmented_body-* task_categories: - image-classification - image-segmentation size_categories: - 1K<n<10K license: cc-by-4.0 pretty_name: jaguar_identification language: - en tags: - biology --- # Jaguar Re-identification Dataset This dataset contains images of jaguars from the Porto Jofre region in the Pantanal National Park, Brazil. It was curated for the purpose of developing and evaluating deep learning models for individual jaguar identification for population tracking. ![](https://github.com/andandandand/practical-computer-vision/blob/main/images/jaguars_fo_2.png?raw=true) ## Dataset Description The [Jaguar Identification Project](https://www.jaguaridproject.com/) aims to track jaguar movements, health, and demographics. This contributes valuable data to conservation strategies, especially considering jaguars are classified as Near Threatened by the [IUCN](https://iucn.org/) due to habitat loss, poaching, and human-wildlife conflict. The core idea is to use photos taken by citizen scientists, to distinguish individual jaguars (e.g., identifying "Medrosa" vs. "Patricia"). **Origin of Data:** The images are primarily sourced from Porto Jofre, a wildlife reserve located in the Pantanal National Park, which is home to one of the largest wild jaguar populations and is a popular destination for ecotourism and wildlife observation. **Goal of the Dataset:** The primary goal is to facilitate the development of models that can distinguish different individual jaguars from images. This supports: * Population tracking and demographic studies. * Citizen science efforts * Conservation strategies by providing data on jaguar movements and health. ## Dataset Details **Data Collection and Preprocessing:** The dataset was prepared through a pipeline involving: * Collection of raw images (initially around 40GB, ~4300 images). * **Cropping and Segmentation:** * Cropped body images (using Grounding DINO). * Cropped face images. * Segmented body images (using SAM - Segment Anything Model). * Segmented face images. The dataset includes images of various individual jaguars, such as "Ousado," "Medrosa," and "Patricia." ## Motivation Protecting endangered and near-threatened species like jaguars starts with understanding their populations and the threats they face, such as wildfires, deforestation, illegal trade, climate change, and human-wildlife conflicts. ## Project Goals and Model Development The dataset is used to: * Contribute to citizen science efforts at the Pantanal Jaguar ID Project. * Support research on re-identification models. ## Citation and License All images were captured by Abigail Martin and contributors to the Jaguar ID Project https://www.jaguaridproject.com. The rights of ownership to these images remain to the Jaguar ID Project. We kindly ask you to keep all credits as shown on the original images. You will find these in the `raw_images` split. If you use this dataset in your research, please cite the following: ```bibtex @misc{jaguar_identification_dataset_2025, author = {Antonio Rueda-Toicen and Abigail Martin and Shahabeddin Dayani and Davide Panza and Aleksandra Kudaeva and Gerard de Melo}, title = {Jaguars of Pantanal Re-Identification Dataset}, year = {2025}, publisher = {Hugging Face}, journal = {Hugging Face Hub}, howpublished = {\url{https://huggingface.co/datasets/jaguaridentification/jaguars}} } ```

数据集信息: 特征: - 字段名:filename,数据类型:字符串(string) - 字段名:label,数据类型:字符串(string) - 字段名:image,数据类型:图像(image) 划分集: - 划分名:raw_images(原始图像),字节数:34945123504.804,样本数:3098 - 划分名:cropped_body(裁剪躯体图像),字节数:2060540969.592,样本数:3098 - 划分名:cropped_head(裁剪头部图像),字节数:737914072.544,样本数:3098 - 划分名:segmented_body(分割躯体图像),字节数:1265192956.906,样本数:3098 下载大小:38280262700字节,数据集总大小:39008771503.846字节 配置: - 配置名:default(默认配置),数据文件: - 划分:raw_images(原始图像),路径:data/raw_images-* - 划分:cropped_body(裁剪躯体图像),路径:data/cropped_body-* - 划分:cropped_head(裁剪头部图像),路径:data/cropped_head-* - 划分:segmented_body(分割躯体图像),路径:data/segmented_body-* 任务类别:图像分类(image-classification)、图像分割(image-segmentation) 样本规模类别:1K < n < 10K 许可证:cc-by-4.0 美观名称:jaguar_identification(美洲豹识别数据集) 语言:英语(en) 标签:生物学(biology) # 美洲豹重识别数据集 本数据集收录了巴西潘塔纳尔国家公园波尔图若弗雷区域的美洲豹影像,专为开发与评估用于个体美洲豹识别以开展种群追踪的深度学习模型而构建。 ![](https://github.com/andandandand/practical-computer-vision/blob/main/images/jaguars_fo_2.png?raw=true) ## 数据集说明 [美洲豹识别项目](https://www.jaguaridproject.com/)旨在追踪美洲豹的活动、健康状况与种群结构,为保护策略提供宝贵数据支撑。鉴于美洲豹因栖息地丧失、偷猎及人兽冲突被国际自然保护联盟(IUCN,https://iucn.org/)列为近危物种,本数据集的价值尤为突出。 本数据集的核心思路是利用公民科学家拍摄的影像,实现美洲豹个体的区分(例如识别"Medrosa"与"Patricia"两只个体)。 **数据来源**: 影像主要采集自潘塔纳尔国家公园内的野生动物保护区波尔图若弗雷区域,该区域拥有全球规模最大的野生美洲豹种群之一,同时也是生态旅游与野生动物观测的热门目的地。 **数据集目标**: 本数据集的核心目标是助力能够从影像中区分不同美洲豹个体的模型开发,具体支撑以下方向: * 种群追踪与种群结构研究 * 公民科学项目 * 通过提供美洲豹活动与健康数据,支撑保护策略制定。 ## 数据集详情 **数据采集与预处理**: 本数据集通过以下流程构建完成: * 原始影像采集(初始规模约40GB,包含约4300张影像) * **裁剪与分割**: * 基于Grounding DINO生成的裁剪躯体影像 * 裁剪头部影像 * 基于SAM(Segment Anything Model)生成的分割躯体影像 * 分割头部影像 本数据集收录了多只美洲豹个体的影像,例如"Ousado""Medrosa"与"Patricia"。 ## 项目动因 保护美洲豹这类濒危或近危物种的前提,是了解其种群现状与面临的威胁,包括野火、森林砍伐、非法贸易、气候变化及人兽冲突等。 ## 项目目标与模型开发 本数据集可用于: * 助力潘塔纳尔美洲豹识别项目的公民科学工作 * 支持重识别模型相关研究。 ## 引用与许可 所有影像均由阿比盖尔·马丁(Abigail Martin)及美洲豹识别项目的贡献者拍摄,影像版权归美洲豹识别项目所有。 我们恳请您保留原始影像上标注的所有署名信息,该信息可在`raw_images`划分的文件中获取。 若您在研究中使用本数据集,请引用如下文献: bibtex @misc{jaguar_identification_dataset_2025, author = {Antonio Rueda-Toicen and Abigail Martin and Shahabeddin Dayani and Davide Panza and Aleksandra Kudaeva and Gerard de Melo}, title = {Jaguars of Pantanal Re-Identification Dataset}, year = {2025}, publisher = {Hugging Face}, journal = {Hugging Face Hub}, howpublished = {url{https://huggingface.co/datasets/jaguaridentification/jaguars}} }
提供机构:
jlext07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作