five

SA-FARI

收藏
魔搭社区2026-01-09 更新2025-11-22 收录
下载链接:
https://modelscope.cn/datasets/facebook/SA-FARI
下载链接
链接失效反馈
官方服务:
资源简介:
# SA-FARI Dataset **License** CC-BY-NC 4.0 **SA-FARI** is a wildlife camera dataset collected through a collaboration between Meta and [CXL](https://www.conservationxlabs.com/). All videos and pre-processed JPEGImages can be found in [cxl-public-camera-trap](https://console.cloud.google.com/storage/browser/cxl-public-camera-trap), which contains the following contents: ``` sa_fari/ ├── sa_fari_test_tars/ │ ├── JPEGImages_6fps/ │ ├── videos/ ├── sa_fari_test/ │ ├── JPEGImages_6fps/ │ ├── videos/ ├── sa_fari_train_tars/ │ ├── JPEGImages_6fps/ │ ├── videos/ └── sa_fari_train/ ├── JPEGImages_6fps/ └── videos/ ``` * `videos`: The original full fps videos. * `JPEGImages_6fps`: For annotation, the videos have been downsampled to 6fps. This folder contains the downsampled frames compatible with the annotation json files below. This Hugging Face dataset repo contains the annotations: ``` datasets/facebook/SA-FARI/tree/main/ └── annotation/ ├── sa_fari_test.json ├── sa_fari_test_ext.json ├── sa_fari_train.json └── sa_fari_train_ext.json ``` * `sa_fari_test.json` and `sa_fari_train.json` * Follow the same format as [SA-Co/VEval](https://huggingface.co/datasets/facebook/SACo-VEval/) * `sa_fari_test_ext.json` and `sa_fari_train_ext.json` * In additional to the [SA-Co/VEval] format, we added additional metadata to the following fields: * `videos`: * `video_num_frames`, `video_fps`, `video_creation_datetime` and `location_id` have been added as additional metadata to the `videos` field. * `categories`: * `Kingdom`, `Phylum`, `Class`, `Order`, `Family`, `Genus` and `Species` have been added when applicable as additional metadata to the `categories` field. All the SA-FARI annotation files are compatible to use the visualization notebook and offline evaluator developed in [SAM 3 Github](https://github.com/facebookresearch/sam3/tree/main/scripts/eval/veval). ## Annotation Format A format breakdown for `sa_fari_test.json` and `sa_fari_train.json`. The format is similar to the [YTVIS](https://youtube-vos.org/dataset/vis/) format. In the annotation json, e.g. `sa_fari_test.json` there are 5 fields: * info: * A dict containing the dataset info * E.g. {'version': 'v1', 'date': '2025-09-24', 'description': 'SA-FARI Test'} * videos * A list of videos that are used in the current annotation json * It contains {id, video_name, file_names, height, width, length} * annotations * A list of **positive** masklets and their related info * It contains {id, segmentations, bboxes, areas, iscrowd, video_id, height, width, category_id, noun_phrase} * video_id should match to the `videos - id` field above * category_id should match to the `categories - id` field below * segmentations is a list of [RLE](https://github.com/cocodataset/cocoapi/blob/master/PythonAPI/pycocotools/mask.py) * categories * A **globally** used noun phrase id map, which is true across all 3 domains. * It contains {id, name} * name is the noun phrase * video_np_pairs * A list of video-np pairs, including both **positive** and **negative** used in the current annotation json * It contains {id, video_id, category_id, noun_phrase, num_masklets} * video_id should match the `videos - id` above * category_id should match the `categories - id` above * when `num_masklets > 0` it is a positive video-np pair, and the presenting masklets can be found in the annotations field * when `num_masklets = 0` it is a negative video-np pair, meaning no masklet presenting at all ``` data { "info": info "videos": [video] "annotations": [annotation] "categories": [category] "video_np_pairs": [video_np_pair] } video { "id": int "video_name": str # e.g. sav_000000 "file_names": List[str] "height": int "width": width "length": length } annotation { "id": int "segmentations": List[RLE] "bboxes": List[List[int, int, int, int]] "areas": List[int] "iscrowd": int "video_id": str "height": int "width": int "category_id": int "noun_phrase": str } category { "id": int "name": str } video_np_pair { "id": int "video_id": str "category_id": int "noun_phrase": str "num_masklets" int } ```

# SA-FARI数据集 **授权协议** CC-BY-NC 4.0 **SA-FARI**是由Meta与保护X实验室(Conservation X Labs,简称CXL)合作采集的野生动物相机数据集。 所有视频与预处理后的JPEG图像均可在[cxl-public-camera-trap](https://console.cloud.google.com/storage/browser/cxl-public-camera-trap)获取,该存储桶包含以下内容: sa_fari/ ├── sa_fari_test_tars/ │ ├── JPEGImages_6fps/ │ ├── videos/ ├── sa_fari_test/ │ ├── JPEGImages_6fps/ │ ├── videos/ ├── sa_fari_train_tars/ │ ├── JPEGImages_6fps/ │ ├── videos/ └── sa_fari_train/ ├── JPEGImages_6fps/ └── videos/ * `videos`:原始全帧率视频。 * `JPEGImages_6fps`:为便于标注,已将视频下采样至6帧每秒,该文件夹包含与下方标注json文件兼容的下采样帧。 本Hugging Face数据集仓库包含标注文件: datasets/facebook/SA-FARI/tree/main/ └── annotation/ ├── sa_fari_test.json ├── sa_fari_test_ext.json ├── sa_fari_train.json └── sa_fari_train_ext.json * `sa_fari_test.json` 与 `sa_fari_train.json`:格式与[SA-Co/VEval](https://huggingface.co/datasets/facebook/SACo-VEval/)完全一致。 * `sa_fari_test_ext.json` 与 `sa_fari_train_ext.json`:除遵循SA-Co/VEval格式外,还为以下字段添加了额外元数据: * `videos`字段:新增`video_num_frames`、`video_fps`、`video_creation_datetime`及`location_id`作为补充元数据。 * `categories`字段:视应用场景为`categories`字段新增`界(Kingdom)`、`门(Phylum)`、`纲(Class)`、`目(Order)`、`科(Family)`、`属(Genus)`及`种(Species)`作为补充元数据。 所有SA-FARI标注文件均可兼容使用[SAM 3 GitHub](https://github.com/facebookresearch/sam3/tree/main/scripts/eval/veval)中开发的可视化脚本与离线评估工具。 ## 标注格式 以下为`sa_fari_test.json`与`sa_fari_train.json`的格式拆解说明,其格式与[YTVIS](https://youtube-vos.org/dataset/vis/)格式类似。 在标注json文件(例如`sa_fari_test.json`)中包含5个核心字段: * info: 一个包含数据集基础信息的字典,示例为`{"version": "v1", "date": "2025-09-24", "description": "SA-FARI Test"}`。 * videos: 当前标注json中使用的视频列表,每个元素包含字段`id`、`video_name`、`file_names`、`height`、`width`、`length`。 * annotations: 正样本掩码块(masklets)及其关联信息的列表,每个元素包含字段`id`、`segmentations`、`bboxes`、`areas`、`iscrowd`、`video_id`、`height`、`width`、`category_id`、`noun_phrase`。其中: * `video_id`需与上述`videos`字段中的`id`一一匹配 * `category_id`需与上述`categories`字段中的`id`一一匹配 * `segmentations`为[游程编码(Run-Length Encoding,简称RLE)](https://github.com/cocodataset/cocoapi/blob/master/PythonAPI/pycocotools/mask.py)格式的列表 * categories: 全局通用的名词短语ID映射表,适用于全部三类应用场景,每个元素包含字段`id`、`name`,其中`name`为对应的名词短语。 * video_np_pairs: 当前标注json中使用的视频-名词短语对列表,涵盖正样本与负样本对,每个元素包含字段`id`、`video_id`、`category_id`、`noun_phrase`、`num_masklets`。其中: * `video_id`需与上述`videos`字段中的`id`匹配 * `category_id`需与上述`categories`字段中的`id`匹配 * 当`num_masklets > 0`时为正样本视频-名词短语对,其包含的有效掩码块可在`annotations`字段中查询 * 当`num_masklets = 0`时为负样本视频-名词短语对,代表该样本中无任何掩码块存在 data { "info": info "videos": [video] "annotations": [annotation] "categories": [category] "video_np_pairs": [video_np_pair] } video { "id": 整数类型 "video_name": 字符串类型 # 例如 sav_000000 "file_names": 字符串列表 "height": 整数类型 "width": 整数类型 "length": 整数类型 } annotation { "id": 整数类型 "segmentations": 游程编码列表 "bboxes": 整数四元组列表 "areas": 整数列表 "iscrowd": 整数类型 "video_id": 字符串类型 "height": 整数类型 "width": 整数类型 "category_id": 整数类型 "noun_phrase": 字符串类型 } category { "id": 整数类型 "name": 字符串类型 } video_np_pair { "id": 整数类型 "video_id": 字符串类型 "category_id": 整数类型 "noun_phrase": 字符串类型 "num_masklets": 整数类型 }
提供机构:
maas
创建时间:
2025-11-20
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
SA-FARI是一个野生动物相机捕捉数据集,包含原始视频和降采样至6fps的JPEG图像,标注文件遵循特定格式并包含丰富的元数据信息。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作