five

UlkuTuncerKucuktas/StimBench

收藏
Hugging Face2026-03-21 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/UlkuTuncerKucuktas/StimBench
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-4.0 task_categories: - video-classification tags: - autism - ASD - stimming - behavior-recognition - benchmark pretty_name: StimBench size_categories: - n<1K --- # StimBench: A Benchmark for Stereotypical Motor Movement Detection in Autism ## Overview StimBench is a curated video clip dataset for detecting stereotypical motor movements (stimming) in children with Autism Spectrum Disorder. It merges and cleans four existing stimming datasets into a single standardized benchmark with proper train/test splits, no data leakage, and face anonymization. ## Dataset Statistics | Split | ArmFlapping | HeadBanging | Spinning | Normal | Total | |-------|-------------|-------------|----------|--------|-------| | Train | 76 | 31 | 43 | 120 | 270 | | Test | 15 | 8 | 10 | 30 | 63 | | **Total** | **91** | **39** | **53** | **150** | **333** | ## Sources **Stimming clips** are derived from four publicly available datasets: - **SSBD** (Rajagopalan et al., 2013) — 75 entries - **ESBD** (OckerGui, 2022) — 117 entries - **WEI-BD** (OckerGui, 2022) — 9 entries - **SSBD+** (SARL-IIITB, 2023) — 46 entries **Normal clips** are sourced from **Kinetics-400** (Kay et al., 2017), filtered for 23 child-relevant activity classes including crawling baby, clapping, dancing, playing with pets, drawing, riding a bike, and playing basketball. ## Classes | Class | Description | Source | |-------|-------------|--------| | ArmFlapping | Repetitive flapping of arms/hands, hand waving, finger play | SSBD/ESBD/SSBD+ | | HeadBanging | Repetitive head banging or hitting | SSBD/ESBD/SSBD+ | | Spinning | Whole-body spinning/rotating | SSBD/ESBD/SSBD+ | | Normal | Non-stimming everyday activities | Kinetics-400 | ## Key Properties - **No data leakage**: Split at video level — no source video appears in both train and test - **Gender-balanced test set**: Equal male/female representation per stimming category - **Face anonymized**: All faces blurred using CenterFace detector (deface, threshold=0.2) - **Normal class from external source**: Kinetics-400 clips prevent scene-level shortcut exploitation ## File Structure ``` StimBench/ ├── train/ │ ├── armflapping/ 001.mp4 ... 076.mp4 │ ├── headbanging/ 001.mp4 ... 031.mp4 │ ├── spinning/ 001.mp4 ... 043.mp4 │ └── normal/ 001.mp4 ... 120.mp4 ├── test/ │ ├── armflapping/ 001.mp4 ... 015.mp4 │ ├── headbanging/ 001.mp4 ... 008.mp4 │ ├── spinning/ 001.mp4 ... 010.mp4 │ └── normal/ 001.mp4 ... 030.mp4 ├── metadata.csv └── README.md ``` ## Loading ```python from datasets import load_dataset dataset = load_dataset("videofolder", data_dir="StimBench") ``` ## Metadata Each clip has the following metadata in `metadata.csv`: | Field | Description | |-------|-------------| | file_name | Relative path (e.g., train/armflapping/001.mp4) | | label | Class name | | split | train / test | | type | stimming / normal | | source_dataset | SSBD / ESBD / WEI BD / SSBD+ / kinetics-400 | | group_id | YouTube video ID (for GroupKFold) | | url | Original YouTube URL | | clip_start | Start time in source video (seconds) | | clip_end | End time in source video (seconds) | | clip_duration | Clip length (seconds) | | gender | M / F / Unknown (stimming clips only) | ## License CC-BY-NC-4.0. Original stimming videos are from YouTube via SSBD/ESBD/SSBD+. Normal videos are from Kinetics-400. This dataset provides curated clips with annotations and standardized splits for research purposes only.

--- license: CC BY-NC 4.0(知识共享署名-非商业性使用4.0国际协议) task_categories: - 视频分类(video-classification) tags: - 自闭症(autism) - 孤独症谱系障碍(ASD, Autism Spectrum Disorder) - 刻板运动动作(stimming) - 行为识别(behavior-recognition) - 基准数据集(benchmark) pretty_name: StimBench size_categories: - 样本数量少于1000 --- # StimBench:自闭症群体刻板运动动作检测基准数据集 ## 概述 StimBench是一款经过精心整理的视频片段数据集,用于检测孤独症谱系障碍(ASD, Autism Spectrum Disorder)儿童的刻板运动动作(stimming)。该数据集整合并清洗了4个现有刻板动作数据集,构建为单一标准化基准数据集,具备规范的训练/测试集划分、无数据泄露问题,并完成了人脸匿名化处理。 ## 数据集统计 | 划分集 | 挥臂动作 | 撞头动作 | 旋转动作 | 正常动作 | 总计 | |-------|-------------|-------------|----------|--------|-------| | 训练集 | 76 | 31 | 43 | 120 | 270 | | 测试集 | 15 | 8 | 10 | 30 | 63 | | **总计** | **91** | **39** | **53** | **150** | **333** | ## 数据来源 **刻板动作片段**源自4个公开可用的数据集: - **SSBD**(Rajagopalan等,2013)—— 75条数据 - **ESBD**(OckerGui,2022)—— 117条数据 - **WEI-BD**(OckerGui,2022)—— 9条数据 - **SSBD+**(SARL-IIITB,2023)—— 46条数据 **正常动作片段**源自**Kinetics-400**(Kay等,2017),筛选出23类与儿童相关的活动类别,包括婴儿爬行、拍手、舞蹈、与宠物玩耍、绘画、骑自行车以及打篮球等。 ## 类别定义 | 类别 | 描述 | 来源 | |-------|-------------|--------| | 挥臂动作(ArmFlapping) | 手臂/手部重复性挥动、挥手、手指活动 | SSBD/ESBD/SSBD+ | | 撞头动作(HeadBanging) | 重复性撞头或头部击打 | SSBD/ESBD/SSBD+ | | 旋转动作(Spinning) | 全身旋转/转动 | SSBD/ESBD/SSBD+ | | 正常动作(Normal) | 非刻板动作的日常活动 | Kinetics-400 | ## 核心特性 - **无数据泄露**:以视频为单位进行划分——同一源视频不会同时出现在训练集和测试集中 - **测试集性别均衡**:每类刻板动作类别中男性与女性样本占比相等 - **人脸匿名化**:使用CenterFace检测器(deface,阈值=0.2)对所有人脸进行模糊处理 - **正常类别源自外部数据集**:采用Kinetics-400片段可避免模型利用场景级捷径进行过拟合 ## 文件结构 StimBench/ ├── train/ │ ├── armflapping/ 001.mp4 … 076.mp4 │ ├── headbanging/ 001.mp4 … 031.mp4 │ ├── spinning/ 001.mp4 … 043.mp4 │ └── normal/ 001.mp4 … 120.mp4 ├── test/ │ ├── armflapping/ 001.mp4 … 015.mp4 │ ├── headbanging/ 001.mp4 … 008.mp4 │ ├── spinning/ 001.mp4 … 010.mp4 │ └── normal/ 001.mp4 … 030.mp4 ├── metadata.csv └── README.md ## 加载方式 python from datasets import load_dataset dataset = load_dataset("videofolder", data_dir="StimBench") ## 元数据 `metadata.csv`中每个视频片段包含以下元数据字段: | 字段 | 描述 | |-------|-------------| | file_name | 文件相对路径(例如:train/armflapping/001.mp4) | | label | 类别名称 | | split | 划分集(训练集/测试集) | | type | 动作类型(刻板动作/正常动作) | | source_dataset | 源数据集(SSBD / ESBD / WEI BD / SSBD+ / Kinetics-400) | | group_id | 用于GroupKFold分组的YouTube视频ID | | url | 原始YouTube链接 | | clip_start | 源视频中的剪辑起始时间(秒) | | clip_end | 源视频中的剪辑结束时间(秒) | | clip_duration | 剪辑时长(秒) | | gender | 性别(M/F/未知,仅刻板动作片段包含该字段) | ## 许可证 CC BY-NC 4.0。原始刻板动作视频源自YouTube,分别来自SSBD/ESBD/SSBD+。正常视频源自Kinetics-400。本数据集仅提供经过整理的剪辑片段、标注信息及标准化划分集,仅用于科研用途。
提供机构:
UlkuTuncerKucuktas
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作