UlkuTuncerKucuktas/StimBench
收藏Hugging Face2026-03-21 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/UlkuTuncerKucuktas/StimBench
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-4.0
task_categories:
- video-classification
tags:
- autism
- ASD
- stimming
- behavior-recognition
- benchmark
pretty_name: StimBench
size_categories:
- n<1K
---
# StimBench: A Benchmark for Stereotypical Motor Movement Detection in Autism
## Overview
StimBench is a curated video clip dataset for detecting stereotypical motor movements (stimming) in children with Autism Spectrum Disorder. It merges and cleans four existing stimming datasets into a single standardized benchmark with proper train/test splits, no data leakage, and face anonymization.
## Dataset Statistics
| Split | ArmFlapping | HeadBanging | Spinning | Normal | Total |
|-------|-------------|-------------|----------|--------|-------|
| Train | 76 | 31 | 43 | 120 | 270 |
| Test | 15 | 8 | 10 | 30 | 63 |
| **Total** | **91** | **39** | **53** | **150** | **333** |
## Sources
**Stimming clips** are derived from four publicly available datasets:
- **SSBD** (Rajagopalan et al., 2013) — 75 entries
- **ESBD** (OckerGui, 2022) — 117 entries
- **WEI-BD** (OckerGui, 2022) — 9 entries
- **SSBD+** (SARL-IIITB, 2023) — 46 entries
**Normal clips** are sourced from **Kinetics-400** (Kay et al., 2017), filtered for 23 child-relevant activity classes including crawling baby, clapping, dancing, playing with pets, drawing, riding a bike, and playing basketball.
## Classes
| Class | Description | Source |
|-------|-------------|--------|
| ArmFlapping | Repetitive flapping of arms/hands, hand waving, finger play | SSBD/ESBD/SSBD+ |
| HeadBanging | Repetitive head banging or hitting | SSBD/ESBD/SSBD+ |
| Spinning | Whole-body spinning/rotating | SSBD/ESBD/SSBD+ |
| Normal | Non-stimming everyday activities | Kinetics-400 |
## Key Properties
- **No data leakage**: Split at video level — no source video appears in both train and test
- **Gender-balanced test set**: Equal male/female representation per stimming category
- **Face anonymized**: All faces blurred using CenterFace detector (deface, threshold=0.2)
- **Normal class from external source**: Kinetics-400 clips prevent scene-level shortcut exploitation
## File Structure
```
StimBench/
├── train/
│ ├── armflapping/ 001.mp4 ... 076.mp4
│ ├── headbanging/ 001.mp4 ... 031.mp4
│ ├── spinning/ 001.mp4 ... 043.mp4
│ └── normal/ 001.mp4 ... 120.mp4
├── test/
│ ├── armflapping/ 001.mp4 ... 015.mp4
│ ├── headbanging/ 001.mp4 ... 008.mp4
│ ├── spinning/ 001.mp4 ... 010.mp4
│ └── normal/ 001.mp4 ... 030.mp4
├── metadata.csv
└── README.md
```
## Loading
```python
from datasets import load_dataset
dataset = load_dataset("videofolder", data_dir="StimBench")
```
## Metadata
Each clip has the following metadata in `metadata.csv`:
| Field | Description |
|-------|-------------|
| file_name | Relative path (e.g., train/armflapping/001.mp4) |
| label | Class name |
| split | train / test |
| type | stimming / normal |
| source_dataset | SSBD / ESBD / WEI BD / SSBD+ / kinetics-400 |
| group_id | YouTube video ID (for GroupKFold) |
| url | Original YouTube URL |
| clip_start | Start time in source video (seconds) |
| clip_end | End time in source video (seconds) |
| clip_duration | Clip length (seconds) |
| gender | M / F / Unknown (stimming clips only) |
## License
CC-BY-NC-4.0. Original stimming videos are from YouTube via SSBD/ESBD/SSBD+. Normal videos are from Kinetics-400. This dataset provides curated clips with annotations and standardized splits for research purposes only.
---
license: CC BY-NC 4.0(知识共享署名-非商业性使用4.0国际协议)
task_categories:
- 视频分类(video-classification)
tags:
- 自闭症(autism)
- 孤独症谱系障碍(ASD, Autism Spectrum Disorder)
- 刻板运动动作(stimming)
- 行为识别(behavior-recognition)
- 基准数据集(benchmark)
pretty_name: StimBench
size_categories:
- 样本数量少于1000
---
# StimBench:自闭症群体刻板运动动作检测基准数据集
## 概述
StimBench是一款经过精心整理的视频片段数据集,用于检测孤独症谱系障碍(ASD, Autism Spectrum Disorder)儿童的刻板运动动作(stimming)。该数据集整合并清洗了4个现有刻板动作数据集,构建为单一标准化基准数据集,具备规范的训练/测试集划分、无数据泄露问题,并完成了人脸匿名化处理。
## 数据集统计
| 划分集 | 挥臂动作 | 撞头动作 | 旋转动作 | 正常动作 | 总计 |
|-------|-------------|-------------|----------|--------|-------|
| 训练集 | 76 | 31 | 43 | 120 | 270 |
| 测试集 | 15 | 8 | 10 | 30 | 63 |
| **总计** | **91** | **39** | **53** | **150** | **333** |
## 数据来源
**刻板动作片段**源自4个公开可用的数据集:
- **SSBD**(Rajagopalan等,2013)—— 75条数据
- **ESBD**(OckerGui,2022)—— 117条数据
- **WEI-BD**(OckerGui,2022)—— 9条数据
- **SSBD+**(SARL-IIITB,2023)—— 46条数据
**正常动作片段**源自**Kinetics-400**(Kay等,2017),筛选出23类与儿童相关的活动类别,包括婴儿爬行、拍手、舞蹈、与宠物玩耍、绘画、骑自行车以及打篮球等。
## 类别定义
| 类别 | 描述 | 来源 |
|-------|-------------|--------|
| 挥臂动作(ArmFlapping) | 手臂/手部重复性挥动、挥手、手指活动 | SSBD/ESBD/SSBD+ |
| 撞头动作(HeadBanging) | 重复性撞头或头部击打 | SSBD/ESBD/SSBD+ |
| 旋转动作(Spinning) | 全身旋转/转动 | SSBD/ESBD/SSBD+ |
| 正常动作(Normal) | 非刻板动作的日常活动 | Kinetics-400 |
## 核心特性
- **无数据泄露**:以视频为单位进行划分——同一源视频不会同时出现在训练集和测试集中
- **测试集性别均衡**:每类刻板动作类别中男性与女性样本占比相等
- **人脸匿名化**:使用CenterFace检测器(deface,阈值=0.2)对所有人脸进行模糊处理
- **正常类别源自外部数据集**:采用Kinetics-400片段可避免模型利用场景级捷径进行过拟合
## 文件结构
StimBench/
├── train/
│ ├── armflapping/ 001.mp4 … 076.mp4
│ ├── headbanging/ 001.mp4 … 031.mp4
│ ├── spinning/ 001.mp4 … 043.mp4
│ └── normal/ 001.mp4 … 120.mp4
├── test/
│ ├── armflapping/ 001.mp4 … 015.mp4
│ ├── headbanging/ 001.mp4 … 008.mp4
│ ├── spinning/ 001.mp4 … 010.mp4
│ └── normal/ 001.mp4 … 030.mp4
├── metadata.csv
└── README.md
## 加载方式
python
from datasets import load_dataset
dataset = load_dataset("videofolder", data_dir="StimBench")
## 元数据
`metadata.csv`中每个视频片段包含以下元数据字段:
| 字段 | 描述 |
|-------|-------------|
| file_name | 文件相对路径(例如:train/armflapping/001.mp4) |
| label | 类别名称 |
| split | 划分集(训练集/测试集) |
| type | 动作类型(刻板动作/正常动作) |
| source_dataset | 源数据集(SSBD / ESBD / WEI BD / SSBD+ / Kinetics-400) |
| group_id | 用于GroupKFold分组的YouTube视频ID |
| url | 原始YouTube链接 |
| clip_start | 源视频中的剪辑起始时间(秒) |
| clip_end | 源视频中的剪辑结束时间(秒) |
| clip_duration | 剪辑时长(秒) |
| gender | 性别(M/F/未知,仅刻板动作片段包含该字段) |
## 许可证
CC BY-NC 4.0。原始刻板动作视频源自YouTube,分别来自SSBD/ESBD/SSBD+。正常视频源自Kinetics-400。本数据集仅提供经过整理的剪辑片段、标注信息及标准化划分集,仅用于科研用途。
提供机构:
UlkuTuncerKucuktas



