davanstrien/otb-augmented
收藏Hugging Face2026-04-28 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/davanstrien/otb-augmented
下载链接
链接失效反馈官方服务:
资源简介:
---
tags:
- classify-and-augment
- llm-annotated
---
# davanstrien/otb-augmented
LLM-annotated dataset produced by [classify-and-augment](https://github.com/davanstrien/classify-and-augment).
## Configuration
- **Model**: `HuggingFaceTB/SmolLM3-3B`
- **Labels**: `jim_crow`, `no_jim_crow`
- **Input rows**: 200
- **Output rows**: 200
## Label distribution
| Label | Real | Synthetic | Total |
|---|---:|---:|---:|
| `no_jim_crow` | 200 | 0 | 200 |
## Synthesis audit
| Class | Needed | Generated | Validated | Kept | Acceptance |
|---|---:|---:|---:|---:|---:|
| `jim_crow` | 50 | 0 | 0 | 0 | 0.0% |
Acceptance = synthetic candidates that the same model re-classified as the target class (self-consistency check, [Synthetic Imputation, arxiv 2504.15160](https://arxiv.org/abs/2504.15160)).
提供机构:
davanstrien



