
TTS-AGI/majestrino-unified-detailed-captions-temporal

Source: Hugging Face. Updated 2026-03-29. Indexed 2026-04-05.
Download link:
https://hf-mirror.com/datasets/TTS-AGI/majestrino-unified-detailed-captions-temporal
Resource description:
---
license: cc-by-4.0
task_categories:
- audio-classification
- automatic-speech-recognition
tags:
- audio
- captions
- temporal
- majestrino
pretty_name: Majestrino Unified Detailed Captions with Temporal Aspects
---

# Majestrino Unified Detailed Captions with Temporal Aspects

Filtered subset of [laion/majestrino-data](https://huggingface.co/datasets/laion/majestrino-data) containing only samples with `unified_detailed_caption_with_temporal_aspects`.

## Stats

- **4,128,665** samples
- **826** tar files (~1.1 GB each)
- ~878 GB total

## Format

Each tar contains paired `.flac` + `.json` files.

**JSON fields:**

- `caption`: the unified detailed caption with temporal aspects
- `caption_type`: always `unified_detailed_caption_with_temporal_aspects`
- `transcription`: speech transcription (when available, normalized from multiple source keys)
- `duration`: audio duration (when available)
- `characters_per_second`: CPS (when available)
- `score_*`: quality scores (when available)
- `emotion_whisper*`: emotion scores (when available)
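The paired `.flac` + `.json` layout described above can be read with Python's standard `tarfile` module. The following is a minimal sketch, not an official loader: it assumes that the two files of a sample share the same name up to the extension (the card only says they are paired), and the helper names `iter_pairs` and `_demo_tar` are hypothetical. The demo runs against a tiny synthetic archive with placeholder bytes instead of a real shard.

```python
import io
import json
import tarfile
from typing import Dict, Iterator, Tuple


def iter_pairs(fileobj) -> Iterator[Tuple[str, dict, bytes]]:
    """Yield (key, metadata, flac_bytes) for each .flac/.json pair in a tar.

    Assumption: both files of a sample share the same stem, e.g.
    "000001.flac" and "000001.json".
    """
    pending: Dict[str, Dict[str, bytes]] = {}
    with tarfile.open(fileobj=fileobj, mode="r:*") as tar:
        for member in tar:
            if not member.isfile():
                continue
            stem, dot, ext = member.name.rpartition(".")
            if not dot or ext not in ("flac", "json"):
                continue
            parts = pending.setdefault(stem, {})
            parts[ext] = tar.extractfile(member).read()
            if len(parts) == 2:  # both halves of the pair have been seen
                del pending[stem]
                yield stem, json.loads(parts["json"]), parts["flac"]


def _demo_tar() -> io.BytesIO:
    """Build a one-sample synthetic archive (placeholder audio bytes)."""
    meta = {
        "caption": "A calm voice speaks, then pauses.",
        "caption_type": "unified_detailed_caption_with_temporal_aspects",
    }
    files = (
        ("000001.flac", b"fLaC-placeholder"),
        ("000001.json", json.dumps(meta).encode("utf-8")),
    )
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, payload in files:
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    buf.seek(0)
    return buf


samples = list(iter_pairs(_demo_tar()))
for key, meta, audio in samples:
    # Optional fields are only present "when available", so use .get().
    print(key, meta["caption_type"], meta.get("duration"), len(audio))
```

For a real shard, pass a file opened in binary mode, for example `open(path, "rb")`, where `path` points to one of the downloaded tar files. Fields such as `transcription`, `duration`, and `characters_per_second` may be absent, so reading them with `.get()` avoids a `KeyError`.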
Provider:
TTS-AGI