International Multi-Speaker AUDIO (+Video) Dataset with Stems, Diarized Transcripts, Scene ...
收藏Databricks2025-11-19 收录
下载链接:
https://marketplace.databricks.com/details/9665e582-cbe3-464b-bb26-b989f036791e/ACNetwork_International-Multi-Speaker-AUDIO-(+Video)-Dataset-with-Stems,-Diarized-Transcripts,-Scene-
下载链接
链接失效反馈官方服务:
资源简介:
The ACNetwork Multilingual Conversational Media Dataset delivers 280K hours of UCG audio and video data across more than 20 languages: including English (70,665 hrs), Portuguese (38,282 hrs), Spanish (26,928 hrs), Russian (10,182 hrs), French (8,127 hrs), German (5,057 hrs), Italian (3,202 hrs), Japanese (2,228 hrs), and Korean (2,147 hrs)
Each file features high-fidelity audio with isolated stems, human-verified transcripts at word and utterance level, speaker diarization, emotion and tone labels, and scene metadata for video segments. All content is fully rights-cleared and indemnified for AI/LLM training and commercial reuse, making it ideal for speech-to-text, multimodal retrieval, cross-lingual fine-tuning, and emotion classification.
The dataset spans sports, news, entertainment, gaming, true crime, and more, with new content added monthly to ensure domain diversity and freshness. Delivery is via Google Drive or AWS S3, and data is provided in multiple formats (JSON, SRT, VTT, TXT).
This is the definitive global multilingual sports and talk dataset for enterprise-scale AI and LLM training - combining audio and video sources at unmatched depth and breadth. Enterprise licensing only - contact ACNetwork for commercial terms or expanded sample access.
提供机构:
ACNetwork



