ggfox00000/stt-summre-fr-test

Source: Hugging Face. Updated 2026-04-28; indexed 2026-05-03.
Download link:
https://hf-mirror.com/datasets/ggfox00000/stt-summre-fr-test
Description:
The SUMM-RE French test split is a test set for automatic speech recognition (ASR) and voice activity detection (VAD) on French meeting-style conversation. It contains 124 individual audio tracks from 37 distinct meetings, with 3-4 microphone recordings per meeting. The audio is stored as 16 kHz mono 16-bit PCM WAV (PCM_16), embedded in Parquet files. Annotations comprise manual transcriptions and word-level forced alignments. The total duration is approximately 41 hours, and the license is CC-BY-SA-4.0.
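The audio format stated in the description (16 kHz, mono, 16-bit PCM) can be checked on any track once its WAV bytes are in memory. The following is a minimal sketch using only the Python standard library. The helper names `describe_wav`, `matches_card`, and `make_silence` are hypothetical, and the sketch runs on a synthetic silent clip rather than on the dataset itself, because the listing does not name the Parquet columns that hold the audio.

```python
import io
import wave


def describe_wav(wav_bytes: bytes) -> dict:
    """Return basic format facts for a WAV payload held in memory."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "sample_rate": rate,
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "duration_s": frames / rate,
        }


def matches_card(info: dict) -> bool:
    """True if the track is 16 kHz, mono, 16-bit PCM, as the description states."""
    return (
        info["sample_rate"] == 16000
        and info["channels"] == 1
        and info["sample_width_bytes"] == 2
    )


def make_silence(seconds: float, rate: int = 16000) -> bytes:
    """Build a synthetic silent WAV (assumption: stands in for a real track)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    return buf.getvalue()


if __name__ == "__main__":
    info = describe_wav(make_silence(0.5))
    print(info, matches_card(info))
```

With the real data, the bytes passed to `describe_wav` would come from whichever Parquet column stores the embedded WAV payload. Summing `duration_s` over all 124 tracks would give a figure to compare against the stated total of about 41 hours.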
Provider:
ggfox00000