Speaker-Diarization-Instructions

Name: Speaker-Diarization-Instructions
Creator: maas
Published: 2025-12-05 16:51:27
License: no description provided

ModelScope community; updated 2025-12-05; indexed 2025-12-06

Download link:

https://modelscope.cn/datasets/mesolitica/Speaker-Diarization-Instructions

Description:

# Speaker-Diarization-Instructions

This dataset converts the diarization datasets from https://huggingface.co/diarizers-community into a speech-instruction dataset. Audio is chunked to a maximum of 30 seconds, because most speech encoders used with LLMs are derived from the Whisper encoder.

**We strongly recommend excluding the AMI test sets of both AMI-IHM and AMI-SDM from any training set to prevent contamination. This dataset is intended to serve as a speaker diarization benchmark.**

## How to prepare the dataset

```bash
huggingface-cli download \
  mesolitica/Speaker-Diarization-Instructions \
  --include "*.zip" \
  --repo-type "dataset" \
  --local-dir './'
unzip 0-0.zip
```

## Acknowledgement

Special thanks to https://www.sns.com.my and Nvidia for the 8x H100 node!
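The 30-second limit mentioned in the description can be illustrated with a small sketch. This is not the procedure the dataset authors used, which the description does not specify; it only shows one plausible way to cut speaker-labelled segments into fixed windows of at most 30 seconds. The names `Segment` and `chunk_segments` are hypothetical and introduced here for illustration only.

```python
from dataclasses import dataclass
from typing import List

# Assumption: 30 s matches the fixed input window of the Whisper encoder.
MAX_CHUNK_SECONDS = 30.0


@dataclass
class Segment:
    """One speaker turn (hypothetical structure, not the dataset's schema)."""
    speaker: str
    start: float  # seconds
    end: float    # seconds


def chunk_segments(
    segments: List[Segment], max_len: float = MAX_CHUNK_SECONDS
) -> List[List[Segment]]:
    """Split speaker turns into consecutive windows of at most max_len seconds.

    Turns that cross a window boundary are cut at the boundary, and all
    times are re-expressed relative to the start of their window.
    Windows containing no speech are returned as empty lists.
    """
    if not segments:
        return []

    total = max(s.end for s in segments)
    chunks: List[List[Segment]] = []
    window_start = 0.0

    while window_start < total:
        window_end = window_start + max_len
        window: List[Segment] = []
        for s in segments:
            # Clip the turn to the current window.
            lo = max(s.start, window_start)
            hi = min(s.end, window_end)
            if lo < hi:
                window.append(
                    Segment(s.speaker, lo - window_start, hi - window_start)
                )
        chunks.append(window)
        window_start = window_end

    return chunks
```

For example, a 40-second recording in which speaker B talks from 25 s to 40 s would yield two windows, with B's turn split at the 30-second boundary.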

Provider:

maas

Created:

2025-10-02
