VoiceAssistant-430K-vocalnet

Name: VoiceAssistant-430K-vocalnet
Creator: maas
Published: 2025-12-18 14:03:04
License: 暂无描述

魔搭社区2025-12-18 更新2025-06-14 收录

下载链接：

https://modelscope.cn/datasets/VocalNet/VoiceAssistant-430K-vocalnet

下载链接

链接失效反馈

官方服务：

资源简介：

# VoiceAssistant-430K-vocalnet This dataset supports the reproduction of [VocalNet](https://github.com/SJTU-OmniAgent/VocalNet). ## Data Construction 1. **Data Source**: We used the [VoiceAssistant-400K](https://huggingface.co/datasets/gpt-omni/VoiceAssistant-400K) from Mini-Omni, which contains about 470K instances. 2. **Data Filtering**: We removed samples with excessively long data. The resulted corpus contains 430K instances. 3. **Response Speech**: We perform speech synthesis using [CosyVoice](https://github.com/FunAudioLLM/CosyVoice) to generate the response speech. 4. **Response Token**: We generate the speech token using [CosyVoice2](https://github.com/FunAudioLLM/CosyVoice). ## Acknowledgment 1. The original data is from [Mini-Omni](https://github.com/gpt-omni/mini-omni). 2. The generation of speech wave and token is from [CosyVoice/CosyVoice2](https://github.com/gpt-omni/mini-omni).

# VoiceAssistant-430K-vocalnet 本数据集可用于复现[VocalNet](https://github.com/SJTU-OmniAgent/VocalNet)。 ## 数据集构建 1. **数据来源**：我们使用了Mini-Omni发布的[VoiceAssistant-400K](https://huggingface.co/datasets/gpt-omni/VoiceAssistant-400K)数据集，该数据集共包含约47万个样本实例。 2. **数据筛选**：我们移除了时长过长的样本，最终得到包含43万个样本实例的语料库。 3. **应答语音**：我们采用[CosyVoice](https://github.com/FunAudioLLM/CosyVoice)进行语音合成，以生成应答语音。 4. **应答Token(Token)**：我们通过[CosyVoice2](https://github.com/FunAudioLLM/CosyVoice)生成语音Token。 ## 致谢 1. 原始数据集源自[Mini-Omni](https://github.com/gpt-omni/mini-omni)。 2. 语音波形与Token的生成工作基于[CosyVoice/CosyVoice2](https://github.com/gpt-omni/mini-omni)实现。

提供机构：

maas

创建时间：

2025-04-20

5,000+

优质数据集

54 个

任务类型

进入经典数据集