Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini-tokenised

Name: Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini-tokenised
Creator: Thorsten-Voice
Published: 2025-12-12 21:34:19
License: not specified in this field (the description states CC0 1.0)

Source: Hugging Face | Updated: 2025-12-12 | Indexed: 2025-12-20

Download link:

https://hf-mirror.com/datasets/Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini-tokenised
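The link above points at a mirror of the Hugging Face Hub. A minimal sketch of loading the dataset through that mirror with the `datasets` library follows. It assumes that `datasets` is installed and that the mirror honours the `HF_ENDPOINT` environment variable read by `huggingface_hub`. The helper name `load_tokenised` is hypothetical, and no split or column names are assumed because the listing does not give them.

```python
import os

# Repository id taken from the listing above.
REPO_ID = "Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini-tokenised"

# Assumption: route Hub traffic through the mirror. This must be set before
# huggingface_hub / datasets are imported, hence the lazy import below.
os.environ.setdefault("HF_ENDPOINT", "https://hf-mirror.com")


def load_tokenised(repo_id: str = REPO_ID):
    """Download the dataset and return whatever splits the repository defines."""
    from datasets import load_dataset  # imported lazily so HF_ENDPOINT applies

    return load_dataset(repo_id)


if __name__ == "__main__":
    dataset = load_tokenised()
    # Inspect the splits and features rather than assuming their names.
    print(dataset)
```

Printing the returned object is the safest first step, since it shows which splits and token columns the repository actually provides.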

Description:

This dataset is the tokenised version of Thorsten-Voice/TV-24kHz-2025.12-Neutral-FT-Mini, prepared specifically for direct fine-tuning of Orpheus TTS Thorsten-Voice. It contains 60 tokenised German speech samples, optimised for fast experimentation and precise speaker adaptation. Intended uses are lightweight Orpheus TTS fine-tuning, speaker identity refinement, and prosody and articulation adjustments. The speaker is Thorsten, the language is German, and the recordings date from December 2025. The audio was resampled to 24,000 Hz, loudness-normalised to -24 dB, and tokenised using Orpheus TTS preprocessing. The dataset is released under the Creative Commons Zero (CC0 1.0) licence, which permits any use, including commercial and derivative works, with no attribution requirement.
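To make the loudness step concrete, here is a minimal sketch of normalising a signal to -24 dB. The description does not say which loudness measure was used, so this sketch assumes a plain RMS level in dBFS; the actual pipeline may use a perceptual measure such as LUFS instead. The function names and the 220 Hz test tone are illustrative only.

```python
import math

SAMPLE_RATE = 24_000  # sample rate stated in the description
TARGET_DB = -24.0     # target level stated in the description (measure assumed: RMS dBFS)


def rms_dbfs(samples):
    """RMS level of float samples (nominal range -1..1) in dB relative to full scale."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20.0 * math.log10(rms) if rms > 0 else float("-inf")


def normalise_rms(samples, target_db=TARGET_DB):
    """Scale samples by a single gain so that their RMS level equals target_db."""
    current = rms_dbfs(samples)
    if current == float("-inf"):
        return list(samples)  # pure silence: no gain can reach the target
    gain = 10.0 ** ((target_db - current) / 20.0)
    return [s * gain for s in samples]


# One second of a 220 Hz sine at half amplitude, standing in for a recording.
tone = [0.5 * math.sin(2 * math.pi * 220 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]
normalised = normalise_rms(tone)
print(round(rms_dbfs(tone), 2), round(rms_dbfs(normalised), 2))
```

A single gain factor changes the level without altering the waveform shape, which is why this kind of normalisation is usually applied before tokenisation rather than after.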

Provider:

Thorsten-Voice
