five

jenny_tts_dataset

收藏
魔搭社区2025-05-30 更新2025-03-08 收录
下载链接:
https://modelscope.cn/datasets/pengzhendong/jenny_tts_dataset
下载链接
链接失效反馈
官方服务:
资源简介:
# Jenny TTS Dataset A high-quality, varied ~30hr voice dataset suitable for training a TTS model. Voice is recorded by Jenny. She's Irish. Material read include: - Newspaper headlines - Transcripts of various Youtube videos - About 2/3 of the book '1984' - Some of the book 'Little Women' - Wikipedia articles, different topics (philosophy, history, science) - Recipes - Reddit comments - Song lyrics, including rap lyrics - Transcripts to the show 'Friends' Audio files are 48khz, 16-bit PCM files, 2 Channels (a single microphone was used.. hmm). Some light preprocessing was done when the text was taken from the raw sources. A breakdown of where different material starts and ends can be reconstructed. Further information to follow. # Important The audiofiles are raw from the microphone, not trimmed. In some cases there are a few seconds of silence, sometimes a light 'knock' is audible at the beginning of the clip, where Jenny was hitting the start key. These issues will need to be addressed before training a TTS model. I'm a bit short on time these days, help welcome. License - Attribution is required in software/websites/projects/interfaces (including voice interfaces) that generate audio in response to user action using this dataset. Atribution means: the voice must be referred to as "Jenny", and where at all practical, "Jenny (Dioco)". Attribution is not required when distributing the generated clips (although welcome). Commercial use is permitted. Don't do unfair things like claim the dataset is your own. No further restrictions apply. Jenny is available to produce further recordings for your own use. Mail dioco@dioco.io

# 珍妮TTS数据集 这是一份高质量、内容丰富的约30小时语音数据集,适用于训练语音合成(Text-to-Speech,简称TTS)模型。 该数据集的语音由珍妮录制,她为爱尔兰籍人士。 朗读覆盖的文本素材包括: - 报纸头条 - 各类YouTube视频的字幕文本 - 书籍《1984》的约三分之二内容 - 书籍《小妇人》的部分章节 - 不同主题的维基百科条目(涵盖哲学、历史、科学等领域) - 食谱文本 - Reddit平台的评论内容 - 歌词文本,包含说唱歌词 - 美剧《老友记》的剧集字幕文本 音频文件采用48kHz采样率、16位脉冲编码调制(Pulse Code Modulation,简称PCM)格式,双通道录制(实际仅使用单麦克风录制,特此备注)。 在从原始数据源提取文本时,已进行了轻度预处理。不同文本素材的起止边界可进行重构,后续将提供更多相关信息。 ## 重要说明 本数据集的音频文件均为麦克风原始录制内容,未经过裁剪处理。部分音频片段开头存在数秒静音,有时还会录到珍妮按下录制键时产生的轻微敲击声。在使用该数据集训练TTS模型前,需要对这些问题进行处理。目前我时间较为紧张,欢迎有兴趣的开发者协助处理。 ### 授权协议 使用本数据集生成响应用户操作的音频的软件、网站、项目及交互界面(含语音交互界面),需进行署名标注。署名要求为:将该语音称为‘珍妮’,在条件允许的情况下需标注为‘珍妮(Dioco)’。分发生成的音频片段时无需进行署名(标注将不胜感激)。允许商业使用。禁止将本数据集宣称为您自己的作品等不公平行为。无其他额外限制条款。 珍妮可为您的项目录制更多语音素材,如有需求可发送邮件至dioco@dioco.io。
提供机构:
maas
创建时间:
2025-03-05
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作