nineninesix/jinsaryko-tifa-en-nano-codec-dataset

Name: nineninesix/jinsaryko-tifa-en-nano-codec-dataset
Creator: nineninesix
Published: 2025-09-20 13:56:30
License: 暂无描述

Hugging Face2025-09-20 更新2025-10-18 收录

下载链接：

https://hf-mirror.com/datasets/nineninesix/jinsaryko-tifa-en-nano-codec-dataset

下载链接

链接失效反馈

官方服务：

资源简介：

Tifa EN Nano-Codec数据集是基于Tifa数据集使用NVIDIA的NeMo音频编解码器进行重新编码得到的，包含纳米级音频标记。该数据集旨在用于微调多模态大型语言模型（LLM）和语音系统（TTS/ASR），这些系统依赖于基于编解码器的音频标记表示。数据集包括发音转录、说话者标识、四层量化音频表示和编码音频标记的序列长度。

The Tifa EN Nano-Codec Dataset is built upon the Tifa dataset and re-encoded using NVIDIAs NeMo Audio Codec into nano audio tokens. It is designed for fine-tuning multimodal LLMs and speech systems (TTS/ASR) that rely on codec-based audio token representations. The dataset includes utterance transcriptions, speaker identifiers, four-layer quantized audio representations, and the sequence length of encoded audio tokens.

提供机构：

nineninesix

5,000+

优质数据集

54 个

任务类型

进入经典数据集