KTTS Single Speaker Dataset

NIAID Data Ecosystem2026-05-02 收录

下载链接：

https://data.mendeley.com/datasets/5c4dcvxdmb

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset has been primarily developed to facilitate the creation of text-to-speech systems for the Kashmiri language, a digitally underrepresented language predominantly spoken in the Jammu and Kashmir region of India. The dataset comprises 2,984 audio recordings in WAV format, each accompanied by its corresponding textual data in a separate file named ‘textcorpus.csv’. The ‘id’ column in the CSV file serves as a unique identifier, allowing users to efficiently locate the corresponding WAV files, which are systematically named according to the ‘id’ associated with the sentences they contain. All recordings feature a single male voice with a sample rate of 48,000 Hz, ensuring high-quality audio suitable for detailed phonetic analysis and machine learning applications. This consistent audio quality across the dataset provides a reliable foundation for training and testing text-to-speech models. Furthermore, the dataset can be a valuable resource for future research and development efforts aimed at enhancing digital accessibility for the Kashmiri-speaking population.

创建时间：

2024-08-29

5,000+

优质数据集

54 个

任务类型

进入经典数据集