KTTS Single Speaker Dataset
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://data.mendeley.com/datasets/5c4dcvxdmb
下载链接
链接失效反馈官方服务:
资源简介:
This dataset has been primarily developed to facilitate the creation of text-to-speech systems for the Kashmiri language, a digitally underrepresented language predominantly spoken in the Jammu and Kashmir region of India. The dataset comprises 2,984 audio recordings in WAV format, each accompanied by its corresponding textual data in a separate file named ‘textcorpus.csv’. The ‘id’ column in the CSV file serves as a unique identifier, allowing users to efficiently locate the corresponding WAV files, which are systematically named according to the ‘id’ associated with the sentences they contain.
All recordings feature a single male voice with a sample rate of 48,000 Hz, ensuring high-quality audio suitable for detailed phonetic analysis and machine learning applications. This consistent audio quality across the dataset provides a reliable foundation for training and testing text-to-speech models. Furthermore, the dataset can be a valuable resource for future research and development efforts aimed at enhancing digital accessibility for the Kashmiri-speaking population.
创建时间:
2024-08-29



