five

Prosody Generation Using Back Propagation Neural Networks for Sindhi Speech Processing Applications

收藏
DataCite Commons2025-05-12 更新2025-05-17 收录
下载链接:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/EWSAWX
下载链接
链接失效反馈
官方服务:
资源简介:
Abstract Analysis and synthesis of speech to be automated still require more research efforts in general and for the development of speech processing applications based on Arabic Script like Sindh Textto-Speech in particular. To achieve the required results from the speech processing applications prosodic features must be exercised extremely as the prosody is highly linked with the information of sounds having different characteristics like linguistic rules, complications and variations of expressions. Objectives: This study aims to generate and analyze the prosodic information specifically pitch and duration from the recorded Sindhi sounds using the back propagation neural network. Methods: Two methods are used to obtain the prosodic information of Sindhi sounds, PRAAT speech analyser is used to obtain the results and for the validation a back propagation neural network model is implemented. From the four districts of Sindh 228 speakers were chosen and the sound of different descriptive sentences was recorded for the experiments. Finding: After the experiments with a neural network model with multiple layers on the collected sound, 98.8% a highly acceptable level of accuracy achieved at the 18th epoch among the 100 epochs. Application and improvements: The generated Sindhi prosodic information and adopted research methodology will be supportive to the scholars of Sindhi speech processing applications. This research work can be considered as the first step as no work for generating Sindhi prosody is found yet. Keywords: Sindhi Recorded Sounds, Pitch, Duration, Speech Analysis, Prosody Generation
提供机构:
Harvard Dataverse
创建时间:
2025-04-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作