Annotated Audio Dataset of Bangladeshi English Speakers for Pause-Based Fluency Analysis

Source: Figshare. Updated: 2025-06-20. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/dataset/Annotated_Audio_Dataset_of_Bangladeshi_English_Speakers_for_Pause-Based_Fluency_Analysis/29369678/1
Description:
This dataset contains 57 audio recordings of spoken English, collected to study oral fluency through the analysis of filled and silent pauses. The speakers are Bangladeshi English speakers, and the recordings represent various fluency levels. The audio files are stored in the data/ folder and are available in multiple formats, including .mp3, .wav, and .m4a. Each audio file has a corresponding annotation file in JSON format, located in the JSON/ folder. The JSON files include detailed time-stamped annotations of pause markers (both filled and silent), speaker metadata, and fluency-related labels used for machine learning tasks. This dataset is intended for use in speech processing, NLP, language learning research, and machine learning applications related to fluency assessment. It has been used in research involving transformer-based models, pause detection, and low-resource learning scenarios.
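
The folder layout described above suggests a simple way to pair each recording with its annotation file. The sketch below is a minimal illustration, not the authors' tooling. It assumes that an audio file and its JSON annotation share the same base name (for example, data/x.wav and JSON/x.json), which the description does not confirm. The function name pair_audio_with_annotations is hypothetical.

```python
from pathlib import Path

# Audio formats named in the dataset description.
AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a"}


def pair_audio_with_annotations(root):
    """Pair audio files in <root>/data with JSON files in <root>/JSON.

    ASSUMPTION: an audio file and its annotation share the same base
    name. The dataset description does not state the naming scheme.
    Returns (pairs, unmatched): a list of (audio_path, json_path)
    tuples and a list of audio paths with no matching annotation.
    """
    root = Path(root)
    # Index annotation files by base name for constant-time lookup.
    annotations = {p.stem: p for p in (root / "JSON").glob("*.json")}

    pairs, unmatched = [], []
    for audio in sorted((root / "data").iterdir()):
        if audio.suffix.lower() not in AUDIO_EXTENSIONS:
            continue  # skip anything that is not a listed audio format
        match = annotations.get(audio.stem)
        if match is None:
            unmatched.append(audio)
        else:
            pairs.append((audio, match))
    return pairs, unmatched
```

Each matched annotation can then be read with the standard json module. The annotation schema is not given here, so it is safest to inspect the keys of one file before relying on any field names.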
Provided by:
Rittique Alam, Md
Created:
2025-06-20