five

Annotated Audio Dataset of Bangladeshi English Speakers for Pause-Based Fluency Analysis

收藏
Figshare2025-06-20 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Annotated_Audio_Dataset_of_Bangladeshi_English_Speakers_for_Pause-Based_Fluency_Analysis/29369678
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains 57 audio recordings of spoken English collected for the purpose of studying oral fluency, specifically through the analysis of filled and silent pauses. The speakers are Bangladeshi English speakers, and the recordings represent various fluency levels.The audio files are stored in the data/ folder and are available in multiple formats, including .mp3, .wav, and .m4a.Each audio file has a corresponding annotation file in JSON format, located in the JSON/ folder.The JSON files include detailed time-stamped annotations of pause markers (both filled and silent), speaker metadata, and fluency-related labels used for machine learning tasks.This dataset is intended for use in speech processing, NLP, language learning research, and machine learning applications related to fluency assessment. It has been used in research involving transformer-based models, pause detection, and low-resource learning scenarios.
创建时间:
2025-06-20
二维码
社区交流群
二维码
科研交流群
商业服务