five

Replication Data for: Comparing Speech-to-Text Algorithms for Transcribing Voice Data from Surveys

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://doi.org/10.7910/DVN/BU3T7F
下载链接
链接失效反馈
官方服务:
资源简介:
This Dataverse contains files required to replicate the results, figures and plots displayed in the research paper. Figures: The jupyter file "infographs-paper.ipynb" can load any wav-format file to create figures like the first three in the paper (Figure 1-3). The first visualizes the waveform of an audiofile. Figure 2 displays the waveform and time ranges (frames). The third figure creates a Mel Spectrogram. Plot: The barplot (Figure 4) displaying the Word-Error-Rates by ASR System as well as the average Word-Error-Rates in section "Results" is created by "code-main.R" file. The corresponding dataset in csv-format used here is called "transcript-data.tab". Sample Description: The sample is described in section "Data" of the research paper with additional information (age, educational level and difficulty of survey). The corresponding mean and standard deviation values are calculated in "sample-description.R" by loading the csv-formatted dataset "sample-description-data.tab". Code for Automated Speech Recognition: The original Python (and R) code used for applying various ASR systems to our data can be found as python notebooks.
创建时间:
2025-01-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作