
threll-ai/common_voice_22_0

Source: Hugging Face. Updated 2025-07-03; indexed 2025-07-05.
Download link:
https://hf-mirror.com/datasets/threll-ai/common_voice_22_0
Description:
The dataset provides two configurations, nn-NO and nb-NO, each consisting of audio clips with associated metadata. Every clip comes with an audio file sampled at 48,000 Hz, its text content (sentence), vote counts (up_votes and down_votes), and the fields age, gender, accent, locale, segment, variant, and a continuation tag. Each configuration is split into training, validation, and test sets:

nn-NO: 464 training, 405 validation, and 423 test examples
nb-NO: 227 training, 33 validation, and 116 test examples
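The split sizes quoted in the description can be tabulated and totalled with a short script. The following is a minimal sketch, not an official loader. The repository id is taken from the download link, and the configuration names come from the description. The split keys `train`, `validation`, and `test` are assumed to follow the usual Hugging Face naming, and the helper names `total_examples` and `load_split` are hypothetical. Loading requires the `datasets` library and network access, and it may fail if the repository is gated or has changed since it was indexed.

```python
"""Sketch: summarize and optionally load threll-ai/common_voice_22_0."""

# Repository id, taken from the download link in the listing.
REPO_ID = "threll-ai/common_voice_22_0"

# Split sizes exactly as stated in the dataset description.
# Assumption: the split keys use the usual Hugging Face names.
SPLIT_SIZES = {
    "nn-NO": {"train": 464, "validation": 405, "test": 423},
    "nb-NO": {"train": 227, "validation": 33, "test": 116},
}


def total_examples(config: str) -> int:
    """Return the number of examples across all splits of one configuration."""
    return sum(SPLIT_SIZES[config].values())


def load_split(config: str, split: str = "train"):
    """Load one split of one configuration (hypothetical helper).

    The import is deferred so the summary code above works even when the
    `datasets` library is not installed.
    """
    if config not in SPLIT_SIZES:
        raise ValueError(f"unknown configuration: {config!r}")
    if split not in SPLIT_SIZES[config]:
        raise ValueError(f"unknown split: {split!r}")
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)


if __name__ == "__main__":
    # Print a per-configuration summary without touching the network.
    for config, sizes in SPLIT_SIZES.items():
        print(config, sizes, "total:", total_examples(config))
```

If the hosted data still matches the description, a call such as `load_split("nb-NO", "validation")` should return a split with 33 rows. Comparing `len(...)` of each loaded split against `SPLIT_SIZES` is a quick way to check that.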
Provider:
threll-ai