
Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

Source: Hugging Face. Updated 2024-04-17; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone
Resource description:
---
YAML tags:
- copy-paste the tags obtained with the tagging app: https://github.com/huggingface/datasets-tagging
task_categories:
- conversational
language:
- it
---

# Dataset Card for Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

## Description

About 700 speakers took part in the recording and held natural, face-to-face conversations. They freely discussed a number of given topics covering a wide range of fields; the speech is natural and fluent, in line with real dialogue scenarios. The text was transcribed manually, with high accuracy.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1178?source=Huggingface

## Format

16 kHz, 16-bit, uncompressed WAV, mono channel.

## Recording Environment

Quiet indoor environment, without echo.

## Recording Content

Dozens of topics are specified, and the speakers hold dialogues on those topics while the recording is made.

## Demographics

About 700 people.

## Annotation

Transcription text, speaker identification, and gender.

## Device

Android mobile phone, iPhone.

## Language

Italian

## Application Scenarios

Speech recognition; voiceprint recognition.

## Accuracy Rate

The word accuracy rate is not less than 98%.

# Licensing Information

Commercial License
Provider:
Nexdata
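Summary of original information

Dataset card: Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

Description

About 700 participants took part in the recording and held natural, face-to-face conversations. They freely discussed a number of given topics across a wide range of fields; the speech is natural and fluent, in line with real dialogue scenarios. The text was transcribed manually, with high accuracy.

Format

16 kHz, 16-bit, uncompressed WAV, mono channel.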
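
The stated format (16 kHz, 16-bit, uncompressed WAV, mono) can be checked on a downloaded file before it goes into a training pipeline. The following is a minimal sketch using Python's standard `wave` module; the function name `matches_card_format` and its parameters are hypothetical, not part of the dataset or of any Nexdata tooling.

```python
import wave


def matches_card_format(path, rate=16000, sample_width_bytes=2, channels=1):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono, uncompressed.

    Hypothetical helper: the default values mirror the "Format" entry of the
    dataset card (16 bits = 2 bytes per sample).
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == rate
            and wav.getsampwidth() == sample_width_bytes
            and wav.getnchannels() == channels
            and wav.getcomptype() == "NONE"  # "NONE" means uncompressed PCM
        )
```

Calling `matches_card_format("some_recording.wav")` on each file (the file name here is a placeholder) gives a quick way to flag recordings that were resampled or converted after download.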

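Recording environment

Quiet indoor environment, without echo.

Recording content

Dozens of topics are specified, and the participants hold dialogues on those topics while the recording is made.

Demographics

About 700 people.

Annotation

Transcription text, speaker identification, and gender.

Device

Android mobile phone, iPhone.

Language

Italian.

Application scenarios

Speech recognition; voiceprint recognition.

Accuracy rate

The word accuracy rate is not less than 98%.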
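
The card does not say how the word accuracy rate is measured. One common convention, assumed here purely for illustration, is one minus the word error rate: the word-level edit distance between a reference transcript and the transcript under test, divided by the number of reference words. The function `word_accuracy` below is a hypothetical sketch of that convention, not the provider's method.

```python
def word_accuracy(reference, hypothesis):
    """Word accuracy as 1 - (word-level edit distance / reference length).

    Assumption: whitespace tokenization and the 1 - WER definition; the
    dataset card does not specify either.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return 1.0 - prev[-1] / len(ref)
```

For example, `word_accuracy("ciao come stai oggi", "ciao come sta oggi")` yields 0.75, since one of four reference words is substituted. Under this convention, a 98% threshold corresponds to at most two word errors per hundred reference words.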

Licensing information

Commercial License