Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

Name: Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone
Creator: Nexdata
Published: 2024-04-17 01:55:13
License: No description available

Hugging Face, updated 2024-04-17, indexed 2024-03-04

Download link:

https://hf-mirror.com/datasets/Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

Resource overview:

---
YAML tags:
- copy-paste the tags obtained with the tagging app: https://github.com/huggingface/datasets-tagging
task_categories:
- conversational
language:
- it
---

# Dataset Card for Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

## Description
About 700 speakers took part in the recording and talked face to face in a natural way. They freely discussed a number of given topics covering a wide range of fields; the speech is natural and fluent, in line with real dialogue scenarios. The text was transcribed manually, with high accuracy.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1178?source=Huggingface

## Format
16 kHz, 16-bit, uncompressed WAV, mono channel

## Recording Environment
Quiet indoor environment, without echo

## Recording content
Dozens of topics are specified, and the speakers hold dialogues on those topics while the recording is performed

## Demographics
About 700 people

## Annotation
Transcription text, speaker identification and gender

## Device
Android mobile phone, iPhone

## Language
Italian

## Application scenarios
Speech recognition; voiceprint recognition

## Accuracy rate
The word accuracy rate is not less than 98%

# Licensing Information
Commercial License

Provider:

Nexdata

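Summary of original information

Dataset card: Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone

Description

About 700 speakers took part in the recording and talked face to face in a natural way. They freely discussed a number of given topics covering a wide range of fields; the speech is natural and fluent, in line with real dialogue scenarios. The text was transcribed manually, with high accuracy.

Format

16 kHz, 16-bit, uncompressed WAV, mono channel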
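
The format stated above (16 kHz, 16-bit, uncompressed WAV, mono) can be checked on a downloaded file with Python's standard `wave` module. The following is a minimal sketch, not part of the dataset's tooling: the function names `check_wav_format` and `make_test_wav` are hypothetical, and the in-memory silent clip only stands in for a real recording.

```python
import io
import wave

# Expected values taken from the Format section of the card.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bit
EXPECTED_CHANNELS = 1  # mono


def check_wav_format(source):
    """Return a list of mismatches; source is a path or a binary file object."""
    problems = []
    with wave.open(source, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"{wav.getnchannels()} channels")
        if wav.getcomptype() != "NONE":
            problems.append(f"compression {wav.getcomptype()}")
    return problems


def make_test_wav(rate, channels):
    """Build a short silent 16-bit WAV in memory (stand-in for a real file)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * channels * 160)
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(check_wav_format(make_test_wav(16000, 1)))  # conforming clip
    print(check_wav_format(make_test_wav(8000, 2)))   # non-conforming clip
```

An empty list means the file matches the stated format; any entries describe the properties that differ.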

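Recording environment

Quiet indoor environment, without echo

Recording content

Dozens of topics are specified, and the speakers hold dialogues on those topics while the recording is performed

Demographics

About 700 people

Annotation

Transcription text, speaker identification and gender

Device

Android mobile phone, iPhone

Language

Italian

Application scenarios

Speech recognition; voiceprint recognition

Accuracy rate

The word accuracy rate is not less than 98%.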
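
The card does not say how the word accuracy rate is computed. One common reading, assumed here, is 1 minus the word error rate, where the error count is the word-level edit distance between a reference transcript and the transcript being checked. The sketch below illustrates that assumption only; the function names and the Italian sample sentences are hypothetical and do not come from the dataset.

```python
def word_edit_distance(reference, hypothesis):
    """Levenshtein distance between two word lists (substitute, insert, delete)."""
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(reference_text, hypothesis_text):
    """Assumed definition: 1 - (word edit distance / reference word count)."""
    reference = reference_text.split()
    hypothesis = hypothesis_text.split()
    if not reference:
        raise ValueError("reference transcript is empty")
    return 1.0 - word_edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Hypothetical sentences: one substituted word out of four.
    print(word_accuracy("buongiorno come stai oggi", "buongiorno come sta oggi"))
```

Under this definition, a transcript meeting the stated threshold would have at most 2 word errors per 100 reference words.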

Licensing information

Commercial License
