Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone
Source: Hugging Face. Updated 2024-04-17; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone
Resource description:
---
task_categories:
- conversational
language:
- it
---
# Dataset Card for Nexdata/Italian_Conversational_Speech_Data_by_Mobile_Phone
## Description
About 700 speakers took part in the recordings, holding natural face-to-face conversations. They talked freely about a number of assigned topics covering a wide range of fields; the speech is natural and fluent, consistent with real dialogue scenarios. The text was transcribed manually, with high accuracy.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1178?source=Huggingface
## Format
16 kHz, 16-bit, uncompressed WAV, mono channel.
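
The format stated above can be checked on a downloaded file. The following is a minimal sketch using Python's standard `wave` module; the function name `check_wav_format` and its default arguments are illustrative assumptions, not part of the dataset or of any Nexdata tooling.

```python
import wave


def check_wav_format(path, rate=16000, sample_width_bytes=2, channels=1):
    """Return True if the WAV file matches the expected format.

    The defaults mirror the format stated in this card:
    16 kHz sample rate, 16-bit samples (2 bytes), mono.
    `check_wav_format` is a hypothetical helper for illustration.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == rate
            and wav.getsampwidth() == sample_width_bytes
            and wav.getnchannels() == channels
        )
```

Calling `check_wav_format("some_recording.wav")` on a local file (the file name here is a placeholder) returns `False` for any file that was resampled, re-quantized, or stored as stereo.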
## Recording Environment
Quiet indoor environment, without echo.
## Recording content
Dozens of topics are specified, and the speakers converse on those topics while being recorded.
## Demographics
About 700 people.
## Annotation
Transcription text, speaker identification, and gender are annotated.
## Device
Android mobile phone, iPhone.
## Language
Italian
## Application scenarios
Speech recognition; voiceprint recognition.
## Accuracy rate
The word accuracy rate is not less than 98%.
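
The card does not say how the word accuracy rate is measured. One common convention, assumed here purely for illustration, is one minus the word error rate, where the word error rate is the word-level edit distance between a reference transcript and the transcript under test, divided by the number of reference words. A minimal sketch under that assumption (the function name `word_accuracy` is hypothetical):

```python
def word_accuracy(reference: str, hypothesis: str) -> float:
    """Return 1 - WER, with WER computed from word-level edit distance.

    Assumption: this is one common definition of word accuracy, not
    necessarily the one used to produce the figure in this card.
    The result can be negative if the hypothesis has many extra words.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far (minus the current one) and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,                      # deletion
                    curr[j - 1] + 1,                  # insertion
                    prev[j - 1] + substitution_cost,  # match or substitution
                )
            )
        prev = curr
    return 1.0 - prev[-1] / len(ref)
```

Under this definition, a transcript with one wrong word out of four reference words scores 0.75, so a 98% threshold corresponds to at most two word errors per hundred reference words.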
# Licensing Information
Commercial License
Provider:
Nexdata



