Nexdata/Indonesian_Conversational_Speech_Data_by_Telephone
Source: Hugging Face. Updated 2024-04-16; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Nexdata/Indonesian_Conversational_Speech_Data_by_Telephone
---
task_categories:
- automatic-speech-recognition
language:
- id
---
# Dataset Card for Nexdata/Indonesian_Conversational_Speech_Data_by_Telephone
## Description
This dataset contains 89 hours of Indonesian conversational speech collected by telephone from 124 native speakers, with a balanced gender ratio. Speakers chose a few familiar topics from a given list and held conversations on them, which keeps the dialogues fluent and natural. The recordings were made on various mobile phones in quiet indoor environments. The audio format is 8 kHz, 8-bit, u-law PCM. All audio was manually transcribed, with the text content, the start and end time of each effective sentence, and speaker identification.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1311?source=Huggingface
# Specifications
## Format
8 kHz, 8-bit, u-law PCM, mono channel;
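
As a rough illustration of what this format implies for consumers of the audio, the sketch below expands 8-bit u-law bytes to 16-bit linear PCM using the standard G.711 expansion and estimates clip duration from the byte count. It assumes the input is raw, headerless u-law data; the card does not state the container format, and the function names `decode_ulaw` and `duration_seconds` are hypothetical helpers, not part of any Nexdata tooling.

```python
SAMPLE_RATE_HZ = 8000  # from the card: 8 kHz, 8-bit u-law, mono
ULAW_BIAS = 0x84       # bias constant used by G.711 u-law


def ulaw_byte_to_linear(u: int) -> int:
    """Expand one 8-bit u-law code to a signed 16-bit linear sample."""
    u = ~u & 0xFF                  # u-law codes are stored bit-inverted
    sign = u & 0x80
    exponent = (u >> 4) & 0x07
    mantissa = u & 0x0F
    magnitude = (((mantissa << 3) + ULAW_BIAS) << exponent) - ULAW_BIAS
    return -magnitude if sign else magnitude


def decode_ulaw(data: bytes) -> list[int]:
    """Decode raw u-law bytes (assumed headerless) to linear PCM samples."""
    return [ulaw_byte_to_linear(b) for b in data]


def duration_seconds(num_bytes: int, sample_rate: int = SAMPLE_RATE_HZ) -> float:
    """8-bit mono audio stores one sample per byte."""
    return num_bytes / sample_rate
```

At one byte per sample and 8000 samples per second, one minute of mono audio occupies about 480,000 bytes before any container overhead.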
## Environment
quiet indoor environment, without echo;
## Recording content
dozens of topics are specified, and the speakers hold dialogues on those topics during recording;
## Demographics
140 speakers in total, 54% male and 46% female
## Annotation
transcription text, speaker identification, and gender
## Device
Android mobile phone, iPhone;
## Language
Indonesian;
## Application scenarios
speech recognition; voiceprint recognition;
## Accuracy rate
the word accuracy rate is at least 98%
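
The card does not say how word accuracy is measured. One common convention, assumed here, is one minus the word error rate: the word-level edit distance between a checked reference transcript and the delivered transcript, divided by the number of reference words. A minimal sketch under that assumption follows; the function names and the example sentences are illustrative only.

```python
def word_edit_distance(ref: list[str], hyp: list[str]) -> int:
    """Levenshtein distance over words (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,                          # deletion
                cur[j - 1] + 1,                       # insertion
                prev[j - 1] + (ref_word != hyp_word)  # substitution or match
            ))
        prev = cur
    return prev[-1]


def word_accuracy(reference: str, hypothesis: str) -> float:
    """Return 1 - WER, assuming whitespace-separated words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return 1.0 - word_edit_distance(ref, hyp) / len(ref)
```

For example, `word_accuracy("saya mau pergi ke pasar", "saya mau ke pasar")` counts one deleted word out of five reference words. Under this definition, a 98% rate corresponds to at most two word errors per hundred reference words.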
# Licensing Information
Commercial License
Provider:
Nexdata



