Nexdata/Thai_Conversational_Speech_Data_by_Telephone
收藏Hugging Face2024-04-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Nexdata/Thai_Conversational_Speech_Data_by_Telephone
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- th
task_categories:
- conversational
---
---
# Dataset Card for Nexdata/Pushtu_Conversational_Speech_Data_by_Telephone
## Description
The 1,077 Hours - Thai Conversational Speech Data involved 1,986 native speakers, developed with proper balance of gender ratio, Speakers would choose a few familiar topics out of the given list and start conversations to ensure dialogues' fluency and naturalness. The recording devices are various mobile phones. The audio format is 8kHz, 8bit, and all the speech data was recorded in quiet indoor environments. All the speech audio was manually transcribed with text content, the start and end time of each effective sentence, and speaker identification.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1210?source=Huggingface
# Specifications
## Format
8kHz, 8bit, mono channel;
## Recording Environment
quiet indoor environment, without echo;
## Recording content
dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;
## Demographics
1,986 speakers totally, with 41% male and 59% female;
## Annotation
annotating for the transcription text, speaker identification and gender
## Device
Telephony recording system;
## Language
Thai
## Application scenarios
speech recognition; voiceprint recognition;
## Accuracy rate
the word accuracy rate is not less than 95%
# Licensing Information
Commercial License
提供机构:
Nexdata
原始信息汇总
数据集卡片 Nexdata/Pushtu_Conversational_Speech_Data_by_Telephone
描述
1,077小时 - 泰语对话语音数据集涉及1,986名母语使用者,性别比例均衡。参与者从给定的话题列表中选择几个熟悉的话题进行对话,确保对话的流畅性和自然性。录音设备为各种手机。音频格式为8kHz, 8bit,所有语音数据在安静的室内环境中录制。所有语音音频均手动转录为文本内容,包括每句话的开始和结束时间以及说话人识别。
规格
格式
8kHz, 8bit, 单声道;
录音环境
安静的室内环境,无回声;
录音内容
指定数十个话题,录音时说话人在这些话题下进行对话;
人口统计
总共1,986名说话人,其中41%为男性,59%为女性;
标注
转录文本、说话人识别和性别标注;
设备
电话录音系统;
语言
泰语
应用场景
语音识别;声纹识别;
准确率
单词准确率不低于95%
许可信息
商业许可



