Nexdata/18_Hours_Brazilian_English_Speech_Data_by_Mobile_Phone
收藏Hugging Face2024-04-16 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/Nexdata/18_Hours_Brazilian_English_Speech_Data_by_Mobile_Phone
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-nd-4.0
---
## Description
English(Brazil) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(55 people in total), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
For more details, please refer to the link: https://www.nexdata.ai/dataset/1049?source=Huggingface
# Specifications
## Format
16kHz, 16bit, uncompressed wav, mono channel;
## Recording condition
Low background noise(indoor), without echo;
## Content category
Generic domain; human-machine interaction; smart home command and in-car command; numbers;
## Recording device
Android Smartphone, iPhone
## Speaker
55 Brazilian, including 35 males and 20 females
## Country
Brazil(BRA)
## Language
English
## Accuracy Rate
Sentence Accuracy Rate(SAR) 95%
# Licensing Information
Commercial License
提供机构:
Nexdata
原始信息汇总
数据集概述
基本信息
- 许可证: CC-BY-NC-ND-4.0
- 语言: 英语
- 国家: 巴西
- 发言人数量: 55人,包括35名男性和20名女性
数据集描述
- 内容类别: 通用领域; 人机交互; 智能家居和车载命令控制; 数字
- 录音条件: 低背景噪音(室内),无回声
- 录音设备: Android智能手机, iPhone
- 录音格式: 16kHz, 16bit, 单声道, 未压缩wav格式
- 准确率: 句子准确率(SAR) 95%
数据集特点
- 由55名多样化发言人收集,地理上覆盖巴西,旨在提升模型在真实复杂任务中的表现。
- 数据集经过多家AI公司质量测试,严格遵守数据保护法规和隐私标准,确保用户隐私和法律权利。
- 符合GDPR, CCPA, PIPL标准。



