Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone

Hugging Face2024-04-16 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone

下载链接

链接失效反馈

资源简介：

--- YAML tags: - copy-paste the tags obtained with the tagging app: https://github.com/huggingface/datasets-tagging --- # Dataset Card for Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone ## Table of Contents - [Table of Contents](#table-of-contents) - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Dataset Creation](#dataset-creation) - [Curation Rationale](#curation-rationale) - [Source Data](#source-data) - [Annotations](#annotations) - [Personal and Sensitive Information](#personal-and-sensitive-information) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Social Impact of Dataset](#social-impact-of-dataset) - [Discussion of Biases](#discussion-of-biases) - [Other Known Limitations](#other-known-limitations) - [Additional Information](#additional-information) - [Dataset Curators](#dataset-curators) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) - [Contributions](#contributions) ## Dataset Description - **Homepage:** https://www.nexdata.ai/datasets/1002?source=Huggingface - **Repository:** - **Paper:** - **Leaderboard:** - **Point of Contact:** ### Dataset Summary 1,279 Chinese speakers from major dialect regions participated in the recording, it is in line with the specific accent of Chinese English speakers. The recorded script cover many categories such as spoken English, speech, and human-computer interaction, rich in content, extensive in fields, and balanced in phonemes. It can be used to improve the recognition effect of the automatic speech recognition system on Chinese people speaking English. For more details, please refer to the link: https://www.nexdata.ai/datasets/1002?source=Huggingface ### Supported Tasks and Leaderboards automatic-speech-recognition, audio-speaker-identification: The dataset can be used to train a model for Automatic Speech Recognition (ASR). ### Languages Chinese English ## Dataset Structure ### Data Instances [More Information Needed] ### Data Fields [More Information Needed] ### Data Splits [More Information Needed] ## Dataset Creation ### Curation Rationale [More Information Needed] ### Source Data #### Initial Data Collection and Normalization [More Information Needed] #### Who are the source language producers? [More Information Needed] ### Annotations #### Annotation process [More Information Needed] #### Who are the annotators? [More Information Needed] ### Personal and Sensitive Information [More Information Needed] ## Considerations for Using the Data ### Social Impact of Dataset [More Information Needed] ### Discussion of Biases [More Information Needed] ### Other Known Limitations [More Information Needed] ## Additional Information ### Dataset Curators [More Information Needed] ### Licensing Information Commerical License: https://drive.google.com/file/d/1saDCPm74D4UWfBL17VbkTsZLGfpOQj1J/view?usp=sharing ### Citation Information [More Information Needed] ### Contributions

提供机构：

Nexdata

原始信息汇总

数据集概述

数据集名称

Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone

数据集描述

数据集总结

参与录音的1,279名中文母语者来自中国主要方言区。
录音内容涵盖多种类别，如英语口语、演讲及人机交互等，内容丰富，领域广泛，音素平衡。
旨在提升自动语音识别系统对中国人说英语的识别效果。

支持的任务和排行榜

自动语音识别（ASR）
音频说话人识别

语言

中文英语

数据集结构

数据实例

[信息待补充]

数据字段

[信息待补充]

数据分割

[信息待补充]

数据集创建

数据选择理由

[信息待补充]

源数据

初始数据收集和标准化

[信息待补充]

源语言生产者

[信息待补充]

注释

注释过程

[信息待补充]

注释者

[信息待补充]

个人和敏感信息

[信息待补充]

使用数据的考虑

数据集的社会影响

[信息待补充]

偏见讨论

[信息待补充]

其他已知限制

[信息待补充]

附加信息

数据集管理者

[信息待补充]

许可信息

商业许可：链接

引用信息

[信息待补充]

贡献

[信息待补充]

AI搜集汇总

数据集介绍

构建方式

该数据集的构建旨在收录具有中国特色的英语发音数据，涵盖1279名来自主要方言区域的中国说话者。数据的收集与规范化处理遵循了严谨的语音学原则，确保了脚本内容的多样性与发音的均衡性，以适应自动语音识别系统对英语发音的中国说话者的识别需求。

特点

Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone数据集的特点在于其内容的丰富性与领域的广泛性，包含了口语英语、演讲以及人机交互等多种类别。此外，数据集特别强调了对中国英语说话者特定口音的收录，有利于提升自动语音识别系统对此类说话者的识别效果。

使用方法

在使用该数据集时，研究者可通过链接获取完整数据集（需付费）。数据集可用于自动语音识别和音频说话人识别等任务，用户需遵循商业许可规定，并注意数据中可能包含的个人敏感信息，确保使用过程中的合规性与隐私保护。

背景与挑战

背景概述

Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone数据集，是在人工智能领域对自动语音识别技术的研究和应用背景下产生的。该数据集由Nexdata公司于特定时间创建，汇集了1,279名来自中国主要方言区域的语言使用者，他们以手机为录音设备，录制了涵盖英语口语、演讲以及人机交互等多个类别的脚本。这些语音样本针对中国英语使用者的特定口音，旨在提升自动语音识别系统对中国英语使用者语音的识别效果。该数据集的创建，对于语音识别技术的本土化改进和相关领域的研究具有显著影响。

当前挑战

在数据集构建过程中，研究团队面临的挑战包括：确保录音质量的一致性和准确性，处理方言多样性带来的识别难题，以及平衡不同类别和领域的语音样本。此外，数据集在实际应用中还需解决如何降低个人敏感信息泄露的风险，以及如何在数据标注过程中减少主观偏差。在技术层面，利用该数据集训练模型时，挑战在于如何优化算法以准确识别具有不同口音和发音习惯的中国英语使用者的语音。

常用场景

经典使用场景

在语音识别领域，Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone数据集以其丰富的内容覆盖和平衡的音素分布，成为训练自动语音识别系统的重要资源。该数据集特别适用于优化系统对中国英语使用者发音的识别效果，其录制的脚本涉及口语英语、演讲、人机交互等多个类别，为模型提供了多样化的语言学习素材。

衍生相关工作

基于Nexdata/Chinese_English_Speech_Data_by_Mobile_Phone数据集，学术界和工业界已开展了一系列相关研究，如地方口音识别、语音合成、跨语种语音识别等。这些研究不仅推动了语音识别技术的进步，也为多语言交流和处理提供了新的视角和方法。

数据集最近研究