arbml/ESCWA
收藏Hugging Face2022-12-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/arbml/ESCWA
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: audio
dtype: audio
- name: transcription
dtype: string
splits:
- name: train
num_bytes: 783712001.0
num_examples: 24
download_size: 766073404
dataset_size: 783712001.0
---
# Dataset Card for "ESCWA"
Collected over two days of meetings of the United Nations Economic and Social Commission for West Asia (ESCWA) in 2019. The data includes intrasentential code alternation between Arabic and English. In the case of Algerian, Tunisian, and Moroccan native speakers, the switch is between Arabic and French.
The 2.8 hours ESCWA includes dialectal Arabic, with a Code Mixing Index (CMI) of ~28%.
More details about the ESCWA can be found https://arabicspeech.org/escwa/.
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
arbml
原始信息汇总
数据集概述
特征
- 音频:数据类型为音频。
- 转录文本:数据类型为字符串。
数据分割
- 训练集:
- 字节数:783712001.0
- 样本数:24
数据大小
- 下载大小:766073404
- 数据集大小:783712001.0



