tbkazakova/even_speech_hse
Hugging Face · updated 2024-06-05 · indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/tbkazakova/even_speech_hse
Resource description:
---
language:
- eve
task_categories:
- automatic-speech-recognition
size_categories:
- 1K<n<10K
---
This dataset consists of audio files containing speech in the Even language.
The correspondence between text and audio is in the table [metadata.csv](https://huggingface.co/datasets/tbkazakova/even_speech_hse/blob/main/metadata.csv).
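
As a rough illustration of how such a table can be used, the sketch below pairs audio paths with transcripts from CSV text. The column names `file_name` and `transcription` are assumptions for illustration only (`file_name` follows the common Hugging Face audio-folder convention); the actual header of `metadata.csv` is not stated here and should be checked first. The sample rows are made up and are not taken from the dataset.

```python
import csv
import io


def read_metadata(csv_text, audio_column="file_name", text_column="transcription"):
    """Return (audio_path, transcript) pairs parsed from metadata CSV text.

    The default column names are hypothetical; pass the real ones once
    the header of metadata.csv has been inspected.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    pairs = []
    for row in reader:
        audio = (row.get(audio_column) or "").strip()
        text = (row.get(text_column) or "").strip()
        # Keep only rows that have both an audio path and a transcript.
        if audio and text:
            pairs.append((audio, text))
    return pairs


# Made-up sample in the assumed layout; the second row lacks a transcript.
sample = "file_name,transcription\nclips/0001.wav,example one\nclips/0002.wav,\n"
pairs = read_metadata(sample)
print(pairs)
```

For the real file, the text of a downloaded `metadata.csv` can be passed to `read_metadata`, with the column arguments adjusted to match its header.
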
The data was collected during field trips of the HSE University [expedition "Languages and Cultures of Kamchatka"](https://www.evenlang.ru/evenlang_online/scientific/main/).
| Source | Dialect | Total length (min) |
|----------|----------|----------|
| Expedition records | Bystraja | TBA |
Another dataset of Even speech:
- biblical texts: https://huggingface.co/datasets/tbkazakova/even_speech_biblical
There is also [unified data](https://huggingface.co/datasets/tbkazakova/even_speech_pakendorf) from the project (Aralova et al. 2007-2023)[1], but for copyright reasons, it can only be made available by personal agreement with all copyright holders.
[1]: Natalia Aralova, Brigitte Pakendorf, Alexandra Lavrillier, Dejan Matić, Katharina Gernet, Tat'jana Vasil'evna Zakharova, Raisa Petrovna Kuzmina, and Luise Zippel (2007 - 2023). Collection "Even". The Language Archive. https://hdl.handle.net/1839/07210104-91d6-4133-b067-b21eadc35f9a
Provider:
tbkazakova
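Summary of original information
Dataset overview
Basic information
- Language: Even
- Task category: automatic speech recognition
- Dataset size: 1K<n<10K
Data content
- Contains audio files in the Even language.
- The correspondence between text and audio is recorded in metadata.csv.
Data source
- The data was collected during field trips of HSE University, specifically the expedition "Languages and Cultures of Kamchatka".
Data details
- Source: expedition records
- Dialect: Bystraja
- Total length: TBA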



