five

tbkazakova/even_speech_hse

收藏
Hugging Face2024-06-05 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/tbkazakova/even_speech_hse
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - eve task_categories: - automatic-speech-recognition size_categories: - 1K<n<10K --- This dataset consists of audiofiles with a speech in Even language. The correspondence between text and audio is in the table [metadata.csv](https://huggingface.co/datasets/tbkazakova/even_speech_hse/blob/main/metadata.csv). The data was collected during field trips of HSE University [expedition "Languages and Cultures of Kamchatka"](https://www.evenlang.ru/evenlang_online/scientific/main/) | Sourse | Dialect | Total length (min)| |----------|----------|----------| | Expedition records | Bystraja | TBA | Another dataset of Even speech: - biblical texts: https://huggingface.co/datasets/tbkazakova/even_speech_biblical There is also [unified data](https://huggingface.co/datasets/tbkazakova/even_speech_pakendorf) from the project (Aralova et al. 2007-2023)[1], but for copyright reasons, it can only be made available by personal agreement with all copyright holders. [1]: Natalia Aralova, Brigitte Pakendorf, Alexandra Lavrillier, Dejan Matić, Katharina Gernet, Tat'jana Vasil'evna Zakharova, Raisa Petrovna Kuzmina, and Luise Zippel (2007 - 2023). Collection "Even". The Language Archive. https://hdl.handle.net/1839/07210104-91d6-4133-b067-b21eadc35f9a
提供机构:
tbkazakova
原始信息汇总

数据集概述

基本信息

  • 语言: Even
  • 任务类别: 自动语音识别
  • 数据集大小: 1K<n<10K

数据内容

  • 包含Even语言的音频文件。
  • 文本与音频的对应关系记录在metadata.csv中。

数据来源

  • 数据收集自HSE大学的实地考察,具体为“Kamchatka的语言与文化”探险活动。

数据详情

  • 来源: 探险记录
  • 方言: Bystraja
  • 总时长: 待公布
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作