PranavJosh/500hr_voi_users
Source: Hugging Face | Updated: 2026-03-26 | Indexed: 2026-03-29
Download link:
https://hf-mirror.com/datasets/PranavJosh/500hr_voi_users
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
dataset_info:
  features:
  - name: index
    dtype: int64
  - name: call_id
    dtype: int64
  - name: user_id
    dtype: int64
  - name: recording_id
    dtype: int64
  - name: audio_url
    dtype: string
  - name: start
    dtype: float64
  - name: end
    dtype: float64
  - name: sentence
    dtype: string
  - name: duration
    dtype: float64
  - name: duration_hours
    dtype: float64
  - name: target_word_count
    dtype: int64
  - name: target_unique_words
    dtype: int64
  - name: word_density
    dtype: float64
  - name: _row_idx
    dtype: int64
  - name: audio
    dtype: audio
  splits:
  - name: train
    num_bytes: 148432570429
    num_examples: 214140
  download_size: 158873105090
  dataset_size: 148432570429
---
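Since the card lists a download size of 158873105090 bytes, it may be more practical to stream the train split than to download it in full. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is publicly readable; the helper name `stream_metadata` and the `METADATA_COLUMNS` list are illustrative and not part of the dataset itself.

```python
from typing import Any, Dict, Iterator

# Repository id taken from the page title.
REPO_ID = "PranavJosh/500hr_voi_users"

# Every feature from the card except "audio", so that inspecting rows
# only yields the lightweight metadata fields.
METADATA_COLUMNS = [
    "index", "call_id", "user_id", "recording_id", "audio_url",
    "start", "end", "sentence", "duration", "duration_hours",
    "target_word_count", "target_unique_words", "word_density", "_row_idx",
]


def stream_metadata(limit: int = 5) -> Iterator[Dict[str, Any]]:
    """Yield the first `limit` rows of the train split without the audio column.

    Hypothetical helper: it assumes network access to the Hugging Face Hub.
    """
    # Imported lazily so the module can be loaded without `datasets` installed.
    from datasets import load_dataset

    # streaming=True returns an IterableDataset instead of downloading all shards.
    ds = load_dataset(REPO_ID, split="train", streaming=True)
    ds = ds.select_columns(METADATA_COLUMNS)
    for row in ds.take(limit):
        yield row
```

For example, `for row in stream_metadata(3): print(row["sentence"])` would print the transcript text of the first three rows, provided the repository can be reached.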
Configurations:
- Config name: default
  Data files:
  - Split: train
    Path: data/train-*
Dataset information:
  Features:
  - Index (index), dtype: int64
  - Call ID (call_id), dtype: int64
  - User ID (user_id), dtype: int64
  - Recording ID (recording_id), dtype: int64
  - Audio URL (audio_url), dtype: string
  - Start time (start), dtype: float64
  - End time (end), dtype: float64
  - Sentence text (sentence), dtype: string
  - Duration (duration), dtype: float64
  - Duration in hours (duration_hours), dtype: float64
  - Target word count (target_word_count), dtype: int64
  - Target unique word count (target_unique_words), dtype: int64
  - Word density (word_density), dtype: float64
  - Row index (_row_idx), dtype: int64
  - Audio data (audio), dtype: audio
  Splits:
  - Split name: train
    Size in bytes: 148432570429
    Number of examples: 214140
  Download size: 158873105090
  Total dataset size: 148432570429
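The raw byte counts above are easier to reason about in larger units. The sketch below converts them to decimal gigabytes and binary gibibytes and derives the average size per example; the constant and function names are illustrative, and the only inputs are the figures stated on the card.

```python
# Figures copied from the dataset card.
TRAIN_NUM_BYTES = 148_432_570_429
DOWNLOAD_SIZE = 158_873_105_090
NUM_EXAMPLES = 214_140


def to_gb(num_bytes: int) -> float:
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return num_bytes / 10**9


def to_gib(num_bytes: int) -> float:
    """Convert bytes to binary gibibytes (1 GiB = 2**30 bytes)."""
    return num_bytes / 2**30


# Average stored size of one train example, in bytes.
avg_bytes_per_example = TRAIN_NUM_BYTES / NUM_EXAMPLES

print(f"train split: {to_gb(TRAIN_NUM_BYTES):.1f} GB ({to_gib(TRAIN_NUM_BYTES):.1f} GiB)")
print(f"download:    {to_gb(DOWNLOAD_SIZE):.1f} GB ({to_gib(DOWNLOAD_SIZE):.1f} GiB)")
print(f"average per example: {avg_bytes_per_example / 10**6:.2f} MB")
```

This works out to roughly 148 GB for the train split and roughly 159 GB to download, or a little under 0.7 MB per example on average.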
Provider:
PranavJosh



