nexa-collaboration/mls_eng_10k_all_2_with_instruction
收藏Hugging Face2024-11-07 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/nexa-collaboration/mls_eng_10k_all_2_with_instruction
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个主要特征:transcript(转录文本,数据类型为字符串)、text(文本内容,数据类型为字符串)和row_id(行标识符,数据类型为整型)。数据集分为一个训练集,包含2,409,954个样本,总大小为1,781,333,646字节。下载大小为1,014,741,417字节。
The dataset includes three main features: transcript (transcription text, data type is string), text (text content, data type is string), and row_id (row identifier, data type is int32). The dataset is divided into one training set containing 2,409,954 samples, with a total size of 1,781,333,646 bytes. The download size is 1,014,741,417 bytes.
提供机构:
nexa-collaboration



