jan-hq/libris_r_vivoice_ichigo_tokens_v2
收藏Hugging Face2024-12-28 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/jan-hq/libris_r_vivoice_ichigo_tokens_v2
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含对话数据的文本数据集,数据集中的每个样本都包括对话的transcript(转录文本)、prompt(提示文本)、compress_prompt(压缩提示文本)和对话的具体内容。对话内容被进一步细分为content(文本内容)和role(角色)。数据集划分为训练集,共有1000098个示例,总大小为4763539094字节。
This is a text dataset containing conversational data, where each sample includes the transcript of the conversation, prompt, compressed prompt, and the specific content of the conversation. The conversation content is further subdivided into content and role. The dataset is split into a training set with a total of 1000098 examples and a size of 4763539094 bytes.
提供机构:
jan-hq



