joanitolopo/restructured-muse
Hugging Face | Updated: 2024-12-23 | Indexed: 2025-02-15
Download link:
https://hf-mirror.com/datasets/joanitolopo/restructured-muse
Overview:
The dataset has three fields, text_1, text_2, and lang, all of type string. It contains a single split named muse with about 7.1 million examples. No description of the dataset's contents is provided. Judging from the field names and configuration, it is probably a parallel corpus for text processing, with the lang field likely indicating the language of the text.
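One way to check the guess about the lang field is to stream a small sample and tally its values. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the repository loads with its default configuration. The helper names `stream_rows` and `count_langs` are hypothetical and chosen for illustration only.

```python
from collections import Counter
from itertools import islice
from typing import Dict, Iterable, Iterator

REPO_ID = "joanitolopo/restructured-muse"  # dataset name as listed on this page
SPLIT = "muse"  # the only split mentioned in the overview


def stream_rows() -> Iterator[Dict[str, str]]:
    """Stream rows from the Hub without downloading all ~7.1M examples.

    Assumption: the default configuration is loadable. Needs network access.
    """
    from datasets import load_dataset  # lazy import so count_langs works offline

    return iter(load_dataset(REPO_ID, split=SPLIT, streaming=True))


def count_langs(rows: Iterable[Dict[str, str]], limit: int = 1000) -> Counter:
    """Tally the `lang` value over at most `limit` rows."""
    return Counter(row["lang"] for row in islice(rows, limit))


if __name__ == "__main__":
    # Prints the distribution of `lang` values in the first 1000 rows.
    print(count_langs(stream_rows()))
```

Streaming keeps the check cheap: only the first rows are fetched, which is enough to see what kind of values lang holds before committing to a full download.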
Provider:
joanitolopo



