akahana/mini-multilanguage
收藏Hugging Face2024-12-26 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/akahana/mini-multilanguage
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多种语言的文本数据,包括阿拉伯语、英语、印度尼西亚语、日语、韩语和马来语。每个语言配置包含文本、时间戳、URL和来源等特征,并且提供了训练集的详细信息,如字节数和样本数。数据集的下载大小和数据集大小也被列出。
This dataset contains text data in multiple languages, including Arabic, English, Indonesian, Japanese, Korean, and Malay. Each language configuration includes features such as text, timestamp, URL, and source, along with detailed information about the training set, such as the number of bytes and examples. The download size and dataset size are also listed.
提供机构:
akahana



