ChavyvAkvar/aya_collection_language_split-standard_malay-Converted
收藏Hugging Face2025-09-05 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/ChavyvAkvar/aya_collection_language_split-standard_malay-Converted
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含对话信息,每个示例包括消息内容、角色、消息中的token数量以及JSON格式的元数据。数据集被划分为训练集,其中包含约359万示例,总大小超过2GB。
The dataset consists of conversation information, with each example including message content, role, the number of tokens in the message, and JSON-formatted metadata. The dataset is split into a training set, which contains approximately 3.59 million examples and is over 2GB in size.
提供机构:
ChavyvAkvar



