nihalnayak/tulu-v2-unfiltered-folds
收藏Hugging Face2025-07-24 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nihalnayak/tulu-v2-unfiltered-folds
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个主要字段:数据集名称(dataset)、唯一标识符(id)和消息列表(messages)。消息列表进一步包含角色(role)和内容(content)两个子字段。数据集被分为五个部分(folds),每部分包含200,000个示例,大小约为379,306,069字节。数据集的总下载大小为1,021,400,972字节,总大小为1,896,530,345.53字节。
The dataset includes three main fields: dataset name (dataset), unique identifier (id), and a list of messages (messages). The messages list further contains two sub-fields: role and content. The dataset is divided into five parts (folds), each containing 200,000 examples, approximately 379,306,069 bytes in size. The total download size of the dataset is 1,021,400,972 bytes, and the total size is 1,896,530,345.53 bytes.
提供机构:
nihalnayak



