mlfoundations-dev/DCLM-IT-Dataset
收藏Hugging Face2024-10-22 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/mlfoundations-dev/DCLM-IT-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含一个名为conversation的列表,其中包含from和value两个字段,数据类型均为字符串。数据集包含一个名为train的分割,包含4110641个示例,总大小为7096640439字节。下载大小为3467280679字节。配置文件指定了默认配置,数据文件路径为data/train-*。
The dataset contains a list named conversation, which includes two fields: from and value, both of which are of string type. The dataset includes a split named train, containing 4110641 examples with a total size of 7096640439 bytes. The download size is 3467280679 bytes. The configuration file specifies the default configuration, with the data file path being data/train-*.
提供机构:
mlfoundations-dev



