alea-institute/kl3m-filter-data-dotgov-www.usff.navy.mil
收藏Hugging Face2025-02-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-filter-data-dotgov-www.usff.navy.mil
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含五个字段:标识符(identifier)、数据集名称(dataset)、MIME类型(mime_type)、评分(score)和标记数量(tokens)。数据集分为训练集(train),训练集大小为18102993字节,共有2341个示例。数据集下载大小为3848824字节,整个数据集大小为18102993字节。提供了一个默认配置,指定了训练集的数据文件路径。
The dataset includes five fields: identifier, dataset name, MIME type, score, and number of tokens. The dataset is split into a training set (train) which is 18102993 bytes in size and contains 2341 examples. The download size of the dataset is 3848824 bytes, and the total size of the dataset is 18102993 bytes. A default configuration is provided, specifying the path to the data files for the training set.
提供机构:
alea-institute



