alea-institute/kl3m-filter-data-dotgov-www.osha.gov
收藏Hugging Face2025-02-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-filter-data-dotgov-www.osha.gov
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如标识符、数据集名称、MIME类型、得分和标记数量。数据集被划分为训练集,其大小为168504430字节,包含9389个示例。数据集的下载大小为27568323字节。同时,提供了默认配置信息,其中包括训练数据文件的路径。
The dataset includes multiple fields such as identifier, dataset name, MIME type, score, and token count. The dataset is split into a training set, which is 168504430 bytes in size and contains 9389 examples. The download size of the dataset is 27568323 bytes. Default configuration information is provided, including the path to the training data files.
提供机构:
alea-institute



