alea-institute/kl3m-filter-data-dotgov-www.atsdr.cdc.gov
Source: Hugging Face. Updated 2025-02-03. Indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/alea-institute/kl3m-filter-data-dotgov-www.atsdr.cdc.gov
Description:
The dataset records an identifier, dataset name, MIME type, score, and token count for each example. It has a single training split, and the dataset card reports that split's size in bytes and number of examples, along with the download size and total size of the dataset. The default configuration lists the file paths of the training data files.
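As a rough illustration of how the fields described above might be used, the following sketch models one row and filters rows by score and MIME type. The field names (`identifier`, `dataset`, `mime_type`, `score`, `token_count`), the sample values, and the idea that a lower score should be kept are all assumptions made for illustration; the description does not state the exact column names or how the score should be interpreted.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass
class Record:
    """One row, using hypothetical field names based on the description."""
    identifier: str
    dataset: str
    mime_type: str
    score: float
    token_count: int


def select_records(
    records: Iterable[Record],
    max_score: float,
    mime_type: Optional[str] = None,
) -> List[Record]:
    """Keep rows whose score is at most max_score (assumed direction),
    optionally restricted to a single MIME type."""
    selected = []
    for record in records:
        if record.score > max_score:
            continue
        if mime_type is not None and record.mime_type != mime_type:
            continue
        selected.append(record)
    return selected


def total_tokens(records: Iterable[Record]) -> int:
    """Sum the token counts of the given rows."""
    return sum(record.token_count for record in records)


# Made-up sample rows; these are not taken from the real dataset.
SAMPLE = [
    Record("doc-1", "www.atsdr.cdc.gov", "text/html", 2.5, 120),
    Record("doc-2", "www.atsdr.cdc.gov", "application/pdf", 7.0, 300),
    Record("doc-3", "www.atsdr.cdc.gov", "text/html", 4.0, 80),
]

if __name__ == "__main__":
    kept = select_records(SAMPLE, max_score=5.0)
    print([record.identifier for record in kept], total_tokens(kept))
```

In practice the rows would come from the training split rather than from hand-written samples, for example via the Hugging Face `datasets` library, and the column names should be checked against the dataset card before adapting this sketch.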
Provider:
alea-institute



