alea-institute/kl3m-filter-data-dotgov-www.usgs.gov
收藏Hugging Face2025-02-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-filter-data-dotgov-www.usgs.gov
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含至少五个字段:标识符(identifier),数据集名称(dataset),媒体类型(mime_type),得分(score)和标记(tokens)。数据集被划分为训练集(train),包含大约71764个示例,总大小为523,874,021字节。具体的数据集用途和内容在README中未描述,因此无法提供详细中文描述。
The dataset includes at least five fields: identifier, dataset name, MIME type, score, and tokens. The dataset is split into a training set (train) with approximately 71,764 examples, totaling 523,874,021 bytes in size. The specific purpose and content of the dataset are not described in the README, so no detailed description can be provided.
提供机构:
alea-institute



