alea-institute/kl3m-filter-data-dotgov-nicic.gov
收藏Hugging Face2025-02-03 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-filter-data-dotgov-nicic.gov
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了标识符、数据集名称、MIME类型、分数和标记数等字段的信息。它被分割为训练集,包含7614个示例,总大小为23945251字节。尽管README没有提供详细的数据集描述,但从这些特征可以推断,这可能是一个用于文本处理或分析的任务的数据集,其中包含了用于训练的文本数据及其相关特征。
The dataset includes fields for identifier, dataset name, MIME type, score, and token count. It is split into a training set with 7614 examples, totaling 23945251 bytes in size. Although the README does not provide a detailed dataset description, it can be inferred from the features that this might be a dataset for text processing or analysis tasks, containing text data for training along with related features.
提供机构:
alea-institute



