alea-institute/kl3m-data-dotgov-www.nga.gov
收藏Hugging Face2025-02-01 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-data-dotgov-www.nga.gov
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本数据的训练集,数据集包含了约2.6百万个文本例子,每个例子有唯一的标识符、数据集名称、MIME类型以及文本序列的token数量。数据集大小约为1GB,下载大小为约167MB。
This dataset is a training set containing textual data, with approximately 2.6 million text examples. Each example includes a unique identifier, dataset name, MIME type, and the number of tokens in the text sequence. The dataset size is about 1GB, with a download size of approximately 167MB.
提供机构:
alea-institute



