alea-institute/kl3m-data-dotgov-www.mspb.gov
收藏Hugging Face2025-02-01 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-data-dotgov-www.mspb.gov
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如标识符(identifier)、数据集名称(dataset)、媒体类型(mime_type)和标记序列(tokens)。数据集被划分为训练集(train),包含41516个示例,大小为284,530,225字节。数据集的下载大小为30,643,787字节。根据这些信息,可以推测这是一个用于文本处理或机器学习任务的文本数据集,但具体应用领域和内容未在README中明确描述。
The dataset includes features such as identifier, dataset name, MIME type, and token sequences. It is split into a training set (train) with 41,516 examples, totaling 284,530,225 bytes in size. The download size of the dataset is 30,643,787 bytes. Based on this information, it can be inferred that this is a text dataset for text processing or machine learning tasks, but the specific application field and content are not explicitly described in the README.
提供机构:
alea-institute



