alea-institute/kl3m-filter-data-dotgov-www.centcom.mil
收藏Hugging Face2025-02-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-filter-data-dotgov-www.centcom.mil
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的特征,包括标识符、数据集名称、MIME类型、得分和标记序列长度。数据集被划分为训练集,共有7698个示例,占用75660720字节。提供了默认配置,并指定了训练数据的文件路径。但是,具体的数据集内容和用途在README中并未描述,因此无法提供详细的中文名称描述。
The dataset includes features such as identifier, dataset name, MIME type, score, and token sequence length. The dataset is split into a training set with 7698 examples, occupying 75660720 bytes. A default configuration is provided, and the file path for the training data is specified. However, the specific content and purpose of the dataset are not described in the README, so no detailed English name description can be provided.
提供机构:
alea-institute



