alea-institute/kl3m-data-dotgov-www.usaid.gov
收藏Hugging Face2025-02-02 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/alea-institute/kl3m-data-dotgov-www.usaid.gov
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含 identifier、dataset、mime_type 和 tokens 等信息字段。数据集被划分为训练集(train),训练集大小为24077436字节,包含3647个示例。数据集的总大小为24077436字节,下载大小为4855338字节。配置信息中提供了默认配置,指定了训练集数据文件的路径。
The dataset includes information fields such as identifier, dataset, mime_type, and tokens. It is split into a training set (train), which is 24077436 bytes in size and contains 3647 examples. The total size of the dataset is 24077436 bytes, and the download size is 4855338 bytes. The default configuration provides the path to the training set data files.
提供机构:
alea-institute



