victorambrose11/TF_IDF_EMB_Hirearchy
Source: Hugging Face. Updated: 2025-04-10. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/victorambrose11/TF_IDF_EMB_Hirearchy
Description:
The dataset contains three fields: input IDs for the text (paragraph_input_ids), attention masks (paragraph_attention_masks), and paragraph counts (paragraph_counts). It is split into training, test, and validation sets: the training set has 5000 samples, and the test and validation sets have 1400 samples each. The total dataset size is 1279324800 bytes (about 1.28 GB), and the download size is 38810391 bytes (about 38.8 MB).
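The figures in the description can be sanity-checked with a little arithmetic. The sketch below is illustrative only: the split names in `SPLIT_SIZES` are assumed rather than confirmed by the listing, and `load_splits` is a hypothetical helper that assumes the repository is reachable through the Hugging Face `datasets` library with its default configuration.

```python
# Figures quoted in the description above.
# The split names used as keys are an assumption; the listing only says
# "training", "test", and "validation".
SPLIT_SIZES = {"train": 5000, "test": 1400, "validation": 1400}
DATASET_BYTES = 1_279_324_800   # total dataset size in bytes
DOWNLOAD_BYTES = 38_810_391     # download size in bytes

# Total number of samples across all splits.
total_samples = sum(SPLIT_SIZES.values())

# Average on-disk footprint of one sample, plus any leftover bytes.
bytes_per_sample, remainder = divmod(DATASET_BYTES, total_samples)

# How much smaller the download is than the full dataset.
compression_ratio = DATASET_BYTES / DOWNLOAD_BYTES


def load_splits(repo_id="victorambrose11/TF_IDF_EMB_Hirearchy"):
    """Hypothetical helper: fetch all splits from the Hugging Face Hub.

    Assumes the `datasets` package is installed and the repository is
    publicly accessible. Imported lazily so the arithmetic above runs
    without the dependency.
    """
    from datasets import load_dataset
    return load_dataset(repo_id)


if __name__ == "__main__":
    print(f"samples: {total_samples}")
    print(f"bytes per sample: {bytes_per_sample} (remainder {remainder})")
    print(f"size / download ratio: {compression_ratio:.2f}")
```

The total size divides evenly across the 7800 samples, at 164016 bytes per sample, which suggests that every sample occupies the same amount of space. The download is roughly 33 times smaller than the full dataset.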
Provider:
victorambrose11



