parsa-mhmdi/tokenized_cnn_dailymail_bart
收藏Hugging Face2025-02-04 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/parsa-mhmdi/tokenized_cnn_dailymail_bart
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字段:input_ids、attention_mask和labels,分别代表输入ID序列、注意力掩码序列和标签序列。数据集分为训练集、验证集和测试集三个部分,其中训练集包含287,113个示例,验证集包含13,368个示例,测试集包含11,490个示例。数据集的总大小为1,920,493,476字节。
The dataset includes three fields: input_ids, attention_mask, and labels, representing input ID sequences, attention mask sequences, and label sequences respectively. The dataset is divided into three parts: training set, validation set, and test set, containing 287,113, 13,368, and 11,490 examples respectively. The total size of the dataset is 1,920,493,476 bytes.
提供机构:
parsa-mhmdi



