five

kevin017/tokenized_bioS_inverse_QA_c_name_large_padding

收藏
Hugging Face2025-04-03 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/kevin017/tokenized_bioS_inverse_QA_c_name_large_padding
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含三个特征字段:input_ids,attention_mask和answers_tokenized。input_ids和attention_mask是整数序列,分别存储int32和int8类型的元素。answers_tokenized是一个包含两个字段的结构体,两个字段分别是attention_mask和input_ids,它们都是包含int64类型元素的序列。数据集分为训练集和测试集,各包含92个示例。整个数据集的总大小为5654096字节,下载大小为492309字节。

The dataset includes three feature fields: input_ids, attention_mask, and answers_tokenized. input_ids and attention_mask are integer sequences containing int32 and int8 elements, respectively. answers_tokenized is a structure with two fields, attention_mask and input_ids, which are both sequences of int64 elements. The dataset is split into a training set and a test set, each containing 92 examples. The total size of the dataset is 5654096 bytes, with a download size of 492309 bytes.
提供机构:
kevin017
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作