agu18dec/longhealth-longcode-19k-max-len55k-enc-max-len1k-dec
Source: Hugging Face. Updated: 2025-10-30. Indexed: 2025-11-15.
Download link:
https://hf-mirror.com/datasets/agu18dec/longhealth-longcode-19k-max-len55k-enc-max-len1k-dec
Description:
The dataset is split into multiple parts because of size limits: training data (train_part*.jsonl) and validation data (val_part*.jsonl), with each part about 3 GB or smaller. Lengths are capped, with a maximum encoding length of 55,000 and a maximum decoding length of 1,000. The README does not state the type, source, or content of the dataset.
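Because the data arrives as several JSONL parts, a reader will usually want to stream them as one sequence. The following is a minimal sketch, assuming the part files have already been downloaded to a local directory and that each non-empty line holds one JSON object. The helper name `iter_jsonl_parts` and the directory name `data` are hypothetical, and since the README does not document the record fields, none are assumed here.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_jsonl_parts(directory: str, pattern: str) -> Iterator[dict]:
    """Yield one parsed record per non-empty line across all matching part files."""
    # Sort by file name so parts are read in a stable order. The sort is
    # lexicographic, so "part10" comes before "part2" unless names are zero-padded.
    for path in sorted(Path(directory).glob(pattern)):
        with path.open(encoding="utf-8") as handle:
            # Read line by line so a multi-gigabyte part is never held in memory.
            for line in handle:
                line = line.strip()
                if line:
                    yield json.loads(line)
```

A typical call would be `for record in iter_jsonl_parts("data", "train_part*.jsonl"): ...`, with `"val_part*.jsonl"` for the validation split. Streaming is chosen over loading whole files because each part may be up to about 3 GB.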
Provider:
agu18dec



