illuin-cde/squad-chunked-300
收藏Hugging Face2025-02-02 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/illuin-cde/squad-chunked-300
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两部分:documents和queries。documents部分包含文档的chunk_id、文档内容chunk和偏移量offset信息;queries部分包含chunk_id、查询query和答案answer信息。数据集有验证集划分,documents验证集大小为1482944字节,包含4934个样本;queries验证集大小为1076050字节,包含8501个样本。
The dataset consists of two parts: documents and queries. The documents part includes chunk_id, document content chunk, and offset information; the queries part includes chunk_id, query, and answer information. The dataset has a validation split, with the documents validation set being 1482944 bytes in size and containing 4934 samples; the queries validation set is 1076050 bytes in size and contains 8501 samples.
提供机构:
illuin-cde



