illuin-cde/squad-chunked-1750
Hugging Face | Updated 2025-02-02 | Indexed 2025-02-15
Download link:
https://hf-mirror.com/datasets/illuin-cde/squad-chunked-1750
Resource description:
The dataset has two parts: documents and queries. Each document record holds a unique identifier for the document chunk (chunk_id), the chunk text (chunk), and the chunk's offset in the original file (offset). Each query record holds a unique identifier (chunk_id), the query text (query), and the corresponding answer (answer). The dataset provides a validation split, which can be used for model training and evaluation.
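The record layout described above can be illustrated with a minimal sketch. It assumes, without confirmation from the listing, that a query's chunk_id refers to the document chunk that contains its answer. The sample records, their values, and the helper names `index_chunks` and `attach_chunks` are hypothetical and are not taken from the dataset.

```python
# Hypothetical sample records that mirror the described fields only.
# The values are invented for illustration; the unit of `offset` is not
# stated in the listing, so these numbers are placeholders.
documents = [
    {"chunk_id": "doc0_0", "chunk": "First placeholder chunk.", "offset": 0},
    {"chunk_id": "doc0_1", "chunk": "Second placeholder chunk.", "offset": 24},
]
queries = [
    {"chunk_id": "doc0_1", "query": "Which chunk is this?", "answer": "Second"},
]


def index_chunks(docs):
    """Build a chunk_id -> document record lookup table."""
    return {doc["chunk_id"]: doc for doc in docs}


def attach_chunks(query_records, docs):
    """Pair each query with the chunk that shares its chunk_id.

    Assumption: the query's chunk_id is a join key into the documents
    part. Queries with no matching chunk are skipped.
    """
    by_id = index_chunks(docs)
    paired = []
    for q in query_records:
        doc = by_id.get(q["chunk_id"])
        if doc is None:
            continue
        paired.append({**q, "chunk": doc["chunk"], "offset": doc["offset"]})
    return paired


pairs = attach_chunks(queries, documents)
```

Each element of `pairs` then carries the query, its answer, and the matched chunk text, which is the shape a retrieval evaluation would typically consume.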
Provider:
illuin-cde



