theojiang/CIVETv2_key_idea_retrieval_dataset_v3.3_gtebase
收藏Hugging Face2024-12-19 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/theojiang/CIVETv2_key_idea_retrieval_dataset_v3.3_gtebase
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个主要配置:data和metadata。data配置包括段落输入ID、段落输入掩码、段落问题嵌入、段落问题文本和标题等特征,分为训练集和验证集。metadata配置包括标题和索引特征,同样分为训练集和验证集。数据集的总下载大小为161930497字节,总数据集大小为236479139字节。
The dataset includes two main configurations: data and metadata. The data configuration features paragraph input IDs, paragraph input masks, paragraph question embeddings, paragraph question texts, and titles, divided into training and validation sets. The metadata configuration includes titles and indices, also divided into training and validation sets. The total download size of the dataset is 161930497 bytes, and the total dataset size is 236479139 bytes.
提供机构:
theojiang



