tomaarsen/gooaq-hard-negatives
Source: Hugging Face. Updated: 2024-06-19. Indexed: 2024-06-22.
Download link:
https://hf-mirror.com/datasets/tomaarsen/gooaq-hard-negatives
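Summary:
This dataset is an extension of the GooAQ dataset. It contains (question, answer, negative) triplets as well as (question, answer, negative_1, ..., negative_5) tuples. The negatives were mined automatically with the all-MiniLM-L6-v2 model, using a specific set of mining parameters. The dataset can be used directly with Sentence Transformers to train embedding models. It has two subsets, triplet-all and triplet-5, which differ in the number of negatives per row: one in triplet-all and five in triplet-5.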
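To make the idea of a mined hard negative concrete, here is a minimal sketch in plain Python. It assumes that mining means ranking candidate passages by cosine similarity to the question and keeping the top-ranked ones that are not the labelled answer. The function name `mine_hard_negatives`, its parameters, and the 2-d vectors are hypothetical illustrations; the actual mining for this dataset used all-MiniLM-L6-v2 embeddings and parameter settings that are not reproduced here.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mine_hard_negatives(query_vec, positive_idx, corpus_vecs, num_negatives=5):
    """Return indices of the passages most similar to the query, skipping the positive."""
    scored = [
        (cosine(query_vec, vec), idx)
        for idx, vec in enumerate(corpus_vecs)
        if idx != positive_idx
    ]
    # Highest similarity first: these are the "hardest" negatives.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [idx for _, idx in scored[:num_negatives]]


# Toy 2-d "embeddings" (made-up values, for illustration only).
query = [1.0, 0.0]
corpus = [
    [0.9, 0.1],  # 0: the labelled answer (positive)
    [0.8, 0.3],  # 1: close to the query, a hard negative
    [0.0, 1.0],  # 2: unrelated, an easy negative
    [0.7, 0.7],  # 3: moderately close
]
hard = mine_hard_negatives(query, positive_idx=0, corpus_vecs=corpus, num_negatives=2)
print(hard)  # [1, 3]
```

A real pipeline would replace the toy vectors with model embeddings and would typically add filters (for example a similarity margin) to avoid selecting passages that are in fact correct answers.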
Provider:
tomaarsen
Original information summary
Dataset overview
Dataset name
GooAQ with mined hard negatives
Dataset tags
- sentence-transformers
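Dataset configurations
triplet-5 configuration
- Features:
  - question: string
  - answer: string
  - negative_1: string
  - negative_2: string
  - negative_3: string
  - negative_4: string
  - negative_5: string
- Splits:
  - train:
    - Bytes: 3593548091
    - Examples: 2294443
- Download size: 2433298512 bytes
- Dataset size: 3593548091 bytes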
triplet-all configuration
- Features:
  - question: string
  - answer: string
  - negative: string
- Splits:
  - train:
    - Bytes: 5571746272
    - Examples: 10048700
- Download size: 1710985513 bytes
- Dataset size: 5571746272 bytes
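The two configurations can be loaded by name with the Hugging Face `datasets` library. The sketch below assumes `datasets` is installed and uses streaming so that the multi-gigabyte files are not downloaded up front. The helper names `load_subset` and `has_expected_columns` and the `EXPECTED_COLUMNS` table are hypothetical conveniences built from the feature lists above, not part of the dataset or the library.

```python
# Column names per configuration, copied from the feature lists above.
EXPECTED_COLUMNS = {
    "triplet-all": ["question", "answer", "negative"],
    "triplet-5": ["question", "answer"] + [f"negative_{i}" for i in range(1, 6)],
}


def load_subset(config_name, streaming=True):
    """Load the train split of one configuration (needs network access)."""
    if config_name not in EXPECTED_COLUMNS:
        raise ValueError(f"unknown configuration: {config_name!r}")
    # Imported lazily so the rest of this sketch works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(
        "tomaarsen/gooaq-hard-negatives",
        config_name,
        split="train",
        streaming=streaming,
    )


def has_expected_columns(row, config_name):
    """Check that a row (a dict) has exactly the columns listed for its configuration."""
    return set(row) == set(EXPECTED_COLUMNS[config_name])
```

Typical use would be `row = next(iter(load_subset("triplet-5")))` followed by `has_expected_columns(row, "triplet-5")` as a quick sanity check before training.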
Dataset subsets
triplet-all subset
- Columns: "question", "answer", "negative"
- Column types: str, str, str
- Example:
    {
      'question': 'is toprol xl the same as metoprolol?',
      'answer': 'Metoprolol succinate is also known by the brand name Toprol XL. It is the extended-release form of metoprolol. Metoprolol succinate is approved to treat high blood pressure, chronic chest pain, and congestive heart failure.',
      'negative': 'TOPROL-XL is indicated for the treatment of stable, symptomatic (NYHA Class II or III) heart failure of ischemic, hypertensive, or cardiomyopathic origin. It was studied in patients already receiving ACE inhibitors, diuretics, and, in the majority of cases, digitalis.'
    }
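To illustrate why hard negatives matter when a (question, answer, negative) triplet is used for training, the following is a generic sketch of a softmax ranking loss over cosine similarities, written in plain Python. It is an assumption-laden toy, not the Sentence Transformers implementation: the function `ranking_loss`, the `scale` value, and the 2-d vectors are all illustrative. In practice, Sentence Transformers provides losses such as `MultipleNegativesRankingLoss` that accept this kind of data.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def ranking_loss(question, answer, negatives, scale=20.0):
    """Cross-entropy of picking the answer among answer + negatives (illustrative)."""
    scores = [scale * cosine(question, answer)]
    scores += [scale * cosine(question, neg) for neg in negatives]
    # Numerically stable log-sum-exp.
    top = max(scores)
    log_sum_exp = top + math.log(sum(math.exp(s - top) for s in scores))
    return log_sum_exp - scores[0]


# Made-up 2-d embeddings for a question and its answer.
q = [1.0, 0.0]
a = [0.9, 0.1]
easy = ranking_loss(q, a, [[0.0, 1.0]])  # unrelated negative
hard = ranking_loss(q, a, [[0.8, 0.3]])  # negative close to the question
print(easy < hard)  # True
```

The easy negative yields a loss close to zero and therefore almost no training signal, while the hard negative yields a noticeably larger loss. That difference is the motivation for mining negatives that resemble the correct answer.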
triplet-5 subset
- Columns: "question", "answer", "negative_1", "negative_2", "negative_3", "negative_4", "negative_5"
- Column types: str, str, str, str, str, str, str
- Example:
    {
      'question': 'is toprol xl the same as metoprolol?',
      'answer': 'Metoprolol succinate is also known by the brand name Toprol XL. It is the extended-release form of metoprolol. Metoprolol succinate is approved to treat high blood pressure, chronic chest pain, and congestive heart failure.',
      'negative_1': 'Secondly, metoprolol and metoprolol ER have different brand-name equivalents: Brand version of metoprolol: Lopressor. Brand version of metoprolol ER: Toprol XL.',
      'negative_2': 'Pill with imprint 1 is White, Round and has been identified as Metoprolol Tartrate 25 mg.',
      'negative_3': 'Interactions between your drugs No interactions were found between Allergy Relief and metoprolol. This does not necessarily mean no interactions exist. Always consult your healthcare provider.',
      'negative_4': 'Metoprolol is a type of medication called a beta blocker. It works by relaxing blood vessels and slowing heart rate, which improves blood flow and lowers blood pressure. Metoprolol can also improve the likelihood of survival after a heart attack.',
      'negative_5': "Metoprolol starts to work after about 2 hours, but it can take up to 1 week to fully take effect. You may not feel any different when you take metoprolol, but this doesn't mean it's not working. It's important to keep taking your medicine."
    }
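A row in the triplet-5 layout can be expanded into rows in the triplet-all layout when a training setup expects exactly one negative per example. The sketch below shows that reshaping in plain Python. The helper name `expand_row` is hypothetical, and the code makes no claim that the resulting triplets match the actual rows of the triplet-all subset.

```python
def expand_row(row):
    """Turn one (question, answer, negative_1..negative_n) row into n triplets."""
    # Collect the negative_* columns and order them by their numeric suffix.
    negative_keys = sorted(
        (key for key in row if key.startswith("negative_")),
        key=lambda key: int(key.split("_")[1]),
    )
    return [
        {"question": row["question"], "answer": row["answer"], "negative": row[key]}
        for key in negative_keys
    ]


# Shortened placeholder strings stand in for the real passages.
example = {
    "question": "is toprol xl the same as metoprolol?",
    "answer": "Metoprolol succinate is also known by the brand name Toprol XL.",
    "negative_1": "n1",
    "negative_2": "n2",
    "negative_3": "n3",
    "negative_4": "n4",
    "negative_5": "n5",
}
triplets = expand_row(example)
print(len(triplets))  # 5
```

Each output dictionary has the three columns "question", "answer", and "negative", so it can be consumed by code written for the triplet-all layout.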



