
trivia-qa-triplet

ModelScope community; updated 2025-12-04; indexed 2025-01-11
Download link:
https://modelscope.cn/datasets/sentence-transformers/trivia-qa-triplet
Description:
# Dataset Card for Trivia QA with Triplets

This is a reformatting of the Trivia QA dataset used to train the [BGE-M3 model](https://huggingface.co/BAAI/bge-m3). See the full BGE-M3 dataset in [Shitao/bge-m3-data](https://huggingface.co/datasets/Shitao/bge-m3-data).

## Dataset Subsets

### `triplet` subset

* Columns: "anchor", "positive", "negative"
* Column types: `str`, `str`, `str`
* Examples:
    ```python
    {
      'anchor': 'Which American-born Sinclair won the Nobel Prize for Literature in 1930?',
      'positive': 'Sinclair Lewis Sinclair Lewis Harry Sinclair Lewis (February 7, 1885 – January 10, 1951) was an American novelist, short-story writer, and playwright. In 1930, he became the first writer from the United States to receive the Nobel Prize in Literature, which was awarded "for his vigorous and graphic art of description and his ability to create, with wit and humor, new types of characters." His works are known for their insightful and critical views of American capitalism and materialism between the wars. He is also respected for his strong characterizations of modern working women. H. L. Mencken wrote of him, "[If] there',
      'negative': 'Nobel Prize in Literature analyze its importance on potential future Nobel Prize in Literature laureates. Only Alice Munro (2009) has been awarded with both. The Neustadt International Prize for Literature is regarded as one of the most prestigious international literary prizes, often referred to as the American equivalent to the Nobel Prize. Like the Nobel or the Man Booker International Prize, it is awarded not for any one work, but for an entire body of work. It is frequently seen as an indicator of who may be awarded the Nobel Prize in Literature. Gabriel García Márquez (1972 Neustadt, 1982 Nobel), Czesław Miłosz (1978 Neustadt,'
    }
    ```
* Collection strategy: Reading the Trivia QA jsonl file in [Shitao/bge-m3-data](https://huggingface.co/datasets/Shitao/bge-m3-data) and taking only the first positive and the first negative of each entry.
* Deduplicated: No

### `triplet-all` subset

* Columns: "anchor", "positive", "negative"
* Column types: `str`, `str`, `str`
* Examples:
    ```python
    {
      'anchor': 'Which American-born Sinclair won the Nobel Prize for Literature in 1930?',
      'positive': 'Sinclair Lewis Sinclair Lewis Harry Sinclair Lewis (February 7, 1885 – January 10, 1951) was an American novelist, short-story writer, and playwright. In 1930, he became the first writer from the United States to receive the Nobel Prize in Literature, which was awarded "for his vigorous and graphic art of description and his ability to create, with wit and humor, new types of characters." His works are known for their insightful and critical views of American capitalism and materialism between the wars. He is also respected for his strong characterizations of modern working women. H. L. Mencken wrote of him, "[If] there',
      'negative': 'Nobel Prize in Literature analyze its importance on potential future Nobel Prize in Literature laureates. Only Alice Munro (2009) has been awarded with both. The Neustadt International Prize for Literature is regarded as one of the most prestigious international literary prizes, often referred to as the American equivalent to the Nobel Prize. Like the Nobel or the Man Booker International Prize, it is awarded not for any one work, but for an entire body of work. It is frequently seen as an indicator of who may be awarded the Nobel Prize in Literature. Gabriel García Márquez (1972 Neustadt, 1982 Nobel), Czesław Miłosz (1978 Neustadt,'
    }
    ```
* Collection strategy: Reading the Trivia QA jsonl file in [Shitao/bge-m3-data](https://huggingface.co/datasets/Shitao/bge-m3-data) and taking every negative, creating a separate sample for each one.
* Deduplicated: No
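
The two collection strategies described in the card can be illustrated with a minimal sketch. It is not the script that produced this dataset. It assumes each jsonl record has a `query` string plus `pos` and `neg` lists (field names assumed here, not stated in the card), and that `triplet-all` pairs every negative with the first positive, which the card does not spell out. The helper names `to_triplet` and `to_triplet_all` are hypothetical.

```python
import json
from typing import Dict, Iterable, Iterator, List


def iter_records(jsonl_lines: Iterable[str]) -> Iterator[Dict]:
    """Parse non-empty jsonl lines into dicts."""
    for line in jsonl_lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def to_triplet(record: Dict) -> List[Dict[str, str]]:
    """`triplet` strategy: keep only the first positive and the first negative."""
    if not record["pos"] or not record["neg"]:
        return []
    return [{
        "anchor": record["query"],      # assumed field name
        "positive": record["pos"][0],   # assumed field name
        "negative": record["neg"][0],   # assumed field name
    }]


def to_triplet_all(record: Dict) -> List[Dict[str, str]]:
    """`triplet-all` strategy: one sample per negative.

    Assumption: each negative is paired with the first positive.
    """
    if not record["pos"]:
        return []
    return [
        {"anchor": record["query"], "positive": record["pos"][0], "negative": neg}
        for neg in record["neg"]
    ]


# Toy input standing in for the Trivia QA jsonl file.
sample_lines = [
    json.dumps({"query": "q1", "pos": ["p1", "p2"], "neg": ["n1", "n2", "n3"]}),
]
records = list(iter_records(sample_lines))
triplets = [t for r in records for t in to_triplet(r)]
triplets_all = [t for r in records for t in to_triplet_all(r)]
print(len(triplets), len(triplets_all))
```

Under these assumptions, a record with three negatives yields one row in `triplet` and three rows in `triplet-all`. Neither subset is deduplicated, so repeated anchors are expected in `triplet-all`.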

Provider:
maas
Created:
2025-01-06