five

paq

收藏
魔搭社区2025-12-04 更新2025-01-11 收录
下载链接:
https://modelscope.cn/datasets/sentence-transformers/paq
下载链接
链接失效反馈
官方服务:
资源简介:
# Dataset Card for PAQ This dataset contains query-answer pairs from the [PAQ dataset](https://github.com/facebookresearch/PAQ), formatted to be easily used with Sentence Transformers to train embedding models. ## Dataset Subsets ### `pair` subset * Columns: "query", "answer" * Column types: `str`, `str` * Examples: ```python { 'query': 'in which year was footballer paul ince born', 'answer': 'Paul Ince Paul Emerson Carlyle Ince (; born 21 October 1967) is an English football manager and a former professional footballer who played as a midfielder from 1982 to 2007. Born in Ilford, London, Ince spent the majority of his playing career at the highest level; after leaving West Ham United he joined Manchester United where he played in the Premier League. After two years in Serie A with Internazionale he returned to England to play in the top flight for Liverpool, Middlesbrough and Wolverhampton Wanderers. After a spell as player-coach of Swindon Town, he retired from playing while player-manager', } ``` * Collection strategy: Reading the PAQ dataset from [embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data). * Deduplified: No

# PAQ 数据集卡片 本数据集包含来自[PAQ数据集](https://github.com/facebookresearch/PAQ)的查询-问答对,经格式化后可直接配合Sentence Transformers(句子Transformer)用于嵌入模型的训练。 ## 数据集子集 ### `pair` 子集 * 列名:"query"(查询)、"answer"(答案) * 列类型:均为字符串(str) * 示例: python { 'query': '足球运动员保罗·因斯出生于哪一年', 'answer': '保罗·因斯(Paul Emerson Carlyle Ince,1967年10月21日出生)是英格兰足球教练,前职业足球运动员,球员时代司职中场,职业生涯跨度为1982年至2007年。他出生于伦敦伊尔福德,职业生涯大部分时间都在顶级联赛征战:离开西汉姆联后,他加盟曼联并随队征战英超联赛。在国际米兰效力意甲两年后,他重返英格兰,先后为利物浦、米德尔斯堡以及伍尔弗汉普顿流浪者队征战顶级联赛。在斯温登镇担任球员教练一职后,他以球员兼教练的身份正式退役。', } * 数据采集策略:从[embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data)读取PAQ数据集。 * 去重情况:否
提供机构:
maas
创建时间:
2025-01-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作