five

natural-questions

收藏
魔搭社区2025-12-04 更新2025-01-11 收录
下载链接:
https://modelscope.cn/datasets/sentence-transformers/natural-questions
下载链接
链接失效反馈
官方服务:
资源简介:
# Dataset Card for Natural Questions This dataset is a collection of question-answer pairs from the Natural Questions dataset. See [Natural Questions](https://ai.google.com/research/NaturalQuestions) for additional information. This dataset can be used directly with Sentence Transformers to train embedding models. ## Dataset Subsets ### `pair` subset * Columns: "question", "answer" * Column types: `str`, `str` * Examples: ```python { 'query': 'the si unit of the electric field is', 'answer': 'Electric field An electric field is a field that surrounds electric charges. It represents charges attracting or repelling other electric charges by exerting force.[1] [2] Mathematically the electric field is a vector field that associates to each point in space the force, called the Coulomb force, that would be experienced per unit of charge, by an infinitesimal test charge at that point.[3] The units of the electric field in the SI system are newtons per coulomb (N/C), or volts per meter (V/m). Electric fields are created by electric charges, and by time-varying magnetic fields. Electric fields are important in many areas of physics, and are exploited practically in electrical technology. On an atomic scale, the electric field is responsible for the attractive force between the atomic nucleus and electrons that holds atoms together, and the forces between atoms that cause chemical bonding. The electric field and the magnetic field together form the electromagnetic force, one of the four fundamental forces of nature.', } ``` * Collection strategy: Reading the NQ train dataset from [embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data). * Deduplified: No

# 自然问题(Natural Questions)数据集卡片 本数据集为源自自然问题(Natural Questions)数据集的问答对合集。如需获取更多详情,请参阅[自然问题(Natural Questions)](https://ai.google.com/research/NaturalQuestions)官方页面。 本数据集可直接结合Sentence Transformers(句子Transformer模型)训练嵌入模型。 ## 数据集子集 ### `pair` 子集 * 字段:"question"、"answer" * 字段类型:字符串类型(`str`)、字符串类型(`str`) * 示例: python { 'query': '电场的国际单位制单位是什么', 'answer': '电场 电场是环绕电荷的矢量场,通过施加力实现电荷间的吸引或排斥作用[1][2]。从数学定义而言,电场是一种为空间中每一点分配场量的矢量场:该场量为置于该点的无限小试探电荷所承受的单位电荷受力,即库仑力[3]。国际单位制(SI)下,电场的单位为牛每库仑(N/C)或伏每米(V/m)。电场由电荷与时变磁场激发而来。电场在物理学诸多领域均具有核心意义,并在电气技术中得到广泛实际应用。在原子尺度下,电场负责维系原子核与电子间的吸引力,从而构成原子的稳定结构;同时也是引发原子间化学键合的相互作用力来源。电场与磁场共同构成电磁力——自然界四大基本相互作用力之一。', } * 采集策略:从[embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data)中读取NQ(Natural Questions)训练数据集 * 去重处理:未执行
提供机构:
maas
创建时间:
2025-01-06
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集源自Natural Questions,包含问答对,专门用于训练Sentence Transformers的嵌入模型。数据集以'question'和'answer'列为结构,提供问题与对应详细答案的示例,遵循Apache 2.0开源协议。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作