gooaq
收藏魔搭社区2025-11-07 更新2025-01-11 收录
下载链接:
https://modelscope.cn/datasets/sentence-transformers/gooaq
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for GooAQ
This dataset is a collection of question-answer pairs, collected from Google. See [GooAQ](https://github.com/allenai/gooaq) for additional information.
This dataset can be used directly with Sentence Transformers to train embedding models.
## Dataset Subsets
### `pair` subset
* Columns: "question", "answer"
* Column types: `str`, `str`
* Examples:
```python
{
'question': 'is toprol xl the same as metoprolol?',
'answer': 'Metoprolol succinate is also known by the brand name Toprol XL. It is the extended-release form of metoprolol. Metoprolol succinate is approved to treat high blood pressure, chronic chest pain, and congestive heart failure.',
}
```
* Collection strategy: Reading the GooAQ dataset from [embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data).
* Deduplified: No
# GooAQ 数据集卡片
本数据集为从谷歌(Google)采集的问答对集合,更多信息可参阅[GooAQ](https://github.com/allenai/gooaq)。
本数据集可直接配合Sentence Transformers用于嵌入模型的训练。
## 数据集子集
### `pair` 子集
* 列名:"question", "answer"
* 列数据类型:`str`, `str`
* 示例:
python
{
'question': 'is toprol xl the same as metoprolol?',
'answer': 'Metoprolol succinate is also known by the brand name Toprol XL. It is the extended-release form of metoprolol. Metoprolol succinate is approved to treat high blood pressure, chronic chest pain, and congestive heart failure.',
}
* 采集策略:从[embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data)数据集加载GooAQ数据集。
* 去重:否
提供机构:
maas
创建时间:
2025-01-06



