amazon-qa
收藏魔搭社区2025-12-04 更新2025-01-11 收录
下载链接:
https://modelscope.cn/datasets/sentence-transformers/amazon-qa
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for Amazon QA
This dataset is a collection of question-answer pairs collected from Amazon QA. See [Amazon QA](https://github.com/amazonqa/amazonqa) for additional information.
This dataset can be used directly with Sentence Transformers to train embedding models.
## Dataset Subsets
### `pair` subset
* Columns: "query", "answer"
* Column types: `str`, `str`
* Examples:
```python
{
'query': 'What size are the tiles and how thick and what material?',
'answer': 'Tiles are 12" x 12", about 1/2 inch thick and made of plastic (not grippy/rubbery). Light weight, but sturdy. Easy to put together.'
}
```
* Collection strategy: Reading the Amazon QA dataset from [embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data).
* Deduplified: No
# Amazon QA 数据集卡片
本数据集为从Amazon QA(亚马逊问答数据集)中采集的问答对合集,更多详细信息可参阅[Amazon QA](https://github.com/amazonqa/amazonqa)项目页面。
本数据集可直接配合Sentence Transformers(句子转换器)用于嵌入模型的训练。
## 数据集子集
### `pair` 子集
* 列字段:`query`、`answer`
* 字段类型:均为字符串(`str`)
* 示例:
python
{
'query': '这些瓷砖尺寸多大、厚度多少、材质是什么?',
'answer': '瓷砖尺寸为12英寸×12英寸,厚度约1/2英寸,材质为塑料(非防滑橡胶质地)。重量轻盈但坚固耐用,组装简便。'
}
* 采集策略:从[embedding-training-data](https://huggingface.co/datasets/sentence-transformers/embedding-training-data)数据集加载Amazon QA数据集
* 去重处理:否
提供机构:
maas
创建时间:
2025-01-06



