math
收藏魔搭社区2026-01-06 更新2025-09-06 收录
下载链接:
https://modelscope.cn/datasets/camel-ai/math
下载链接
链接失效反馈官方服务:
资源简介:
# **CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society**
- **Github:** https://github.com/lightaime/camel
- **Website:** https://www.camel-ai.org/
- **Arxiv Paper:** https://arxiv.org/abs/2303.17760
## Dataset Summary
Math dataset is composed of 50K problem-solution pairs obtained using GPT-4. The dataset problem-solutions pairs generating from 25 math topics, 25 subtopics for each topic and 80 problems for each "topic,subtopic" pairs.
We provide the data in `math50k.zip`.
## Data Fields
**The data fields for files in `math50k.zip` are as follows:**
* `role_1`: assistant role
* `topic`: math topic
* `sub_topic`: math subtopic belonging to topic
* `message_1`: refers to the problem the assistant is asked to solve.
* `message_2`: refers to the solution provided by the assistant.
Note: File naming refers to {`topic_index`}\_{`subtopic_index`}\_{`problem_number`}.
**Download in python**
```
from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="camel-ai/math", repo_type="dataset", filename="math50k.zip",
local_dir="datasets/", local_dir_use_symlinks=False)
```
### Citation
```
@misc{li2023camel,
title={CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society},
author={Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem},
year={2023},
eprint={2303.17760},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
```
## Disclaimer:
This data was synthetically generated by GPT4 and might contain incorrect information. The dataset is there only for research purposes.
---
license: cc-by-nc-4.0
---
# **CAMEL:面向大规模语言模型社群思维探索的对话式智能体(Communicative Agents)**
- **Github:** https://github.com/lightaime/camel
- **官网:** https://www.camel-ai.org/
- **ArXiv 论文:** https://arxiv.org/abs/2303.17760
## 数据集概述
本数学数据集包含5万条由GPT-4生成的问题-解答对。该数据集的问题-解答对源自25个数学主题,每个主题下设25个子主题,且每个"主题-子主题"组合对应80道问题。我们已将数据打包至`math50k.zip`中供获取。
## 数据字段
`math50k.zip`内的文件包含以下数据字段:
* `role_1`:助手角色
* `topic`:数学主题
* `sub_topic`:隶属于该主题的数学子主题
* `message_1`:指代要求助手解决的问题
* `message_2`:指代助手提供的解答
注意:文件命名遵循格式:`{topic_index}_{subtopic_index}_{problem_number}`。
## Python 下载方式
from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="camel-ai/math", repo_type="dataset", filename="math50k.zip",
local_dir="datasets/", local_dir_use_symlinks=False)
## 引用格式
@misc{li2023camel,
title={CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society},
author={Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem},
year={2023},
eprint={2303.17760},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
## 免责声明
本数据由GPT-4合成生成,可能包含错误信息。本数据集仅用于科研用途。
---
许可协议:CC BY-NC 4.0
---
提供机构:
maas
创建时间:
2025-09-04



