five

biology

收藏
魔搭社区2026-01-09 更新2025-09-06 收录
下载链接:
https://modelscope.cn/datasets/camel-ai/biology
下载链接
链接失效反馈
官方服务:
资源简介:
# **CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society** - **Github:** https://github.com/lightaime/camel - **Website:** https://www.camel-ai.org/ - **Arxiv Paper:** https://arxiv.org/abs/2303.17760 ## Dataset Summary Biology dataset is composed of 20K problem-solution pairs obtained using gpt-4. The dataset problem-solutions pairs generating from 25 biology topics, 25 subtopics for each topic and 32 problems for each "topic,subtopic" pairs. We provide the data in `biology.zip`. ## Data Fields **The data fields for files in `biology.zip` are as follows:** * `role_1`: assistant role * `topic`: biology topic * `sub_topic`: biology subtopic belonging to topic * `message_1`: refers to the problem the assistant is asked to solve. * `message_2`: refers to the solution provided by the assistant. **Download in python** ``` from huggingface_hub import hf_hub_download hf_hub_download(repo_id="camel-ai/biology", repo_type="dataset", filename="biology.zip", local_dir="datasets/", local_dir_use_symlinks=False) ``` ### Citation ``` @misc{li2023camel, title={CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society}, author={Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem}, year={2023}, eprint={2303.17760}, archivePrefix={arXiv}, primaryClass={cs.AI} } ``` ## Disclaimer: This data was synthetically generated by GPT4 and might contain incorrect information. The dataset is there only for research purposes. --- license: cc-by-nc-4.0 ---

# **CAMEL:面向大规模语言模型(Large Language Model,LLM)社群心智探索的对话式AI智能体(Communicative Agents)** - **GitHub 仓库地址**:https://github.com/lightaime/camel - **官方网站**:https://www.camel-ai.org/ - **ArXiv 论文链接**:https://arxiv.org/abs/2303.17760 ## 数据集概述 本生物学数据集包含20000条由GPT-4(Generative Pre-trained Transformer 4)生成的问题-解决方案对。该数据集的问题-解决方案对源自25个生物学主题,每个主题下设25个子主题,且每一组“主题-子主题”对应32个问题。 我们已将数据打包至`biology.zip`文件中。 ## 数据字段 `biology.zip` 内文件的数据字段说明如下: * `role_1`:助手角色 * `topic`:生物学主题 * `sub_topic`:隶属于该主题的生物学子主题 * `message_1`:指代要求助手解决的问题 * `message_2`:指代助手提供的解决方案 ## Python 下载示例 python from huggingface_hub import hf_hub_download hf_hub_download(repo_id="camel-ai/biology", repo_type="dataset", filename="biology.zip", local_dir="datasets/", local_dir_use_symlinks=False) ## 引用格式 bibtex @misc{li2023camel, title={CAMEL: 面向大规模语言模型社群心智探索的对话式AI智能体}, author={Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem}, year={2023}, eprint={2303.17760}, archivePrefix={arXiv}, primaryClass={cs.AI} } ## 免责声明 本数据集由GPT-4合成生成,可能包含错误信息,仅用于科研用途。 --- 许可证:CC BY-NC 4.0 ---
提供机构:
maas
创建时间:
2025-09-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作