five

botbot-ai/physics-ptbr

收藏
Hugging Face2024-03-05 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/botbot-ai/physics-ptbr
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-4.0 language: - pt tags: - instruction-finetuning pretty_name: CAMEL Physics PTBR task_categories: - text-generation --- ## Tradução do Camel Pyysics dataset para Portuguese (PT-BR) usando NLLB 3.3b. # **CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society** - **Github:** https://github.com/lightaime/camel - **Website:** https://www.camel-ai.org/ - **Arxiv Paper:** https://arxiv.org/abs/2303.17760 ## Dataset Summary Physics dataset is composed of 20K problem-solution pairs obtained using gpt-4. The dataset problem-solutions pairs generating from 25 physics topics, 25 subtopics for each topic and 32 problems for each "topic,subtopic" pairs. We provide the data in `physics.zip`. ## Data Fields **The data fields for files in `physics.zip` are as follows:** * `role_1`: assistant role * `topic`: physics topic * `sub_topic`: physics subtopic belonging to topic * `message_1`: refers to the problem the assistant is asked to solve. * `message_2`: refers to the solution provided by the assistant. **Download in python** ``` from huggingface_hub import hf_hub_download hf_hub_download(repo_id="camel-ai/physics", repo_type="dataset", filename="physics.zip", local_dir="datasets/", local_dir_use_symlinks=False) ``` ### Citation ``` @misc{li2023camel, title={CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society}, author={Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem}, year={2023}, eprint={2303.17760}, archivePrefix={arXiv}, primaryClass={cs.AI} } ``` ## Disclaimer: This data was synthetically generated by GPT4 and might contain incorrect information. The dataset is there only for research purposes. --- license: cc-by-nc-4.0 ---
提供机构:
botbot-ai
原始信息汇总

数据集概述

数据集名称

CAMEL Physics PTBR

语言

  • 葡萄牙语 (PT-BR)

标签

  • instruction-finetuning

任务类别

  • text-generation

数据集摘要

Physics 数据集由 20K 个问题-解决方案对组成,这些对是通过使用 gpt-4 生成的。数据集中的问题-解决方案对来自 25 个物理主题,每个主题有 25 个子主题,每个“主题,子主题”对有 32 个问题。

数据字段

数据文件 physics.zip 中的字段如下:

  • role_1: 助手角色
  • topic: 物理主题
  • sub_topic: 属于主题的物理子主题
  • message_1: 指助手被要求解决的问题
  • message_2: 指助手提供的解决方案

下载方式

python from huggingface_hub import hf_hub_download hf_hub_download(repo_id="camel-ai/physics", repo_type="dataset", filename="physics.zip", local_dir="datasets/", local_dir_use_symlinks=False)

免责声明

该数据是 GPT4 合成生成的,可能包含不正确的信息。该数据集仅用于研究目的。

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作