botbot-ai/physics-ptbr
收藏Hugging Face2024-03-05 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/botbot-ai/physics-ptbr
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-4.0
language:
- pt
tags:
- instruction-finetuning
pretty_name: CAMEL Physics PTBR
task_categories:
- text-generation
---
## Tradução do Camel Pyysics dataset para Portuguese (PT-BR) usando NLLB 3.3b.
# **CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society**
- **Github:** https://github.com/lightaime/camel
- **Website:** https://www.camel-ai.org/
- **Arxiv Paper:** https://arxiv.org/abs/2303.17760
## Dataset Summary
Physics dataset is composed of 20K problem-solution pairs obtained using gpt-4. The dataset problem-solutions pairs generating from 25 physics topics, 25 subtopics for each topic and 32 problems for each "topic,subtopic" pairs.
We provide the data in `physics.zip`.
## Data Fields
**The data fields for files in `physics.zip` are as follows:**
* `role_1`: assistant role
* `topic`: physics topic
* `sub_topic`: physics subtopic belonging to topic
* `message_1`: refers to the problem the assistant is asked to solve.
* `message_2`: refers to the solution provided by the assistant.
**Download in python**
```
from huggingface_hub import hf_hub_download
hf_hub_download(repo_id="camel-ai/physics", repo_type="dataset", filename="physics.zip",
local_dir="datasets/", local_dir_use_symlinks=False)
```
### Citation
```
@misc{li2023camel,
title={CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society},
author={Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem},
year={2023},
eprint={2303.17760},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
```
## Disclaimer:
This data was synthetically generated by GPT4 and might contain incorrect information. The dataset is there only for research purposes.
---
license: cc-by-nc-4.0
---
提供机构:
botbot-ai
原始信息汇总
数据集概述
数据集名称
CAMEL Physics PTBR
语言
- 葡萄牙语 (PT-BR)
标签
- instruction-finetuning
任务类别
- text-generation
数据集摘要
Physics 数据集由 20K 个问题-解决方案对组成,这些对是通过使用 gpt-4 生成的。数据集中的问题-解决方案对来自 25 个物理主题,每个主题有 25 个子主题,每个“主题,子主题”对有 32 个问题。
数据字段
数据文件 physics.zip 中的字段如下:
role_1: 助手角色topic: 物理主题sub_topic: 属于主题的物理子主题message_1: 指助手被要求解决的问题message_2: 指助手提供的解决方案
下载方式
python from huggingface_hub import hf_hub_download hf_hub_download(repo_id="camel-ai/physics", repo_type="dataset", filename="physics.zip", local_dir="datasets/", local_dir_use_symlinks=False)
免责声明
该数据是 GPT4 合成生成的,可能包含不正确的信息。该数据集仅用于研究目的。



