loong
收藏魔搭社区2026-01-06 更新2025-09-13 收录
下载链接:
https://modelscope.cn/datasets/camel-ai/loong
下载链接
链接失效反馈官方服务:
资源简介:
# Additional Information
# Project Loong Dataset
This dataset is part of Project Loong, a collaborative effort to explore whether reasoning-capable models can bootstrap themselves from small, high-quality seed datasets.
## Dataset Description
This comprehensive collection contains problems across multiple domains, each split is determined by the domain.
### Available Domains:
### Advanced Math
Advanced mathematics problems including calculus, algebra, and number theory
### Advanced Physics
Physics problems covering mechanics, thermodynamics, and quantum physics
### Chemistry
Chemistry problems covering organic, inorganic, physical chemistry and chemical reactions
### Computational Biology
Biological computation and analysis problems
### Finance
Financial analysis and modeling problems
### Graph Discrete Math
Graph theory and discrete mathematics problems
### Logic
Logical reasoning and proof problems
### Mathematical Programming
Optimization and mathematical programming problems
### Security And Safety
Security and safety analysis problems
### Medicine
Medicine and biology problems
### Programming
Programming problems
## Data Structure
Each entry includes:
- A problem statement
- A detailed rationale explaining the solution approach
- The final answer or solution
- Metadata including problem ID and domain information
- Domain label
## Usage
```python
from datasets import load_dataset
# Load a specific domain's data
domain = "advanced_math" # or any other domain
dataset = load_dataset("camel-ai/loong", domain)
# Access specific splits
train_data = dataset["train"]
test_data = dataset["test"]
```
# 附加信息
# 龙计划(Project Loong)数据集
本数据集隶属于龙计划(Project Loong),该项目是一项旨在探索具备推理能力的模型能否依托小型高质量种子数据集实现自我迭代提升的协同研究工作。
## 数据集说明
本综合数据集涵盖多领域习题,数据集划分以领域为依据。
### 可用领域:
#### 高等数学(Advanced Math)
包含微积分、代数与数论相关的高等数学习题
#### 高等物理(Advanced Physics)
涵盖力学、热力学与量子物理相关的物理习题
#### 化学(Chemistry)
涉及有机化学、无机化学、物理化学以及化学反应相关的化学习题
#### 计算生物学(Computational Biology)
包含生物计算与分析相关的习题
#### 金融学(Finance)
涵盖金融分析与建模相关的习题
#### 图论与离散数学(Graph Discrete Math)
涉及图论与离散数学相关的习题
#### 逻辑学(Logic)
包含逻辑推理与证明相关的习题
#### 数学规划(Mathematical Programming)
涵盖优化与数学规划相关的习题
#### 安全领域(Security And Safety)
涉及安全分析相关的习题
#### 医学领域(Medicine)
包含医学与生物学相关的习题
#### 编程领域(Programming)
涉及编程相关的习题
## 数据结构
每条数据条目包含以下内容:
- 问题陈述
- 用于阐释解题路径的详细原理说明
- 最终答案或解题方案
- 元数据,包含问题ID与领域信息
- 领域标签
## 使用方法
python
from datasets import load_dataset
# 加载指定领域的数据
domain = "advanced_math" # 或其他任意领域
dataset = load_dataset("camel-ai/loong", domain)
# 访问指定划分集
train_data = dataset["train"]
test_data = dataset["test"]
提供机构:
maas
创建时间:
2025-09-04



