five

synthetic-gsm8k-evolutionary-405b

收藏
魔搭社区2025-11-27 更新2025-05-24 收录
下载链接:
https://modelscope.cn/datasets/gretelai/synthetic-gsm8k-evolutionary-405b
下载链接
链接失效反馈
官方服务:
资源简介:
# gretelai/synthetic-gsm8k-evolutionary-405b This dataset is a synthetically generated version inspired by the GSM8K dataset, created entirely using **Gretel Navigator with meta-llama/Meta-Llama-3.1-405B** as the agent LLM. It contains Grade School-level reasoning tasks with step-by-step solutions, focusing on multi-step reasoning problems. ## Key Features: - **Synthetically Generated**: Built using **Gretel Navigator**, leveraging evolutionary approach for diversity to create both the `question` and `answer` fields. - **Contextual tags** ensured diversity, while **LLM-as-a-judge** was used to validate the quality of the outputs. All calculations were rigorously verified using the Python `sympy` library for accuracy. - **Train & Test sets**: A 600-example test set is stratified by topic and difficulty. - **Diverse Real-World Contexts**: Covers a broad range of topics, ensuring that models are trained on questions reflective of real-world scenarios. - **Categorized by Difficulty**: Problems are organized into three difficulty levels—medium, hard, and very hard—allowing for more granular evaluation. ## Dataset Column Descriptions * `difficulty`: The difficulty level of the problem. * `difficulty_description`: Description of the problem's complexity and required reasoning. * `topic`: The topic or subject of the problem. * `context`: The context in which the problem is set. * `age_group`: The target age or grade level for the problem. * `culture`: The cultural background or setting reflected in the problem. * `question`: The problem or question presented to the model. * `answer`: The final solution to the problem. ## Dataset Statistics and Distribution ![meta-llama/Meta-Llama-3.1-405B Dataset Distribution](images/synthetic-gsm8k-evolutionary-405b_analysis.png) ## Gretel Navigator (selected model: meta-llama/Meta-Llama-3.1-405B) Dataset - Distribution Analysis ### Topic Distribution | topic | Train | Test | |:-------------------------|--------:|-------:| | algebra | 213 | 25 | | arithmetic | 207 | 24 | | compound interest | 167 | 20 | | data interpretation | 224 | 27 | | exponential growth/decay | 179 | 21 | | fractions | 192 | 22 | | geometry | 207 | 24 | | optimization | 173 | 20 | | percentages | 238 | 29 | | polynomials | 157 | 19 | | probability | 183 | 21 | | proportions | 209 | 24 | | ratios | 203 | 24 | ### Difficulty Distribution | difficulty | Train | Test | |:-------------|--------:|-------:| | hard | 843 | 99 | | medium | 969 | 113 | | very hard | 740 | 88 | ## Citation and Usage If you use this dataset in your research or applications, please cite it as: ``` @dataset{gretelai_gsm8k_synthetic, author = {Gretel AI}, title = {Synthetically Generated Reasoning Dataset (GSM8k-inspired) with enhanced diversity using Gretel Navigator and meta-llama/Meta-Llama-3.1-405B}, year = {2024}, month = {9}, publisher = {Gretel}, howpublished = {https://huggingface.co/gretelai/synthetic-gsm8k-evolutionary-405b}, } ``` For questions, issues, or additional information, please visit the dataset repository on Hugging Face or contact Gretel AI.

# gretelai/synthetic-gsm8k-evolutionary-405b 本数据集为受GSM8K数据集启发生成的合成版本,完全以**Gretel Navigator(搭载meta-llama/Meta-Llama-3.1-405B)**作为智能体大语言模型(agent LLM)构建完成。其包含小学学段的推理任务与分步解答,重点聚焦多步推理类问题。 ## 核心特性 - **合成生成**:依托**Gretel Navigator**构建,采用进化方法提升数据多样性,同步生成`question`(问题)与`answer`(解答)字段。 - **上下文标签**保障了数据多样性,同时采用**法官式大语言模型(LLM-as-a-judge)**验证输出质量。所有计算结果均通过Python的`sympy`库进行严格校验,以确保准确性。 - **训练集与测试集**:测试集包含600个样本,按主题与难度进行分层抽样。 - **丰富真实场景**:覆盖广泛的主题范畴,确保模型在贴合真实场景的问题上完成训练。 - **按难度分级**:问题被划分为中等(medium)、困难(hard)与极难(very hard)三个难度层级,支持更精细化的模型评估。 ## 数据集字段说明 * `difficulty`:问题的难度等级。 * `difficulty_description`:对问题复杂度与所需推理能力的说明。 * `topic`:问题所属的主题或学科。 * `context`:问题的设定背景。 * `age_group`:问题对应的目标年龄段或年级水平。 * `culture`:问题所体现的文化背景或设定场景。 * `question`:向模型提出的问题或任务。 * `answer`:问题的最终解答。 ## 数据集统计与分布 ![meta-llama/Meta-Llama-3.1-405B 数据集分布](images/synthetic-gsm8k-evolutionary-405b_analysis.png) ## 以Gretel Navigator(选用模型:meta-llama/Meta-Llama-3.1-405B)构建的数据集——分布分析 ### 主题分布 | 主题 | 训练集样本数 | 测试集样本数 | |:-------------------------|--------:|-------:| | 代数 | 213 | 25 | | 算术 | 207 | 24 | | 复利 | 167 | 20 | | 数据解读 | 224 | 27 | | 指数增长/衰减 | 179 | 21 | | 分数 | 192 | 22 | | 几何 | 207 | 24 | | 优化问题 | 173 | 20 | | 百分比 | 238 | 29 | | 多项式 | 157 | 19 | | 概率 | 183 | 21 | | 比例 | 209 | 24 | | 比率 | 203 | 24 | ### 难度分布 | 难度等级 | 训练集样本数 | 测试集样本数 | |:-------------|--------:|-------:| | 困难(hard) | 843 | 99 | | 中等(medium) | 969 | 113 | | 极难(very hard) | 740 | 88 | ## 引用与使用说明 若您在研究或应用中使用本数据集,请按以下格式引用: @dataset{gretelai_gsm8k_synthetic, author = {Gretel AI}, title = {Synthetically Generated Reasoning Dataset (GSM8k-inspired) with enhanced diversity using Gretel Navigator and meta-llama/Meta-Llama-3.1-405B}, year = {2024}, month = {9}, publisher = {Gretel}, howpublished = {https://huggingface.co/gretelai/synthetic-gsm8k-evolutionary-405b}, } 如有疑问、问题或需获取更多信息,请访问Hugging Face上的数据集仓库或联系Gretel AI.
提供机构:
maas
创建时间:
2025-05-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作